Markus Krimmel
Markus28
AI & ML interests
None yet
Organizations
None yet
Porting v2 models to flash attention
🔥 1
#15 opened almost 2 years ago
by
Markus28
feat: updated activation checkpointing
#14 opened almost 2 years ago
by
Markus28
feat: Allow LoRA to be merged into weights
#12 opened almost 2 years ago
by
Markus28
fix: remove cleaving
#13 opened almost 2 years ago
by
Markus28
feat: cleave off layers from encoder
#11 opened almost 2 years ago
by
Markus28
clean up embeddings.py
#7 opened almost 2 years ago
by
bwang0911
support-multiple-task-ids
#5 opened almost 2 years ago
by
michael-guenther
feat: implement task type embeddings
#1 opened about 2 years ago
by
Markus28
Use attention dropout during training
1
#1 opened about 2 years ago
by
Markus28
Use attention dropout during training
2
#10 opened about 2 years ago
by
Markus28
Fix sorting heuristic
1
#3 opened over 2 years ago
by
Markus28