Markus Krimmel

Markus28

New activity in jinaai/jina-bert-flash-implementation almost 2 years ago

Porting v2 models to flash attention

#15 opened almost 2 years ago by

feat: updated activation checkpointing

#14 opened almost 2 years ago by

feat: Allow LoRA to be merged into weights

#12 opened almost 2 years ago by

fix: remove cleaving

#13 opened almost 2 years ago by

feat: cleave off layers from encoder

#11 opened almost 2 years ago by

clean up embeddings.py

#7 opened almost 2 years ago by

support-multiple-task-ids

#5 opened almost 2 years ago by michael-guenther

New activity in jinaai/jina-bert-flash-implementation about 2 years ago

feat: implement task type embeddings

#1 opened about 2 years ago by

New activity in jinaai/jina-bert-v2-qk-devlin-norm-1e-2 about 2 years ago

Use attention dropout during training

#1 opened about 2 years ago by

New activity in jinaai/jina-bert-implementation about 2 years ago

Use attention dropout during training

#10 opened about 2 years ago by

New activity in jinaai/jina-bert-implementation over 2 years ago

Fix sorting heuristic

#3 opened over 2 years ago by