ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper • 2105.13626 • Published May 28, 2021 • 5
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 95
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated about 5 hours ago • 17
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 52
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 88
PyLate 🐕 Collection State-of-the-art late interaction models trained using PyLate • 5 items • Updated about 5 hours ago • 4