WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 6 days ago • 17
Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 12 days ago • 56
Falcon Perception Collection Falcon-Perception and Falcon-OCR: early-fusion, natively multimodal, dense autoregressive Transformer models. • 5 items • Updated 13 days ago • 14
Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 27 days ago • 16
Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 118
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 17 days ago • 864
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 8 days ago • 17
Article Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline Mar 13 • 40
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 19 days ago • 49
Article How I contributed a new model to the Transformers library using Codex 20 days ago • 46
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated 5 days ago • 24