Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning Paper • 2504.11354 • Published Apr 15, 2025 • 6
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10, 2025 • 47
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 254
RAFT: A Real-World Few-Shot Text Classification Benchmark Paper • 2109.14076 • Published Sep 28, 2021 • 2
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code Paper • 2206.11249 • Published Jun 22, 2022
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages Paper • 2303.12582 • Published Mar 22, 2023 • 21
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper • 2210.01970 • Published Sep 30, 2022 • 13