-
PretrainZero: Reinforcement Active Pretraining
Paper • 2512.03442 • Published • 49 -
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Paper • 2512.03383 • Published • 5 -
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 126 -
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
Paper • 2511.18890 • Published • 35
Flavius Burca
flaviusburca
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago
surogate/Qwen3.5-0.8B-NVFP4 updated a model 1 day ago
surogate/Qwen3.5-2B-NVFP4 updated a model 1 day ago
surogate/Qwen3.5-4B-NVFP4