-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 83 -
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper • 2408.02657 • Published • 35 -
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
Paper • 2508.10711 • Published • 146 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 154
Charles Cai
charlescai2016
AI & ML interests
None yet
Recent Activity
liked a model about 13 hours ago
nvidia/nemotron-speech-streaming-en-0.6b liked a model about 13 hours ago
mistralai/Voxtral-4B-TTS-2603