ERNIE-Image The serieas of image generation models, including text2img、img2img. baidu/ERNIE-Image Text-to-Image • Updated Apr 17 • 71.7k • • 661 baidu/ERNIE-Image-Turbo Text-to-Image • Updated Apr 17 • 4.44k • • 399 baidu/ERNIE-Image-Aes 8B • Updated May 20 • 1.31k • 15 baidu/ERIA-1K-Benchmark Preview • Updated May 20 • 123 • 4
Qianfan-VL Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios. baidu/Qianfan-OCR Image-Text-to-Text • 5B • Updated Apr 29 • 258k • 1.19k baidu/Qianfan-VL-70B Image-Text-to-Text • 72B • Updated Apr 19 • 25 • 39 baidu/Qianfan-VL-8B Image-Text-to-Text • 9B • Updated Apr 19 • 1.97k • 41 baidu/Qianfan-VL-3B Image-Text-to-Text • 4B • Updated Sep 19, 2025 • 84 • 30
ERNIE 4.5 collection of ERNIE 4.5 models. baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated Mar 6 • 123 • 541 baidu/ERNIE-4.5-21B-A3B-Thinking Text Generation • 22B • Updated Nov 26, 2025 • 15.1k • 786 baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle Image-Text-to-Text • 424B • Updated Aug 19, 2025 • 21 • 68 baidu/ERNIE-4.5-VL-424B-A47B-Base-PT Image-Text-to-Text • 424B • Updated Jan 16 • 208 • • 83
ERNIE-Image The serieas of image generation models, including text2img、img2img. baidu/ERNIE-Image Text-to-Image • Updated Apr 17 • 71.7k • • 661 baidu/ERNIE-Image-Turbo Text-to-Image • Updated Apr 17 • 4.44k • • 399 baidu/ERNIE-Image-Aes 8B • Updated May 20 • 1.31k • 15 baidu/ERIA-1K-Benchmark Preview • Updated May 20 • 123 • 4
ERNIE 4.5 collection of ERNIE 4.5 models. baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated Mar 6 • 123 • 541 baidu/ERNIE-4.5-21B-A3B-Thinking Text Generation • 22B • Updated Nov 26, 2025 • 15.1k • 786 baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle Image-Text-to-Text • 424B • Updated Aug 19, 2025 • 21 • 68 baidu/ERNIE-4.5-VL-424B-A47B-Base-PT Image-Text-to-Text • 424B • Updated Jan 16 • 208 • • 83
Qianfan-VL Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios. baidu/Qianfan-OCR Image-Text-to-Text • 5B • Updated Apr 29 • 258k • 1.19k baidu/Qianfan-VL-70B Image-Text-to-Text • 72B • Updated Apr 19 • 25 • 39 baidu/Qianfan-VL-8B Image-Text-to-Text • 9B • Updated Apr 19 • 1.97k • 41 baidu/Qianfan-VL-3B Image-Text-to-Text • 4B • Updated Sep 19, 2025 • 84 • 30