Melvin Vivas PRO
AI & ML interests
Recent Activity
Organizations
-
Running on CPU Upgrade986
Open VLM Leaderboard
🌎986VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured409
DeepSeek OCR 2 Demo
🚀409Try out DeepSeek-OCR-2 on your PDFs or images
-
Running on ZeroMCP61
Multimodal OCR3
🌖61nanonets2-ocr / chandra-ocr / dots.ocr / olm-ocr2
-
Qwen/Qwen3-VL-30B-A3B-Instruct
Image-Text-to-Text • 31B • Updated • 822k • • 524
-
Running1
Qwen-3-VL-8B OCR Receipts
🚀1structured data parser from receipt images
-
RunningFeatured248
Qwen3 Omni Demo
⚡248Chat with AI via text, voice, image or video; get spoken replies
-
Running on ZeroFeatured113
VLM Object Understanding
🦀113Explore object detection, visual grounding, keypoint Detecti
-
Running2
Dataset Card Drafter
😻2Create dataset descriptions and open PRs automatically
-
Running on ZeroFeatured169
VibeVoice-Realtime-0.5B
🐨169Generate natural speech from text with customizable voices
-
microsoft/VibeVoice-1.5B
Text-to-Speech • 3B • Updated • 272k • 2.21k -
RunningFeatured380
Qwen3 TTS Demo
🚀380Generate natural speech from text with many voices
-
mradermacher/Qwen3-1.7B-Multilingual-TTS-GGUF
2B • Updated • 4.18k • 4
-
Running1
Qwen-3-VL-8B OCR Receipts
🚀1structured data parser from receipt images
-
RunningFeatured248
Qwen3 Omni Demo
⚡248Chat with AI via text, voice, image or video; get spoken replies
-
Running on ZeroFeatured113
VLM Object Understanding
🦀113Explore object detection, visual grounding, keypoint Detecti
-
Running2
Dataset Card Drafter
😻2Create dataset descriptions and open PRs automatically
-
Running on CPU Upgrade986
Open VLM Leaderboard
🌎986VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured409
DeepSeek OCR 2 Demo
🚀409Try out DeepSeek-OCR-2 on your PDFs or images
-
Running on ZeroMCP61
Multimodal OCR3
🌖61nanonets2-ocr / chandra-ocr / dots.ocr / olm-ocr2
-
Qwen/Qwen3-VL-30B-A3B-Instruct
Image-Text-to-Text • 31B • Updated • 822k • • 524
-
Running on ZeroFeatured169
VibeVoice-Realtime-0.5B
🐨169Generate natural speech from text with customizable voices
-
microsoft/VibeVoice-1.5B
Text-to-Speech • 3B • Updated • 272k • 2.21k -
RunningFeatured380
Qwen3 TTS Demo
🚀380Generate natural speech from text with many voices
-
mradermacher/Qwen3-1.7B-Multilingual-TTS-GGUF
2B • Updated • 4.18k • 4