Kreshnik Hasanaj
Kreshnik
AI & ML interests
None yet
Recent Activity
liked a model 8 days ago
NemoStation/Marlin-2B liked a model 14 days ago
Datadog/Toto-2.0-2.5B upvoted a paper about 1 month ago
Video Analysis and Generation via a Semantic Progress FunctionOrganizations
None yet
OCR
Language
Voice
-
microsoft/VibeVoice-1.5B
Text-to-Speech • 3B • Updated • 112k • 2.38k - Configuration errorFeatured445
FastVLM WebGPU
🍎445Real-time video captioning powered by FastVLM
-
openbmb/VoxCPM-0.5B
Text-to-Speech • Updated • 9.85k • 799 - Running on CPU Upgrade84
MiMo-Audio-Chat
💬84Chat with Xiaomi MiMo-Audio using voice
Model training
Structured Output
music
OCR
3D
Language
Image
Voice
-
microsoft/VibeVoice-1.5B
Text-to-Speech • 3B • Updated • 112k • 2.38k - Configuration errorFeatured445
FastVLM WebGPU
🍎445Real-time video captioning powered by FastVLM
-
openbmb/VoxCPM-0.5B
Text-to-Speech • Updated • 9.85k • 799 - Running on CPU Upgrade84
MiMo-Audio-Chat
💬84Chat with Xiaomi MiMo-Audio using voice
Papers
Model training