Ai-Model
-
Image-Text-to-Text β’ 25B β’ Updated β’ 63.8k β’ 637 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition β’ Updated β’ 5.03M β’ β’ 2.9k -
SWivid/F5-TTS
Text-to-Speech β’ Updated β’ 699k β’ 1.16k -
D-Edit
π84 -
FacePoke
π2.21kImport a portrait, click to move the head!
-
Expression Editor
π¨1.64kQuickly edit the expression of a face
-
F5-TTS
π£2.84kF5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
FLUX.1 [dev]
π₯9.42kGenerate images from text prompts with FLUX.1 diffusion model
-
Face Recognition SDK
π’234Face Recognition
-
Open NotebookLM
π1.09kPersonalised Podcasts For All - Available in 13 Languages
-
PMRF
πΌ313A gradio demo for Posterior-Mean Rectified Flow (PMRF)
-
stabilityai/stable-diffusion-3.5-large
Text-to-Image β’ Updated β’ 75.6k β’ β’ 3.39k -
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 9.41k β’ β’ 1.32k -
Freepik/flux.1-lite-8B-alpha
Text-to-Image β’ Updated β’ 338 β’ 427 -
rhymes-ai/Allegro
Text-to-Video β’ Updated β’ 166 β’ 264 -
CohereLabs/aya-expanse-8b
Text Generation β’ 8B β’ Updated β’ 15.3k β’ 424 -
deepseek-ai/Janus-1.3B
Any-to-Any β’ 2B β’ Updated β’ 4.29k β’ 594 -
Pangea
π50A Fully Open Multilingual Multimodal LLM for 39 Languages
-
Etched/oasis-500m
Updated β’ 71 β’ 490 -
microsoft/OmniParser
Image-Text-to-Text β’ Updated β’ 317 β’ 1.71k -
OuteAI/OuteTTS-0.1-350M
Text-to-Speech β’ Updated β’ 231 β’ 302 -
tencent/Tencent-Hunyuan-Large
Text Generation β’ Updated β’ 453 β’ 618 -
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation β’ 71B β’ Updated β’ 11k β’ 2.06k -
tencent/HunyuanVideo
Text-to-Video β’ Updated β’ 1.01k β’ β’ 2.14k -
zai-org/CogVideoX-5b
Text-to-Video β’ Updated β’ 30.2k β’ β’ 668 -
LanguageBind/Open-Sora-Plan-v1.2.0
Updated β’ 1 β’ 47 -
microsoft/phi-4
Text Generation β’ Updated β’ 707k β’ 2.23k -
TRELLIS
π’4.78kScalable and Versatile 3D Generation from images
-
Search Your Face Online
π835Track your online presence with reverse face search
-
Kolors Virtual Try-On
π10kGenerate a virtual tryβon image of a person wearing a garment
-
DeepSeek-R1 WebGPU
π§554Next-generation reasoning model that runs locally in-browser
-
AnyCoder
π3.18kGenerate AI-powered code for HTML, React, Streamlit, and more
-
tencent/Hunyuan3D-2
Image-to-3D β’ Updated β’ 93.7k β’ 1.72k -
openbmb/MiniCPM-o-2_6
Any-to-Any β’ 9B β’ Updated β’ 111k β’ 1.29k -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation β’ Updated β’ 133k β’ β’ 761 -
Magic Face
π€ͺ244Transform Your Face Into Legendary Characters!
-
Llasa 3b Tts
π₯313Zero Shot voice cloning with llasa 3b (Unofficial Demo)
-
mistralai/Mistral-Small-24B-Instruct-2501
Updated β’ 107k β’ 950 -
Pyramid Flow
β±669Generate videos from text prompts and optional images
-
microsoft/OmniParser-v2.0
Updated β’ 973 β’ 1.3k -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech β’ Updated β’ 2.02k β’ 1.1k -
agentica-org/DeepScaleR-1.5B-Preview
Text Generation β’ 2B β’ Updated β’ 14.3k β’ 575 -
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text β’ 132B β’ Updated β’ 136 β’ 458 -
hexgrad/Kokoro-82M
Text-to-Speech β’ Updated β’ 9.54M β’ β’ 5.89k -
black-forest-labs/FLUX.1-dev
Text-to-Image β’ Updated β’ 678k β’ β’ 12.6k -
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation β’ 8B β’ Updated β’ 234 β’ β’ 352