In a Training Loop 🔄

3 55

Alain Galvan

alaingalvan

https://alain.xyz

AI & ML interests

Ray tracing denoisers, autoencoders.

Recent Activity

liked a Space about 1 month ago

microsoft/VibeVoice-ASR

upvoted a collection about 1 month ago

VibeVoice

liked a Space about 1 month ago

microsoft/TRELLIS.2

View all activity

Organizations

None yet

liked a Space about 1 month ago

VibeVoice ASR

🌍

Official Playground of Microsoft VibeVoice-ASR

upvoted a collection about 1 month ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244

liked a Space about 1 month ago

TRELLIS.2

🏢

1.66k

High-fidelity 3D Generation from images

liked 4 Spaces 3 months ago

faster-qwen3-tts

🎙

241

Generate natural speech from text or voice samples

Parakeet-TDT-0.6b-V2

472

Transcribe audio files with timestamps and downloadable subtitles

The Smol Training Playbook

📚

3.19k

The secrets to building world-class LLMs

Parakeet STT Progressive Transcription

🎤

Transcribe speech to text instantly with WebGPU acceleration

liked a model 3 months ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 8.05M • • 3.04k

upvoted an article 4 months ago

Article

Creating custom kernels for the AMD MI300

ror, seungrokj

•

Jul 9, 2025

• 54

upvoted a paper 4 months ago

SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations

Paper • 2108.01073 • Published Aug 2, 2021 • 9

liked a model 4 months ago

zai-org/GLM-OCR

Image-Text-to-Text • 1B • Updated 11 days ago • 5.43M • • 1.79k

liked 2 Spaces 4 months ago

PaddleOCR-VL-1.5 Online Demo

😻

PaddleOCR-VL-1.5_Online_Demo

Qwen3-ASR Demo

🎙

139

Transcribe audio to text with timestamps and visualization

liked 2 models 4 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 357k • 2.51k

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • 3B • Updated Feb 3 • 1.51M • 969

liked 2 Spaces 4 months ago

Whisper

📉

2.76k

Transcribe audio files into text instantly

Omni Image Editor

🖼

1.79k

Image edit, text to image, image upscale, remove watermark

liked 2 models 4 months ago

fal/flux-2-klein-4B-background-remove-lora

Image-to-Image • Updated Jan 21 • • 24

Supertone/supertonic-2

Text-to-Speech • Updated Jan 6 • 5.31k • 390

liked a Space 4 months ago

Qwen3-TTS Demo

🎙

1.94k

Generate custom speech from text, voice descriptions, or samples

Alain Galvan

AI & ML interests

Recent Activity

Organizations

alaingalvan's activity

VibeVoice ASR

TRELLIS.2

faster-qwen3-tts

Parakeet-TDT-0.6b-V2

The Smol Training Playbook

Parakeet STT Progressive Transcription

Creating custom kernels for the AMD MI300

PaddleOCR-VL-1.5 Online Demo

Qwen3-ASR Demo

Whisper

Omni Image Editor

Qwen3-TTS Demo