Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
LMMs-Lab-Audio
community
Activity Feed
Request to join this org
Follow
5
AI & ML interests
Feeling and building the multimodal intelligence
Recent Activity
mwxely
authored
a paper
about 1 hour ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
mwxely
authored
a paper
1 day ago
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
mwxely
authored
a paper
9 days ago
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
View all activity
Team members
4
models
0
None public yet
datasets
3
Sort: Recently updated
lmms-lab-audio/timit-tts
Updated
Feb 15
•
3
lmms-lab-audio/song-describer
Viewer
•
Updated
Feb 13
•
1.85k
•
37
lmms-lab-audio/europal-asr
Viewer
•
Updated
Feb 13
•
215
•
13