An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex
-
OpenMOSS-Team/MOSS-Audio-4B-Instruct
Audio-Text-to-Text ⢠5B ⢠Updated ⢠134 ⢠17 -
OpenMOSS-Team/MOSS-Audio-4B-Thinking
Audio-Text-to-Text ⢠5B ⢠Updated ⢠150 ⢠14 -
OpenMOSS-Team/MOSS-Audio-8B-Instruct
Audio-Text-to-Text ⢠9B ⢠Updated ⢠129 ⢠19 -
OpenMOSS-Team/MOSS-Audio-8B-Thinking
Audio-Text-to-Text ⢠9B ⢠Updated ⢠133 ⢠27