Japanese SFT/DPO data converted to speech via TTS, along with audio caption data generated by Qwen3-Omni. All datasets are available for commercial use.
Ayuto Tsutsumi
Atotti
Recent Activity
liked a model 1 day ago: nvidia/diar_streaming_sortformer_4spk-v2.1
liked a model 22 days ago: neuphonic/neucodec
liked a model 22 days ago: maai-kyoto/vap_jp_kyoto