DPO fine-tuned version of Qwen/Qwen3-4B-Instruct-2507. Full-merged 16-bit weights. No adapter loading required.
Chat template
Files info
Base model