Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged
Merged model (base model + LoRA adapter) based on Qwen/Qwen3-0.6B-Base, trained on trl-lib/Capybara with supervised fine-tuning (SFT).
This repository contains standalone merged weights.
LoRA adapter from the same run: Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-lora
- Downloads last month
- 3
Model tree for Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged
Base model
Qwen/Qwen3-0.6B-Base