Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged

Merged model (base model + LoRA adapter) based on Qwen/Qwen3-0.6B-Base, trained on trl-lib/Capybara with supervised fine-tuning (SFT).

This repository contains standalone merged weights.

LoRA adapter from the same run: Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-lora

Downloads last month
3
Safetensors
Model size
0.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged

Finetuned
(577)
this model

Dataset used to train Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged