smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 1 day ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-2e-5 Updated 2 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5 Updated 13 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5 Updated 15 days ago
smcleish/tinyllama_4_8_4_last_8_layers_add_adapter Text Generation • 0.8B • Updated 27 days ago • 42
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-100 Text Generation • 2B • Updated Jan 22 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-200 Text Generation • 2B • Updated Jan 22 • 2
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-300 Text Generation • 2B • Updated Jan 22 • 3
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-400 Text Generation • 2B • Updated Jan 22 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-100 Text Generation • 2B • Updated Jan 22 • 3
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 22 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-300 Text Generation • 2B • Updated Jan 22 • 1
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-200-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 2
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-400 Text Generation • 2B • Updated Jan 21 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 3
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-400 Text Generation • 2B • Updated Jan 20 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-500 Text Generation • 2B • Updated Jan 20 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-300 Text Generation • 2B • Updated Jan 20 • 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-200 Text Generation • 2B • Updated Jan 20 • 1
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-500-chkpt-step-400 Text Generation • 2B • Updated Jan 19 • 1