memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 1 day ago • 39
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated 1 day ago • 20
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated 1 day ago • 20
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 1 day ago • 39
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative Text Generation • 1B • Updated 8 days ago • 29
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative Text Generation • 1B • Updated 8 days ago • 29
aflah/llama32_1b_dclm-SL-2048-PGBS-16-GAS-4-NGPU-8-NNODES-1-TW-PERF-step-23999 1B • Updated Mar 17, 2025
aflah/llama32_1b_dclm-SL-2048-PGBS-16-GAS-4-NGPU-8-NNODES-1-TW-PERF-step-23999 1B • Updated Mar 17, 2025
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 37 items • Updated Mar 2 • 377