mnoukhov/gsm8k-platinum-openinstruct-qwen2.5-0.5b-instruct-1024samples-buckets Updated about 9 hours ago • 6
mnoukhov/gsm8k-platinum-openinstruct-qwen2.5-0.5b-instruct-128samples Viewer • Updated 3 days ago • 1.21k • 7
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b Viewer • Updated Jun 20, 2024 • 177k • 15
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel2_llama8b Viewer • Updated Jun 19, 2024 • 92.1k • 9
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_llama8b Viewer • Updated Jun 19, 2024 • 176k • 4
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr_relabel_pythia1b Viewer • Updated May 17, 2024 • 107k • 7
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr Viewer • Updated May 17, 2024 • 107k • 9
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia1b Viewer • Updated May 16, 2024 • 177k • 14
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144 Viewer • Updated May 13, 2024 • 179k • 18
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873_relabel_pythia1b Viewer • Updated May 13, 2024 • 20k • 9
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr-step873 Viewer • Updated May 12, 2024 • 20k • 8
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_dpo_costa_2.8b_bf16.yml_6e799_new Viewer • Updated May 5, 2024 • 20k • 6
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_dpo_costa_2.8b_bf16.yml_6e799 Viewer • Updated Apr 22, 2024 • 107k • 5