Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.54k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.08k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 114 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 61 • 10
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 135 • 78
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 324k • 313 codeparrot/apps Updated Oct 20, 2022 • 16.3k • 200 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 30.3k • 98
Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.54k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.08k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 114 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 61 • 10
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 324k • 313 codeparrot/apps Updated Oct 20, 2022 • 16.3k • 200 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 30.3k • 98
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 135 • 78