HF datasets — OS-Platform
zayne
MC7ever
AI & ML interests
None yet
Recent Activity
updated a collection less than a minute ago
NLP Text Datasets updated a collection less than a minute ago
Medical Health Datasets updated a collection less than a minute ago
NLP Text DatasetsOrganizations
Education Datasets
HF datasets — Education
Social Media Datasets
HF datasets — Social-Media
NSFW Adult Datasets
HF datasets — NSFW-Adult
Instruction Finetuning Datasets
HF datasets — Instruction-Finetuning
Other Misc Datasets
HF datasets — Other-Misc
Synthetic Data Datasets
HF datasets — Synthetic-Data
-
broadfield-dev/gemma-3-synthetic-tool-data-1760974607
Viewer • Updated • 10 • 4 -
broadfield-dev/gemma-3-synthetic-tool-data-1761016184
Viewer • Updated • 10 • 5 -
broadfield-dev/gemma-3-synthetic-tool-data-1761021163
Viewer • Updated • 98 • 13 -
AlekseyKorshuk/dalio-synthetic-complete
Viewer • Updated • 159 • 9
Roleplay Characters Datasets
HF datasets — Roleplay-Characters
Benchmarks Evals Datasets
HF datasets — Benchmarks-Evals
Tools Function Calling Datasets
HF datasets — Tools-Function-Calling
-
111mohan111/phi4-function-calling-dataset-v3
Viewer • Updated • 7k • 6 -
1nstaller/mail-tool-use
Preview • Updated • 57 • 3 -
2328322889hxy/glaiveai-glaive-function-calling-v2
Viewer • Updated • 75.2k • 20 -
5CD-AI/Vietnamese-Salesforce-xlam-function-calling-60k-gg-translated
Viewer • Updated • 60k • 113 • 7
Traces Datasets
HF datasets — Traces
Chat Texting DMs Datasets
HF datasets — Chat-Texting-DMs
Optimisation Datasets
HF datasets — Optimisation
General Knowledge Datasets
HF datasets — General-Knowledge
Audio Speech Datasets
HF datasets — Audio-Speech
Legal Law Datasets
HF datasets — Legal-Law
-
ADRA-RL/dolma3-arxiv_paraphrased_unique_trio_ratio_1.50_adaptive_match_random_7_p0.25_a0.25
Viewer • Updated • 128 • 7 -
ADRA-RL/dolma3-arxiv_unique_trio_ratio_1.50_adaptive_match_loss_random_7_p0.25_a0.25
Viewer • Updated • 128 • 7 -
AIM-Harvard/NAACL-Accepted-Papers
Viewer • Updated • 2.04k • 33 -
Adanato/arxiv_similarity_300
Viewer • Updated • 38.8k • 149
Academic Arxiv
HF datasets — Academic-Arxiv
Vision Image
HF datasets — Vision-Image
Science Physics
HF datasets — Science-Physics
Roleplay Characters
HF datasets — Roleplay-Characters
QA RAG
HF datasets — QA-RAG
NLP Text
HF datasets — NLP-Text
Medical Health
HF datasets — Medical-Health
Legal Law
HF datasets — Legal-Law
Code
HF datasets — Code
Audio Speech
HF datasets — Audio-Speech
Memory Traces Datasets
HF datasets — Memory-Traces
Pop Culture Datasets
HF datasets — Pop-Culture
Uncensored Jailbreak Datasets
HF datasets — Uncensored-Jailbreak
-
torilab/uncensored-data
Preview • Updated • 1 • 1 -
Asap7772/prm800k_backtracks_onpolicy_bofn_valuemc_turn_dependent_sep_reward
Viewer • Updated • 226k • 69 -
Asap7772/prm800k_onpolicy_multiturn_cumm_rew_prefix0.2_roll4_maxrev100
Viewer • Updated • 24.7M • 26 -
Asap7772/prm800k_onpolicy_multiturn_cummrew_prefix0.2_roll4_maxrev100
Viewer • Updated • 10.7M • 25
NLP Text Datasets
HF datasets — NLP-Text
QA RAG Datasets
HF datasets — QA-RAG
Emotion Psychology Datasets
HF datasets — Emotion-Psychology
Code Datasets
HF datasets — Code
-
ClarusC64/ai-5node-chain-buf-lag-cpl-agent-loop-v0.1
Viewer • Updated • 9 • 13 -
ClarusC64/market-agent-reaction-path-mapping-v0.1
Viewer • Updated • 7 • 40 -
MartinElMolon/stocks_demo_react_agent_generated_train_dataset
Viewer • Updated • 497 • 11 -
agentlans/lightblue-tagengo-gpt4
Viewer • Updated • 76k • 354
Vision Image Datasets
HF datasets — Vision-Image
Code Execution Datasets
HF datasets — Code-Execution
Agent Tools Datasets
HF datasets — Agent-Tools
Science Physics Datasets
HF datasets — Science-Physics
Skills Datasets
HF datasets — Skills
Encyclopedia World Knowledge Datasets
HF datasets — Encyclopedia-World-Knowledge
Robotics Datasets
HF datasets — Robotics
Medical Health Datasets
HF datasets — Medical-Health
Academic Arxiv Datasets
HF datasets — Academic-Arxiv
Agent Tools
HF datasets — Agent-Tools
Synthetic Data
HF datasets — Synthetic-Data
Safety Alignment
HF datasets — Safety-Alignment
-
0x22almostEvil/words-operations-rewards-5k
Viewer • Updated • 5k • 31 • 1 -
AlekseyKorshuk/ak_edit_issue_analysis_128_v2-reward
Viewer • Updated • 17.6k • 13 -
AlekseyKorshuk/ak_edit_issue_analysis_128_v2_with_zl-reward
Viewer • Updated • 17.6k • 14 -
AlekseyKorshuk/reward-model-no-topic-predictions
Viewer • Updated • 8.81k • 26
Robotics
HF datasets — Robotics
Other Misc
HF datasets — Other-Misc
Multilingual
HF datasets — Multilingual
Math Reasoning
HF datasets — Math-Reasoning
-
0x22almostEvil/reasoning-gsm-qna-oa
Viewer • Updated • 8.79k • 63 • 8 -
0x22almostEvil/reasoning_bg_oa
Viewer • Updated • 2.63k • 50 • 2 -
AFFFPupu/Maths_competition_questions
Viewer • Updated • 120 • 35 • 2 -
AMAImedia/NOESIS-1M-reasoning-router-code-math-psych-opus47-deepseek4-qwen36-gemini31-r1-gpt54
Viewer • Updated • 1M • 443 • 4
Instruction Finetuning
HF datasets — Instruction-Finetuning
Benchmarks Evals
HF datasets — Benchmarks-Evals
OS Platform Datasets
HF datasets — OS-Platform
Memory Traces Datasets
HF datasets — Memory-Traces
Education Datasets
HF datasets — Education
Pop Culture Datasets
HF datasets — Pop-Culture
Social Media Datasets
HF datasets — Social-Media
Uncensored Jailbreak Datasets
HF datasets — Uncensored-Jailbreak
-
torilab/uncensored-data
Preview • Updated • 1 • 1 -
Asap7772/prm800k_backtracks_onpolicy_bofn_valuemc_turn_dependent_sep_reward
Viewer • Updated • 226k • 69 -
Asap7772/prm800k_onpolicy_multiturn_cumm_rew_prefix0.2_roll4_maxrev100
Viewer • Updated • 24.7M • 26 -
Asap7772/prm800k_onpolicy_multiturn_cummrew_prefix0.2_roll4_maxrev100
Viewer • Updated • 10.7M • 25
NSFW Adult Datasets
HF datasets — NSFW-Adult
NLP Text Datasets
HF datasets — NLP-Text
Instruction Finetuning Datasets
HF datasets — Instruction-Finetuning
QA RAG Datasets
HF datasets — QA-RAG
Other Misc Datasets
HF datasets — Other-Misc
Emotion Psychology Datasets
HF datasets — Emotion-Psychology
Synthetic Data Datasets
HF datasets — Synthetic-Data
-
broadfield-dev/gemma-3-synthetic-tool-data-1760974607
Viewer • Updated • 10 • 4 -
broadfield-dev/gemma-3-synthetic-tool-data-1761016184
Viewer • Updated • 10 • 5 -
broadfield-dev/gemma-3-synthetic-tool-data-1761021163
Viewer • Updated • 98 • 13 -
AlekseyKorshuk/dalio-synthetic-complete
Viewer • Updated • 159 • 9
Code Datasets
HF datasets — Code
-
ClarusC64/ai-5node-chain-buf-lag-cpl-agent-loop-v0.1
Viewer • Updated • 9 • 13 -
ClarusC64/market-agent-reaction-path-mapping-v0.1
Viewer • Updated • 7 • 40 -
MartinElMolon/stocks_demo_react_agent_generated_train_dataset
Viewer • Updated • 497 • 11 -
agentlans/lightblue-tagengo-gpt4
Viewer • Updated • 76k • 354
Roleplay Characters Datasets
HF datasets — Roleplay-Characters
Vision Image Datasets
HF datasets — Vision-Image
Benchmarks Evals Datasets
HF datasets — Benchmarks-Evals
Code Execution Datasets
HF datasets — Code-Execution
Tools Function Calling Datasets
HF datasets — Tools-Function-Calling
-
111mohan111/phi4-function-calling-dataset-v3
Viewer • Updated • 7k • 6 -
1nstaller/mail-tool-use
Preview • Updated • 57 • 3 -
2328322889hxy/glaiveai-glaive-function-calling-v2
Viewer • Updated • 75.2k • 20 -
5CD-AI/Vietnamese-Salesforce-xlam-function-calling-60k-gg-translated
Viewer • Updated • 60k • 113 • 7
Agent Tools Datasets
HF datasets — Agent-Tools
Traces Datasets
HF datasets — Traces
Science Physics Datasets
HF datasets — Science-Physics
Chat Texting DMs Datasets
HF datasets — Chat-Texting-DMs
Skills Datasets
HF datasets — Skills
Optimisation Datasets
HF datasets — Optimisation
Encyclopedia World Knowledge Datasets
HF datasets — Encyclopedia-World-Knowledge
General Knowledge Datasets
HF datasets — General-Knowledge
Robotics Datasets
HF datasets — Robotics
Audio Speech Datasets
HF datasets — Audio-Speech
Medical Health Datasets
HF datasets — Medical-Health
Legal Law Datasets
HF datasets — Legal-Law
-
ADRA-RL/dolma3-arxiv_paraphrased_unique_trio_ratio_1.50_adaptive_match_random_7_p0.25_a0.25
Viewer • Updated • 128 • 7 -
ADRA-RL/dolma3-arxiv_unique_trio_ratio_1.50_adaptive_match_loss_random_7_p0.25_a0.25
Viewer • Updated • 128 • 7 -
AIM-Harvard/NAACL-Accepted-Papers
Viewer • Updated • 2.04k • 33 -
Adanato/arxiv_similarity_300
Viewer • Updated • 38.8k • 149
Academic Arxiv Datasets
HF datasets — Academic-Arxiv
Academic Arxiv
HF datasets — Academic-Arxiv
Agent Tools
HF datasets — Agent-Tools
Vision Image
HF datasets — Vision-Image
Synthetic Data
HF datasets — Synthetic-Data
Science Physics
HF datasets — Science-Physics
Safety Alignment
HF datasets — Safety-Alignment
-
0x22almostEvil/words-operations-rewards-5k
Viewer • Updated • 5k • 31 • 1 -
AlekseyKorshuk/ak_edit_issue_analysis_128_v2-reward
Viewer • Updated • 17.6k • 13 -
AlekseyKorshuk/ak_edit_issue_analysis_128_v2_with_zl-reward
Viewer • Updated • 17.6k • 14 -
AlekseyKorshuk/reward-model-no-topic-predictions
Viewer • Updated • 8.81k • 26
Roleplay Characters
HF datasets — Roleplay-Characters
Robotics
HF datasets — Robotics
QA RAG
HF datasets — QA-RAG
Other Misc
HF datasets — Other-Misc
NLP Text
HF datasets — NLP-Text
Multilingual
HF datasets — Multilingual
Medical Health
HF datasets — Medical-Health
Math Reasoning
HF datasets — Math-Reasoning
-
0x22almostEvil/reasoning-gsm-qna-oa
Viewer • Updated • 8.79k • 63 • 8 -
0x22almostEvil/reasoning_bg_oa
Viewer • Updated • 2.63k • 50 • 2 -
AFFFPupu/Maths_competition_questions
Viewer • Updated • 120 • 35 • 2 -
AMAImedia/NOESIS-1M-reasoning-router-code-math-psych-opus47-deepseek4-qwen36-gemini31-r1-gpt54
Viewer • Updated • 1M • 443 • 4
Legal Law
HF datasets — Legal-Law
Instruction Finetuning
HF datasets — Instruction-Finetuning
Code
HF datasets — Code
Benchmarks Evals
HF datasets — Benchmarks-Evals
Audio Speech
HF datasets — Audio-Speech