🔄 In a Training Loop

surfingpapi PRO

surfingpapi

5 56

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

upvoted an article about 2 months ago

Enabling Long Context Training with Sequence Parallelism in Axolotl

liked a Space about 2 months ago

AdithyaSK/rl-environments-guide


Organizations

liked a model 6 days ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

Text Generation • 12B • Updated 13 days ago • 597k • 2.55k

upvoted an article about 2 months ago

Article

Enabling Long Context Training with Sequence Parallelism in Axolotl

axolotl-ai-co • Apr 4, 2025 • 17

liked a Space about 2 months ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

195

Building and scaling RL environments for LLM training

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 165

liked 2 datasets 2 months ago

nvidia/Nemotron-SFT-OpenCode-v1

Preview • Updated Mar 23 • 3.48k • 54

pliny-the-prompter/OBLITERATUS-TELEMETRY

Viewer • Updated 10 days ago • 78.6k • 673 • 22

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8

Text Generation • 124B • Updated Apr 29 • 316k • 262

liked a dataset 3 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 4.7k • 678

liked 3 models 3 months ago

liked a dataset 3 months ago

AlicanKiraz0/All-CVE-Records-Training-Dataset

Viewer • Updated Jun 12, 2025 • 297k • 3.48k • 59

liked a dataset 4 months ago

OpenCoder-LLM/opc-sft-stage2

Viewer • Updated Nov 24, 2024 • 436k • 3.04k • 103

liked a model 4 months ago

open-thoughts/OpenThinker-Agent-v1-SFT

Text Generation • 308k • Updated Jan 27 • 427 • 9

liked 2 datasets 4 months ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 18k • 245

nvidia/Nemotron-ClimbMix

Viewer • Updated Oct 21, 2025 • 355M • 5.41k • 118

liked 3 models 4 months ago

DeepHat/DeepHat-V1-7B

Text Generation • 8B • Updated Aug 15, 2025 • 3.81k • 156

WhiteRabbitNeo/WhiteRabbitNeo-13B-v1

Text Generation • Updated Feb 15, 2024 • 1.67k • 460

fdtn-ai/Foundation-Sec-8B

Text Generation • 8B • Updated Aug 26, 2025 • 6.13k • 313

liked a model 5 months ago

Virtue-AI-HUB/VulnLLM-R-7B

Text Generation • 8B • Updated Dec 12, 2025 • 12.7k • 192
