MegaLDN (Evgen Novik)

liked 2 Spaces 3 months ago

The Ultra-Scale Playbook

🌌

3.86k

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

🍷

1.35k

Explore and download the FineWeb web‑scale text dataset

liked a model 7 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Text Generation • 26B • Updated Nov 27, 2025 • 3.88k • 20

liked 2 Spaces 7 months ago

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

RuQualBench

🐸

27

RuQualBench Leaderboard

liked a model 12 months ago

sergeyzh/BERTA

liked a Space about 1 year ago

MTEB Leaderboard

🥇

7.43k

Embedding Leaderboard

liked 3 datasets about 1 year ago

liked 8 models over 1 year ago

yandex/YandexGPT-5-Lite-8B-pretrain

8B • Updated Mar 31, 2025 • 1.68k • 218

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 14.1k • 3.61k

t-tech/T-lite-it-1.0

8B • Updated Dec 13, 2024 • 1.96k • 98

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 522k • 4.97k

IlyaGusev/gemma-2-2b-it-abliterated

Text Generation • 3B • Updated Jul 31, 2024 • 2.26k • 51

Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct

Text Generation • 1B • Updated Sep 27, 2024 • 2.15k • 47

Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF

Text Generation • 0.5B • Updated Oct 6, 2024 • 328 • 9

IlyaGusev/saiga_llama3_8b

Text Generation • 8B • Updated Jul 4, 2024 • 410k • 140

wyluilipe/ru-dataset-for-pretraining

Muennighoff/natural-instructions

TerenceLau/sparrow
