The Troiani LLM model family progression from v0 -> v1 including Base, Instruct and specializations like Vision, Audio...
Walter Troiani Vargas
eZWALT
Recent Activity
Updated a collection 12 days ago: Troiani v1
Published a dataset 12 days ago: eZWALT/Instruct-SFT-Troiani-v0
Updated a collection 17 days ago: Production LLMs
Production LLMs
- FineWeb: decanting the web for the finest text data at scale
  Running • Featured • 🍷 1.38k
  Explore and download the FineWeb web‑scale text dataset
- The Ultra-Scale Playbook
  Running • 🌌 3.92k
  The ultimate guide to training LLM on large GPU Clusters
- Predict Memory
  Running • Agents • 🧮 111
  Estimate model memory usage and see detailed plots
- Encoder-Free VLM
  Running • 👁 45
  Train Your Own Encoder-Free VLM in $100
Multimodal NanoChimera
- HuggingFaceTB/SmolVLM-Instruct
  Image-Text-to-Text • 2B • Updated • 19.9k • 589
- google/siglip2-base-patch16-224
  Zero-Shot Image Classification • 0.4B • Updated • 412k • 112
- openai/clip-vit-base-patch32
  Zero-Shot Image Classification • Updated • 22.3M • 969
- google/siglip2-base-patch16-512
  Zero-Shot Image Classification • 0.4B • Updated • 113k • 47
Cursed Toxic Pretraining Corpora
Troiani v1
The Troiani LLM model family progression from v0 -> v1 including Base, Instruct and specializations like Vision, Audio...
Master Thesis: LLMs <> Ads
RLHF Resources