👋 Open to Work

Joseph [open/acc] Pollack PRO

Tonic

hugging-science

·

https://discord.gg/qdfnvSPcqP

AI & ML interests

🤖Making robots to help people learn things quicker 👩🏻‍🚀🚀

Recent Activity

reacted to RDTvlokip's post with 👍 about 22 hours ago

I finally changed the architecture of my 15M French LLM. It worked. Then I almost fooled myself about how much and catching that was the real win. After proving last time that architecture is a threshold, not a lever, I got stubborn: could I change how the model learns? Four honest attempts, Lion, a sharper AdamW β2, multi-token prediction, LayerScale. Four failures. The bottleneck wasn't the learning rule either. So I changed the shape of the computation instead: loop the same transformer blocks 4×, deeper reasoning, zero added parameters. It beat the baseline on perplexity, the first thing in the whole project to move that number. Then I added my own twist: let each token decide how deep to think, halting on its own entropy. My first evaluation was spectacular. Coherence up 65%. Hallucinated names down 62%. It was noise. Eight prompts, one seed. I re-ran on 50 prompts × 200 tokens and watched the gains shrink to "modest" and on out-of-domain prompts, recurrence actually made things worse. No universal winner. And none of it is new: it's Adaptive Computation Time (2016), the Universal Transformer (2018), and LoopViT (2026), recombined and measured honestly. The real lesson: A number from 8 prompts is a rumor. The eval harness that kills your own best result is worth more than the result it kills. Cite your lineage. Stay preliminary until multiple seeds say otherwise. The three models are live. The write-up is honest about every caveat 👇 🔗 https://huggingface.co/blog/RDTvlokip/teaching-a-15m-french-llm-to-think-deeper

liked a Space 3 days ago

liked a Space 6 days ago

julien-c/caliceo

View all activity

Organizations

Tonic 's datasets 35

Tonic/landshift-sft-v1-duplicate

Viewer • Updated Apr 25 • 239 • 26

Tonic/brief-composer-sft-v1-duplicate

Viewer • Updated Apr 25 • 3k • 80

Tonic/oceanscout-sft-v1-duplicate

Viewer • Updated Apr 25 • 69 • 26

Tonic/nutonic-sft-init-upload-test

Viewer • Updated Apr 21 • 310 • 158

Tonic/WaxalNLP

Viewer • Updated Apr 16 • 1.04k • 40

Tonic/LiquidSpace

Preview • Updated Mar 23 • 85

Tonic/android-operator-text-full

Viewer • Updated Feb 27 • 9.91k • 8

Tonic/voxtral-dataset-20260225_222829

Viewer • Updated Feb 25 • 10 • 7

Tonic/voxtral-dataset-20250913_175651

Viewer • Updated Sep 13, 2025 • 10 • 5

Tonic/voxtral-dataset-20250913_174653

Viewer • Updated Sep 13, 2025 • 10 • 7

Tonic/android-operator-episodes

Updated Sep 2, 2025 • 47 • 2

Tonic/GeneReviews

Viewer • Updated Aug 25, 2025 • 929 • 9

Tonic/trackio-experiments

Viewer • Updated Aug 10, 2025 • 32 • 333 • 1

Tonic/finreg_dataset_gemma3_27b_it_qat

Viewer • Updated Jun 4, 2025 • 5 • 11

Tonic/ollama_1000_example

Viewer • Updated Jun 4, 2025 • 1.05k • 35

Tonic/ESMA-Auto-Bench

Viewer • Updated Jun 3, 2025 • 263 • 18 • 1

Tonic/Health-Bench-Eval-OSS-2025-07

Viewer • Updated May 17, 2025 • 9.67k • 113 • 4

Tonic/twitter-block-lists

Viewer • Updated Mar 12, 2025 • 6.69k • 6

Tonic/scaleway_r1_dark_thoughts_casestudies_processed_fuzzy_think_splits

Viewer • Updated Feb 25, 2025 • 904 • 4 • 1

Tonic/runpod_qwen32_dark_thoughts_casestudies_processed_fuzzy_think_splits

Viewer • Updated Feb 25, 2025 • 78.6k • 18

Tonic/scaleway_r1_dark_thoughts_casestudies

Viewer • Updated Feb 24, 2025 • 1.3M • 35 • 2

Tonic/OpenReasonerZero

Viewer • Updated Feb 23, 2025 • 56.9k • 23 • 3

Tonic/dark_thoughts_stakeholders_deduplicated_shard

Viewer • Updated Feb 17, 2025 • 1.78k • 5 • 3

Tonic/dark_thoughts_stakeholders_test

Viewer • Updated Feb 16, 2025 • 689k • 15

Tonic/Climate-Guard-Toxic-Agent

Viewer • Updated Feb 13, 2025 • 83.8k • 105 • 2

Tonic/MiniF2F

Viewer • Updated Feb 5, 2025 • 488 • 408 • 7

Tonic/combined-fr-caselaw-21-01-2025

Updated Jan 21, 2025 • 130

Tonic/combined-fr-caselaw

Viewer • Updated Jan 20, 2025 • 511k • 28

Tonic/BBC-FineWeb

Updated Jan 8, 2025 • 2

Tonic/medquad

Viewer • Updated Jan 13, 2024 • 15.5k • 23 • 2