chnug (Roger yau)

upvoted 2 articles 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

Article

Optimization story: Bloom inference

Narsil

•

Oct 12, 2022

• 8

upvoted a paper 8 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

upvoted an article 8 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

+1

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 208

upvoted 2 collections 9 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 109

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 638

upvoted an article 10 months ago

Article

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

pratikbhavsar

•

Feb 12, 2025

• 28

upvoted a paper 10 months ago

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Paper • 2507.02259 • Published Jul 3, 2025 • 5

upvoted an article 11 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

nvidia

•

Jun 11, 2025

• 133

upvoted an article 12 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

+4

toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar

•

Jun 3, 2025

• 101

upvoted an article about 1 year ago

Article

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

tiiuae

•

May 15, 2025

• 36

Roger yau

AI & ML interests

Organizations

Smol2Operator: Post-Training GUI Agents for Computer Use

Optimization story: Bloom inference

A Survey of Reinforcement Learning for Large Reasoning Models

🪆 Introduction to Matryoshka Embedding Models

InternVL3.5

DINOv3

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

Roger yau

AI & ML interests

Organizations

chnug's activity

Smol2Operator: Post-Training GUI Agents for Computer Use

Optimization story: Bloom inference

🪆 Introduction to Matryoshka Embedding Models

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.