Nguyễn Minh Phúc's picture

6

Nguyễn Minh Phúc

DatPySci

·

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a model about 6 hours ago

DatPySci/pretrain-sharpen

upvoted a paper 3 days ago

We Can't Understand AI Using our Existing Vocabulary

upvoted a paper 22 days ago

Why Do Reasoning Models Lose Coverage? The Role of Data and Forks in the Road

View all activity

Organizations

upvoted a paper 3 days ago

We Can't Understand AI Using our Existing Vocabulary

Paper • 2502.07586 • Published Feb 11, 2025 • 12

upvoted a paper 22 days ago

Why Do Reasoning Models Lose Coverage? The Role of Data and Forks in the Road

Paper • 2605.17026 • Published 28 days ago • 4

upvoted 2 papers about 2 years ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 157

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 62

upvoted 2 papers over 2 years ago

Weak-to-Strong Jailbreaking on Large Language Models

Paper • 2401.17256 • Published Jan 30, 2024 • 16

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22, 2024 • 19