1 63 33

Joel Wang

joelhenwang

joelhenwang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Rethinking Memory as Continuously Evolving Connectivity

liked a dataset 2 days ago

Zyphra/Zyda-2

upvoted a paper 2 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

View all activity

Organizations

upvoted a paper 2 days ago

Rethinking Memory as Continuously Evolving Connectivity

Paper • 2605.28773 • Published 5 days ago • 27

liked a dataset 2 days ago

Zyphra/Zyda-2

Preview • Updated Aug 6, 2025 • 193k • 95

upvoted 7 papers 2 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 7 days ago • 132

HGRN2: Gated Linear RNNs with State Expansion

Paper • 2404.07904 • Published Apr 11, 2024 • 21

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 5 days ago • 80

updated a model 3 days ago

joelhenwang/OdinNext-138M-Early-Checkpoint

Text Generation • 0.2B • Updated 3 days ago • 47

published a model 3 days ago

joelhenwang/OdinNext-138M-Early-Checkpoint

Text Generation • 0.2B • Updated 3 days ago • 47

upvoted 9 papers 3 days ago

Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models

Paper • 2605.26895 • Published 6 days ago • 17

Self-Improving Language Models with Bidirectional Evolutionary Search

Paper • 2605.28814 • Published 5 days ago • 54

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Paper • 2604.27039 • Published Apr 29 • 25

Why Fine-Tuning Encourages Hallucinations and How to Fix It

Paper • 2604.15574 • Published Apr 16 • 25

AgriIR: A Scalable Framework for Domain-Specific Knowledge Retrieval

Paper • 2604.16353 • Published Mar 17 • 4

TIDE: Every Layer Knows the Token Beneath the Context

Paper • 2605.06216 • Published 25 days ago • 9

Long Context Pre-Training with Lighthouse Attention

Paper • 2605.06554 • Published 25 days ago • 31

GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding

Paper • 2605.15250 • Published 18 days ago • 13

EndPrompt: Efficient Long-Context Extension via Terminal Anchoring

Paper • 2605.14589 • Published 18 days ago • 17

Joel Wang

AI & ML interests

Recent Activity

Organizations

joelhenwang's activity