arxiv:2502.18356
Kang
JaxonK
·
AI & ML interests
RL, LLMs, Model generalization
Recent Activity
upvoted a paper about 6 hours ago
QUACK: Questioning, Understanding, and Auditing Communicated Knowledge in Multimodal Social Deduction Agents updated a model 5 months ago
JaxonK/cua-reward-qwen3vl-8b new activity 5 months ago
JaxonK/cua-reward-qwen3vl-8b:update model type