Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching Paper • 2602.12221 • Published Feb 12 • 5
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching Paper • 2602.12221 • Published Feb 12 • 5
Uncertainty in Action: Confidence Elicitation in Embodied Agents Paper • 2503.10628 • Published Mar 13, 2025
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting Paper • 2506.17212 • Published Jun 20, 2025
MOCHA: Are Code Language Models Robust Against Multi-Turn Malicious Coding Prompts? Paper • 2507.19598 • Published Jul 25, 2025
CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence Paper • 2512.12768 • Published Dec 14, 2025 • 4
Hierarchical Dataset Selection for High-Quality Data Sharing Paper • 2512.10952 • Published Dec 11, 2025 • 2
PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation Paper • 2601.07060 • Published Jan 11 • 1
PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation Paper • 2601.16210 • Published Jan 22
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published Feb 2 • 16
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published Feb 2 • 16
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published Feb 2 • 16
Hierarchical Dataset Selection for High-Quality Data Sharing Paper • 2512.10952 • Published Dec 11, 2025 • 2
CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence Paper • 2512.12768 • Published Dec 14, 2025 • 4
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation Paper • 2506.21546 • Published Jun 26, 2025 • 2