DONGFANG ZIHAO's picture

9 3

DONGFANG ZIHAO

UUUserna

·

UUUserna

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

authored a paper 3 days ago

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

authored a paper 3 days ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

View all activity

Organizations

None yet

upvoted a paper 2 days ago

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

Paper • 2603.18429 • Published 4 days ago • 21

upvoted a paper 3 days ago

Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models

Paper • 2603.17541 • Published 4 days ago • 20

upvoted a paper 9 days ago

DVD: Deterministic Video Depth Estimation with Generative Priors

Paper • 2603.12250 • Published 10 days ago • 26

upvoted a paper about 1 month ago

BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models

Paper • 2602.04163 • Published Feb 4 • 10

upvoted a paper about 2 months ago

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published Feb 4 • 48

upvoted a paper 3 months ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published Dec 28, 2025 • 20

upvoted 2 papers 5 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29, 2025 • 17

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Paper • 2510.07143 • Published Oct 8, 2025 • 13

upvoted a paper 6 months ago

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Paper • 2509.12989 • Published Sep 16, 2025 • 28