ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 11 days ago • 41
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Paper • 2603.19039 • Published 11 days ago • 50
How to Take a Memorable Picture? Empowering Users with Actionable Feedback Paper • 2602.21877 • Published Feb 25 • 16
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Paper • 2404.07990 • Published Apr 11, 2024
Improving Fairness using Vision-Language Driven Image Augmentation Paper • 2311.01573 • Published Nov 2, 2023