Small Vision-Language Models are Smart Compressors for Long Video Understanding Paper • 2604.08120 • Published 3 days ago • 12
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 27