arxiv:2512.15715
Shang-Wen Daniel Li
swdanielli
AI & ML interests
Large foundation models, vision and language multimodal, and pretraining and self-supervised training
Recent Activity
upvoted a collection about 4 hours ago
Pixio liked a dataset 4 months ago
facebook/EgoAVU_data upvoted a paper 4 months ago
EgoAVU: Egocentric Audio-Visual Understanding