VISIONx @ NYU

university

https://www.sainingxie.com/

AI & ML interests

None defined yet.

Recent Activity

bytetriper new activity 1 day ago

nyu-visionx/siglip2_decoder:RAE repo fails when using google/siglip2-so400m-patch14-224 as encoder

AustinWang0330 updated a collection 3 days ago

bytetriper updated a model 7 days ago

nyu-visionx/webmae_decoder

View all activity

Papers

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

View all Papers

Organization Card

Community About org cards

Edit this README.md markdown file to author your organization card.

Collections 7

View 7 collections

models 37

nyu-visionx/webmae_decoder

Updated 7 days ago • 13

nyu-visionx/siglip2_decoder

Image-to-Image • Updated 12 days ago • 1.02k

nyu-visionx/webssl300m_decoder

Image-to-Image • Updated 12 days ago • 78

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B-WebSSL

Text-to-Image • 4B • Updated 12 days ago • 150

nyu-visionx/Scale-RAE-Qwen7B_DiT9.8B

Text Generation • 17B • Updated 28 days ago • 190 • 1

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B

Text Generation • 4B • Updated 28 days ago • 1.07k

nyu-visionx/Cambrian-S-3B-S3

3B • Updated Jan 4 • 245

nyu-visionx/Cambrian-S-3B-S2

3B • Updated Jan 4 • 275

nyu-visionx/Cambrian-S-3B-S1

3B • Updated Jan 4 • 2

nyu-visionx/Cambrian-S-1.5B-S3

2B • Updated Jan 4 • 178

datasets 13

nyu-visionx/scale-rae-data

Updated 12 days ago • 5.29k • 1

nyu-visionx/Cambrian-S-3M

Updated 14 days ago • 6.83k • 2

nyu-visionx/VSI-Bench

Viewer • Updated Nov 11, 2025 • 10.3k • 11.6k • 58

nyu-visionx/VSI-Train-10k

Viewer • Updated Nov 7, 2025 • 10k • 572 • 3

nyu-visionx/VSI-SUPER-Count

Viewer • Updated Nov 7, 2025 • 400 • 1.08k • 5

nyu-visionx/VSI-SUPER-Recall

Viewer • Updated Nov 7, 2025 • 300 • 606 • 3

nyu-visionx/VSI-590K

Preview • Updated Nov 7, 2025 • 2.26k • 12

nyu-visionx/CV-Bench

Viewer • Updated Jul 20, 2025 • 5.28k • 5.33k • 42

nyu-visionx/pyramid_flow_ft_results

Viewer • Updated Mar 30, 2025 • 8.42k • 16

nyu-visionx/pisa-experiments

Updated Mar 18, 2025 • 44 • 2

View 13 datasets