20 19 68

DongJae Shin PRO

ShinDJ

AI & ML interests

NLP, LLM, Vision-Langauge Model

Recent Activity

liked a model 5 days ago

google/gemma-4-12B

upvoted a paper 5 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

upvoted a paper 7 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

View all activity

Organizations

liked a model 5 days ago

google/gemma-4-12B

Any-to-Any • 12B • Updated 4 days ago • 118k • 440

upvoted a paper 5 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 12 days ago • 139

upvoted a paper 7 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published 12 days ago • 36

liked 2 models 19 days ago

nvidia/Cosmos-Predict2.5-2B

Updated Mar 3 • 67.5k • 132

nvidia/GR00T-N1.7-3B

Robotics • 3B • Updated Apr 23 • 43.3k • 53

liked a Space 25 days ago

Ltx-2.3 FFLFwith Lora

📚

Generate AI video from text, images, and audio

liked a model about 1 month ago

microsoft/VibeVoice-ASR

Automatic Speech Recognition • 9B • Updated Jan 27 • 572k • 1.17k

liked a dataset about 2 months ago

hysong/MentalBench

Viewer • Updated 26 days ago • 24.8k • 277 • 38

liked a model about 2 months ago

Lightricks/LTX-2.3

Image-to-Video • Updated Apr 13 • 2.26M • 1.34k

upvoted a paper 2 months ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

liked 2 models 3 months ago

KORMo-Team/KORMo-10B-base

Text Generation • 11B • Updated 4 days ago • 1.91k • 38

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 233k • 1.31k

upvoted a paper 3 months ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 150

upvoted an article 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 167

updated a model 3 months ago

KORMo-VL/KORMo-VL

Image-Text-to-Text • 11B • Updated Mar 6 • 1.19k • 39

liked 2 models 3 months ago

KORMo-VL/KORMo-VL

Image-Text-to-Text • 11B • Updated Mar 6 • 1.19k • 39

KORMo-VL/KORMo-VL-Diffusion

Updated Mar 5 • 8 • 17

liked 2 datasets 3 months ago

nuprl/AgentPack

Viewer • Updated Oct 2, 2025 • 2M • 79 • 18

HAERAE-HUB/KMMMU

Viewer • Updated Apr 16 • 3.45k • 295 • 13

published a dataset 3 months ago

MLP-VLM/bllossom-vision

Viewer • Updated Nov 29, 2024 • 4.97M • 5