Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 7 hours ago

LLM Training Datasets

A collection of datasets for training LLMs. • 127 items • Updated about 7 hours ago • 30

liked a dataset about 7 hours ago

markov-ai/computer-use-large

Updated about 4 hours ago • 45.7k • 68

upvoted an article 3 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

6 days ago • 168

updated 2 collections 4 days ago

Papers-Fundamentals

29 items • Updated 4 days ago • 1

Papers

Large Language Model (LLM) and NLP related papers. • 347 items • Updated 4 days ago • 14

upvoted a paper 4 days ago

Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned

Paper • 2603.05344 • Published 10 days ago • 6

updated 2 collections 4 days ago

Papers + RL/Reasoning

37 items • Updated 4 days ago

Papers

Large Language Model (LLM) and NLP related papers. • 347 items • Updated 4 days ago • 14

upvoted a paper 4 days ago

Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought

Paper • 2510.24941 • Published Oct 28, 2025 • 4

upvoted an article 5 days ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

7 days ago • 20

upvoted an article 9 days ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

10 days ago • 93

upvoted a collection 13 days ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 4 days ago • 113

updated a collection 13 days ago

LLMs

Collection of LLMs • 404 items • Updated 13 days ago • 1

liked a model 13 days ago

janhq/Jan-code-4b

Text Generation • 4B • Updated 12 days ago • 2.07k • 68

upvoted a collection 13 days ago

Jan-code

2 items • Updated 14 days ago • 19

upvoted a collection 16 days ago

pplx-embed

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 17 days ago • 87

updated 2 collections 16 days ago

LLMs

Collection of LLMs • 404 items • Updated 13 days ago • 1

LLM Tools

A collection of tools as various HF Spaces on LLMs. • 142 items • Updated 13 days ago • 3

liked a Space 16 days ago

LFM2.5 1.2B Thinking WebGPU

Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU

updated a collection 17 days ago

Papers-Fundamentals

29 items • Updated 4 days ago • 1