ct2

ct-2

17 53 28

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Variable-Width Transformers

upvoted a paper 13 days ago

Tapered Language Models

upvoted a collection 13 days ago

EdgeRazor-Nbit

View all activity

Organizations

None yet

upvoted 2 papers 13 days ago

Variable-Width Transformers

Paper • 2606.18246 • Published 22 days ago • 15

Tapered Language Models

Paper • 2606.23670 • Published 16 days ago • 9

upvoted a collection 13 days ago

EdgeRazor-Nbit

Collection

16 items • Updated May 7 • 9

upvoted a paper 17 days ago

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

Paper • 2606.20381 • Published 20 days ago • 10

liked 2 models 27 days ago

sensenova/SenseNova-U1-8B-MoT

Any-to-Any • 18B • Updated May 15 • 42.5k • 287

ideogram-ai/ideogram-4-nf4

Text-to-Image • Updated Jun 4 • 8.64k • 423

upvoted a paper 27 days ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 29 days ago • 192

liked a model 29 days ago

silx-ai/Quasar-Preview

Text Generation • 17B • Updated 18 days ago • 5.25k • 93

upvoted 3 papers about 1 month ago

The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models

Paper • 2606.03645 • Published May 29 • 5

LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation

Paper • 2606.02553 • Published Jun 1 • 20

LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Paper • 2605.23901 • Published May 22 • 13

updated a bucket about 1 month ago

ct-2/Sherry-3B-1.25bit-per-channel-bucket

6.44 GB

published a bucket about 1 month ago

ct-2/Sherry-3B-1.25bit-per-channel-bucket

6.44 GB

liked a model about 1 month ago

MoraxGeo/Sherry-3B-1.25bit-per-channel

3B • Updated Feb 4 • 7 • 2

upvoted a collection about 2 months ago

BitCPM-CANN

Collection

Full-pipeline ternary quantized model trained on CANN. • 12 items • Updated May 24 • 28

updated a bucket about 2 months ago

ct-2/BitCPM4-CANN-8B-bucket

16.4 GB

published a bucket about 2 months ago

ct-2/BitCPM4-CANN-8B-bucket

16.4 GB

updated a bucket about 2 months ago

ct-2/BitCPM4-CANN-8B-gguf-bucket

2.37 GB

published a bucket about 2 months ago

ct-2/BitCPM4-CANN-8B-gguf-bucket

2.37 GB

upvoted a paper about 2 months ago

Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs

Paper • 2605.20315 • Published May 19 • 28

ct2

AI & ML interests

Recent Activity

Organizations

ct-2's activity

ct-2/Sherry-3B-1.25bit-per-channel-bucket

ct-2/Sherry-3B-1.25bit-per-channel-bucket

ct-2/BitCPM4-CANN-8B-bucket

ct-2/BitCPM4-CANN-8B-bucket

ct-2/BitCPM4-CANN-8B-gguf-bucket

ct-2/BitCPM4-CANN-8B-gguf-bucket