Gonzales (Fernanda24)
AI & ML interests: None yet
Organizations: None yet
FP8 versions of DeepSeek-V3.2 would be awesome!
2
#1 opened 2 months ago by Fernanda24
ValueError: gemm_fp8_nt_groupwise is only supported on SM100, SM103 in trtllm backend.
#3 opened 2 months ago by Fernanda24
Is it possible to make a smaller NVFP4 quant at 340-360GB to fit in 4x96gb?
1 reaction
68
#1 opened 3 months ago by Fernanda24
attention backend
1
#1 opened 3 months ago by Fernanda24
Aww Man!
20
#1 opened 3 months ago by mtcl
Question: will it work in vllm or sglang with RTX 6000 Blackwells? CUDA arch sm120
6
#1 opened 5 months ago by Fernanda24
Ooof, this fits in 4x96gb. Can we get this for the new 3.2 Speciale as well please? :)
16
#2 opened 3 months ago by Fernanda24
4 x RTX PRO 6000
1 reaction
2
#1 opened 3 months ago by willfalco
How did you bypass deepseek-v32 not being recognized in Transformers?
3
#3 opened 3 months ago by Fernanda24
model type `deepseek_v32` not found
5
#16 opened 3 months ago by harryhan618
Update README.md
1
#4 opened 3 months ago by Fernanda24
broken link to Mistral Small
1
#3 opened 3 months ago by Fernanda24
Cool!
#6 opened 3 months ago by Fernanda24
Thanks!
#1 opened 3 months ago by Fernanda24
Can we get this for the new DeepSeek v3.2?
1 reaction
#2 opened 3 months ago by Fernanda24
Do you know which nightly it worked with? Because it does not work with the current one.
31
#1 opened 4 months ago by willfalco