Pannaga j

pannaga10

AI & ML interests

None yet

Recent Activity

new activity about 19 hours ago

google/gemma-4-31B-it:Gemma4 is super slow on vllm docker , max to max 15 token per second only (4 X L4 GPU)

new activity 4 days ago

google/gemma-4-31B-it:This model is so good, but the KV cache ruins it.

new activity 6 days ago

google/gemma-4-E2B-it:How to best prompt for transcription results when sending audio input?

Organizations

models 0

None public yet

datasets 0

None public yet