INT4 LLMs for vLLM (Collection) • Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 16 items • Updated Mar 2
Can You Run It? LLM version (Space) • Calculate GPU needs for running LLMs on your hardware
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • 8B • Updated Jun 18, 2025 • 1.33M • 4.46k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024 • 1
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024