Carlos García's picture
🤝 Open to Collab

Carlos García

cgarciams
·

AI & ML interests

Building a GPT-2 medium size (approx. 400 M parameters) model from scratch, using PyTorch, the OpenWebText dataset, Tiktoken, AdamW optimizer and FlashAttention. Just for fun.

Recent Activity

updated a model about 3 hours ago
cgarciams/gsp2_355m_sft
updated a collection 4 days ago
GSP-2 355M Models
updated a model 4 days ago
cgarciams/gsp2_355m_base
View all activity

Organizations

Universidad de Zaragoza's profile picture