Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Saurav Prateek's picture
3 10

Saurav Prateek

SauravP97
·
http://sauravp97.github.io/
  • SauravP97
  • saurav-prateek-7b2096140

AI & ML interests

LLM, Deep Neural Nets, Embedding Models

Recent Activity

liked a dataset 4 days ago
Abhishekcr448/Hinglish-Everyday-Conversations-1M
liked a dataset 4 days ago
BhabhaAI/Cross-Hindi-Hinglish-chat
liked a dataset 4 days ago
KathirKs/fineweb-edu-hindi
View all activity

Organizations

None yet

Articles 1

Article
1

Visualize the encoding process in HuggingFace Byte-Pair Encoding tokenizer

Papers 2

arxiv:2601.20843
arxiv:2512.03887

models 3

SauravP97/tiny-stories-19M

Text Generation • 19.3M • Updated 18 days ago • 67 • 1

SauravP97/tiny-stories-3M

Text Generation • 3.65M • Updated Feb 5 • 180 • 1

SauravP97/toy-transformer-shakespeare-work

Updated Jan 11 • 2 • 1

datasets 1

SauravP97/tiny-stories-tokenized-bpe

Viewer • Updated 5 days ago • 2.14M • 16 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs