Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
10
Saurav Prateek
SauravP97
Follow
0 followers
·
6 following
http://sauravp97.github.io/
SauravP97
saurav-prateek-7b2096140
AI & ML interests
LLM, Deep Neural Nets, Embedding Models
Recent Activity
liked
a dataset
4 days ago
Abhishekcr448/Hinglish-Everyday-Conversations-1M
liked
a dataset
4 days ago
BhabhaAI/Cross-Hindi-Hinglish-chat
liked
a dataset
4 days ago
KathirKs/fineweb-edu-hindi
View all activity
Organizations
None yet
Articles
1
Article
1
Visualize the encoding process in HuggingFace Byte-Pair Encoding tokenizer
Papers
2
arxiv:
2601.20843
arxiv:
2512.03887
models
3
Sort: Recently updated
SauravP97/tiny-stories-19M
Text Generation
•
19.3M
•
Updated
18 days ago
•
67
•
1
SauravP97/tiny-stories-3M
Text Generation
•
3.65M
•
Updated
Feb 5
•
180
•
1
SauravP97/toy-transformer-shakespeare-work
Updated
Jan 11
•
2
•
1
datasets
1
SauravP97/tiny-stories-tokenized-bpe
Viewer
•
Updated
5 days ago
•
2.14M
•
16
•
1