Updated open-sci-ref baselines. Re-training without dropout. Re-training on DCLM, FineWeb-Edu, Nemotron, HPLT-2, Pile. Further ref datasets included.
AI & ML interests
Researching and building foundation models with improved generalization and reasoning. A LAION & friends spin-off for open-sourcing foundation models with strong generalization and reasoning, including the datasets necessary for their creation, to serve as a common, open, reproducible basis for further research experiments.
models 93
open-sci/open-sci-ref-v0.02-0.4b-commoncorpus-300B-4096
0.4B • Updated
open-sci/open-sci-ref-v0.02-0.13b-commoncorpus-300B-4096
0.1B • Updated
open-sci/open-sci-ref-v0.02-1.7b-dclm-300B-4096
2B • Updated
open-sci/open-sci-ref-v0.02-1.3b-dclm-300B-4096
1B • Updated
open-sci/open-sci-ref-v0.02-0.4b-dclm-300B-4096
0.4B • Updated
open-sci/open-sci-ref-v0.02-0.13b-dclm-300B-4096
0.1B • Updated
open-sci/open-sci-ref-v0.02-0.4b-slimpajama-300B-4096
0.4B • Updated
open-sci/open-sci-ref-v0.02-0.13b-slimpajama-300B-4096
0.1B • Updated
open-sci/open-sci-ref-v0.02-0.4b-pile-300B-4096
0.4B • Updated
open-sci/open-sci-ref-v0.02-0.13b-pile-300B-4096
0.1B • Updated
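The repository names listed above appear to follow a common pattern, so a short sketch of how they could be split into fields may help when filtering the collection. This is an illustration under stated assumptions, not a documented naming scheme: reading `300B` as the training token budget and `4096` as the context length is an interpretation of the names, and `RefModel` and `parse_repo_id` are hypothetical helpers introduced only for this example.

```python
import re
from typing import NamedTuple, Optional


class RefModel(NamedTuple):
    """Fields recovered from a repository name (hypothetical structure)."""
    version: str   # e.g. "0.02"
    size: str      # e.g. "1.7b", as written in the name
    dataset: str   # e.g. "dclm"
    tokens: str    # e.g. "300B" (assumed: training token budget)
    context: int   # e.g. 4096 (assumed: context length)


# Pattern inferred from the names in the listing; it is an assumption
# and may not cover every repository in the organization.
_PATTERN = re.compile(
    r"^open-sci/open-sci-ref-v(?P<version>[\d.]+)-(?P<size>[\d.]+b)-"
    r"(?P<dataset>[a-z0-9_-]+?)-(?P<tokens>\d+B)-(?P<context>\d+)$"
)


def parse_repo_id(repo_id: str) -> Optional[RefModel]:
    """Return the parsed fields, or None if the name does not fit the pattern."""
    m = _PATTERN.match(repo_id)
    if m is None:
        return None
    return RefModel(
        version=m["version"],
        size=m["size"],
        dataset=m["dataset"],
        tokens=m["tokens"],
        context=int(m["context"]),
    )


if __name__ == "__main__":
    names = [
        "open-sci/open-sci-ref-v0.02-1.7b-dclm-300B-4096",
        "open-sci/open-sci-ref-v0.02-0.4b-pile-300B-4096",
    ]
    for name in names:
        print(parse_repo_id(name))
```

Returning `None` for names that do not fit keeps the helper safe to run over the whole model list, where other naming schemes may exist.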