Ar4l/8L_768H_1536I_8h_0.1d___6400S_8g_1024b_0.0005lr__debug Fill-Mask • 42M • Updated Sep 15, 2024 • 2
AISE-TUDelft/100M_babylm_ascii__SPM-BPE_6144__8000S__32g__256b__0.00125lr12L_1024H_2048I_16h__debertav2 Fill-Mask • 0.1B • Updated Sep 13, 2024 • 2
AISE-TUDelft/10M_fwedu_0.001_ascii__SPM-BPE_6144__8000S__32g__256b__0.00125lr12L_1024H_2048I_16h__debertav2 Fill-Mask • 0.1B • Updated Sep 13, 2024 • 4