arxiv:2406.15927
Shreshth Malik
s-a-malik
models 18
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-mean-token
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only-mean-token
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only
Text Generation • 8B • Updated
• 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-last-5
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-0.4-last-5
Text Generation • 8B • Updated
• 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-last-5
Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-1.5B-Embedding-Entropy-RL-1
Text Generation • 2B • Updated
• 6
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-0.025
Text Generation • 8B • Updated
• 2