Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Reasoning_eval
university
https://chtholly17.github.io/
Activity Feed
Follow
7
AI & ML interests
None defined yet.
Recent Activity
dwenlong
authored
a paper
about 14 hours ago
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
dwenlong
authored
a paper
about 14 hours ago
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
dwenlong
authored
a paper
about 14 hours ago
Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning
View all activity
Team members
5
models
18
Sort: Recently updated
ReasoningEval/huatuo_sft_m23k_grpo_qwen3-14b
15B
•
Updated
Nov 3, 2025
ReasoningEval/huatuo_sft_m23k_grpo_qwen3-8b
8B
•
Updated
Nov 3, 2025
ReasoningEval/huatuo_sft_m23k_grpo_llama31-8b
8B
•
Updated
Nov 3, 2025
ReasoningEval/openr1_sft_PRIME_grpo_qwen3-14b
15B
•
Updated
Nov 3, 2025
•
1
ReasoningEval/openr1_sft_PRIME_grpo_qwen3-8b
8B
•
Updated
Nov 3, 2025
ReasoningEval/openr1_sft_PRIME_grpo_llama31-8b
8B
•
Updated
Nov 3, 2025
ReasoningEval/openr1_sft_qwen3-8b
8B
•
Updated
Oct 29, 2025
•
2
ReasoningEval/openr1_sft_qwen3-14b
425k
•
Updated
Oct 28, 2025
•
3
ReasoningEval/openr1_sft_llama31-8b
8B
•
Updated
Oct 28, 2025
•
2
ReasoningEval/huatuo_sft_qwen3-8b
8B
•
Updated
Oct 28, 2025
•
3
View 18 models
datasets
0
None public yet