Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Hanxu Hu's picture
2 15 5

Hanxu Hu PRO

HanxuHU
Symbol-LLM's profile picture shawnxzhu's profile picture jvamvas's profile picture
·
https://hanxuhu.github.io/
  • huhanxu1
  • hanxuhu
  • hanxu-hu-746952221

AI & ML interests

LLM, NLP

Recent Activity

authored a paper about 19 hours ago
DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning
authored a paper about 19 hours ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation
upvoted a paper 1 day ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation
View all activity

Organizations

University of Zurich, Department of Computational Linguistics's profile picture Learn new languages through RL's profile picture

HanxuHU 's models 14

HanxuHU/Qwen2-0.5B-SFT

Updated Mar 31, 2025

HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_seq_it2_llama70b

Updated Jun 22, 2024 • 8 • 1

HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_base_ours_new_llama70b

Text Generation • Updated Jun 21, 2024 • 3 • 1

HanxuHU/sit_all_models

Updated Jun 20, 2024

HanxuHU/flancot_full_it1

Updated May 30, 2024

HanxuHU/sharegpt_filter

Updated May 29, 2024

HanxuHU/files

Updated May 13, 2024

HanxuHU/my-mLLMs

Updated May 10, 2024

HanxuHU/multilingual_mmmu

Updated Apr 3, 2024

HanxuHU/alpaca_topk_indices

Updated Mar 13, 2024

HanxuHU/mt5-small-finetuned-wikitext2

Updated Dec 25, 2022

HanxuHU/t5-small-finetuned-wikitext2

Updated Dec 8, 2022

HanxuHU/distilroberta-base-finetuned-wikitext2

Updated Dec 3, 2022

HanxuHU/distilgpt2-finetuned-wikitext2

Updated Dec 2, 2022
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs