LLaMA-MoE

https://github.com/pjlab-sys4nlp/llama-moe

Team members: Tong Zhu, Xiaoye Qu, Jiacheng Ruan, Daize Dong, tongjingqi(SII), Xuyang Hu

llama-moe's models (8)

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

8B • Updated Dec 3, 2024 • 8 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

8B • Updated Dec 3, 2024 • 309 • 6

llama-moe/LLaMA-MoE-v1-3_0B-2_16

Text Generation • Updated Jun 25, 2024 • 773 • 11

llama-moe/LLaMA-MoE-v1-3_5B-4_16

Text Generation • Updated Jun 25, 2024 • 808 • 16

llama-moe/LLaMA-MoE-v1-3_0B-2_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 13 • 2

llama-moe/LLaMA-MoE-v1-3_5B-2_8-sft

Text Generation • 7B • Updated Jun 25, 2024 • 35 • 3

llama-moe/LLaMA-MoE-v1-3_5B-4_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 6 • 1

llama-moe/LLaMA-MoE-v1-3_5B-2_8

Text Generation • Updated Jun 25, 2024 • 1.51k • 15