OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper
• 2402.01739
• Published
• 28
Rethinking Interpretability in the Era of Large Language Models
Paper
• 2402.01761
• Published
• 23
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper
• 2402.03620
• Published
• 117
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
Model
Paper
• 2402.07827
• Published
• 48
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published
• 109
Generative Representational Instruction Tuning
Paper
• 2402.09906
• Published
• 54
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper
• 2402.10193
• Published
• 21
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
• 2312.15166
• Published
• 61
SPAR: Personalized Content-Based Recommendation via Long Engagement
Attention
Paper
• 2402.10555
• Published
• 35
Learning to Learn Faster from Human Feedback with Language Model
Predictive Control
Paper
• 2402.11450
• Published
• 22
Paper
• 2402.12219
• Published
• 17
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Paper
• 2402.16840
• Published
• 25
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
• 2402.17764
• Published
• 627
Can large language models explore in-context?
Paper
• 2403.15371
• Published
• 33
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
• 2404.14619
• Published
• 126
FLAME: Factuality-Aware Alignment for Large Language Models
Paper
• 2405.01525
• Published
• 29
Octopus v4: Graph of language models
Paper
• 2404.19296
• Published
• 118
KAN: Kolmogorov-Arnold Networks
Paper
• 2404.19756
• Published
• 116
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper
• 2405.00732
• Published
• 122
From Loops to Oops: Fallback Behaviors of Language Models Under
Uncertainty
Paper
• 2407.06071
• Published
• 7
Human-like Episodic Memory for Infinite Context LLMs
Paper
• 2407.09450
• Published
• 62
LLMs + Persona-Plug = Personalized LLMs
Paper
• 2409.11901
• Published
• 35