Agents
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper
• 2310.08740
• Published
• 15
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Paper
• 2310.12823
• Published
• 36
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
Paper
• 2308.10848
• Published
• 1
CLEX: Continuous Length Extrapolation for Large Language Models
Paper
• 2310.16450
• Published
• 10
An Early Evaluation of GPT-4V(ision)
Paper
• 2310.16534
• Published
• 22
Personas as a Way to Model Truthfulness in Language Models
Paper
• 2310.18168
• Published
• 5
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation
Paper
• 2311.00272
• Published
• 11
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
Paper
• 2311.02262
• Published
• 14
Ultra-Long Sequence Distributed Transformer
Paper
• 2311.02382
• Published
• 6
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
• 2311.02303
• Published
• 12
Prompt Engineering a Prompt Engineer
Paper
• 2311.05661
• Published
• 23
Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses
Paper
• 2312.00763
• Published
• 23
Merlin: Empowering Multimodal LLMs with Foresight Minds
Paper
• 2312.00589
• Published
• 27
Instruction-tuning Aligns LLMs to the Human Brain
Paper
• 2312.00575
• Published
• 15
DeepCache: Accelerating Diffusion Models for Free
Paper
• 2312.00858
• Published
• 23
PathFinder: Guided Search over Multi-Step Reasoning Paths
Paper
• 2312.05180
• Published
• 10
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
• 2312.10003
• Published
• 44
Faithful Persona-based Conversational Dataset Generation with Large Language Models
Paper
• 2312.10007
• Published
• 11
Supervised Knowledge Makes Large Language Models Better In-context Learners
Paper
• 2312.15918
• Published
• 9
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
Paper
• 2401.02669
• Published
• 17
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
• 2401.05033
• Published
• 18
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper
• 2401.02330
• Published
• 18
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Paper
• 2401.16158
• Published
• 20
Weaver: Foundation Models for Creative Writing
Paper
• 2401.17268
• Published
• 45
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper
• 2402.05140
• Published
• 23
More Agents Is All You Need
Paper
• 2402.05120
• Published
• 57
Premise Order Matters in Reasoning with Large Language Models
Paper
• 2402.08939
• Published
• 28
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Paper
• 2402.09727
• Published
• 38
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published
• 109
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss
Paper
• 2402.10790
• Published
• 42
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper
• 2402.17753
• Published
• 19
GAIA: a benchmark for General AI Assistants
Paper
• 2311.12983
• Published
• 245
Learning to Decode Collaboratively with Multiple Language Models
Paper
• 2403.03870
• Published
• 21
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
Paper
• 2403.08715
• Published
• 21
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Paper
• 2403.12881
• Published
• 18
Evolutionary Optimization of Model Merging Recipes
Paper
• 2403.13187
• Published
• 58
LLM Agent Operating System
Paper
• 2403.16971
• Published
• 73
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Paper
• 2402.15538
• Published
• 6
LLMs Simulate Big Five Personality Traits: Further Evidence
Paper
• 2402.01765
• Published
LLM Agents in Interaction: Measuring Personality Consistency and Linguistic Alignment in Interacting Populations of Large Language Models
Paper
• 2402.02896
• Published
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality
Paper
• 2402.14679
• Published
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper
• 2403.02246
• Published
• 1
LLM Multi-Agent Systems: Challenges and Open Problems
Paper
• 2402.03578
• Published
• 1
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
• 2402.14034
• Published
• 13
Social Skill Training with Large Language Models
Paper
• 2404.04204
• Published
• 15
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Paper
• 2404.18796
• Published
• 71
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Paper
• 2405.01535
• Published
• 124
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
• 2406.04692
• Published
• 59
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
• 2406.18082
• Published
• 48
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Paper
• 2406.19741
• Published
• 60
LiteSearch: Efficacious Tree Search for LLM
Paper
• 2407.00320
• Published
• 40
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
• 2407.01489
• Published
• 65
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Paper
• 2407.04363
• Published
• 34
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Paper
• 2407.03958
• Published
• 21
LAMBDA: A Large Model Based Data Agent
Paper
• 2407.17535
• Published
• 37
PERSONA: A Reproducible Testbed for Pluralistic Alignment
Paper
• 2407.17387
• Published
• 20
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Paper
• 2408.01584
• Published
• 10
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
Paper
• 2408.03615
• Published
• 31
Generating novel experimental hypotheses from language models: A case study on cross-dative generalization
Paper
• 2408.05086
• Published
• 5
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper
• 2408.06292
• Published
• 128
Benchmarking Agentic Workflow Generation
Paper
• 2410.07869
• Published
• 29
marcelbinz/Llama-3.1-Centaur-70B
Text Generation
• 71B • Updated
• 744
• 81
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
• 2501.04227
• Published
• 95