Gentopia.AI
Gentopia is a lightweight and extensible framework for LLM-driven Agents and ALM research. It provides essential components to build, test and evaluate agents. At its core, Gentopia aims to assemble an agent with a single config, thus minimizing your effort in building, tuning, and sharing agents.
Gentopia also maintains GentPool, an agent platform for sharing specialized agents, where your agent can interact with other agents by cloning, hierarchical plug-in, or sharing an environment. GentPool comes with a unique agent benchmark for holistic evaluation.
Motivation
Agent practitioners are starting to realize how difficult it is to tune a "well-rounded" agent with tons of tools or instructions in a single layer. Recent studies like TinyStories, Specializing Reasoning, Let's Verify Step by Step, ReWOO, etc. also point us towards an intuitive yet undervalued direction:
An LLM is more capable if you create a context/distribution shift specialized to some target tasks.
Sadly, there is no silver bullet for agent specialization. For example, you can:
- Simply add "Let's think step by step." to your prompt for more accurate Math QA.
- Give a few-shot exemplar in your prompt to guide a better reasoning trajectory for novel plotting.
- Supervised fine-tune (SFT) your 70B llama2 like this to match the reasoning of 175B GPT-3.5.
- Tune your agent paradigm like this demo to easily halve the execution time for Search & Summarize.
- And more ...
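The first two options in the list above are plain prompt edits, so they can be sketched without any framework. The snippet below is a minimal illustration only: the helper names `zero_shot_cot` and `few_shot` and the "Q:/A:" layout are assumptions made for this example, not part of Gentopia's API.

```python
from __future__ import annotations

# Trigger phrase quoted in the list above.
COT_TRIGGER = "Let's think step by step."


def zero_shot_cot(question: str) -> str:
    """Append the step-by-step trigger phrase to a question (hypothetical helper)."""
    return f"Q: {question}\nA: {COT_TRIGGER}"


def few_shot(question: str, exemplars: list[tuple[str, str]]) -> str:
    """Prepend worked (question, answer) exemplars to a question (hypothetical helper)."""
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in exemplars)
    return f"{shots}\n\nQ: {question}\nA:"


if __name__ == "__main__":
    print(zero_shot_cot("What is 17 * 24?"))
    print()
    print(few_shot("What is 9 + 5?", [("What is 2 + 3?", "2 + 3 = 5. The answer is 5.")]))
```

Either string would then be sent to whichever LLM client you use; the point is only that this kind of specialization lives entirely in the prompt text.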
Wouldn't it be beautiful if people shared their efforts in specialized intelligence, allowing others to reproduce, build on, or interact with them? This belief inspired us to build Gentopia, which is designed for agent specialization, sharing, and interaction, so that these efforts stack up into collective growth towards greater intelligence.
Core Features
- Config-driven agent assembling and chat.
- A large number of prebuilt agent types, LLM clients, tools, memory systems, and more.
- Lightweight and highly extensible implementation of essential components.
- Aligning with state-of-the-art AI research.
- Enabling multi-agent interactions.
- Unique platform of agent zoo and eval benchmark.
Highlight Topics
- Agent Templates
- Hierarchical Agents
- Unique Agent Benchmark
- Open LLM Support
- High-Performance Memory
News
[Oct 7] The companion paper was accepted by #EMNLP 23. See you in Singapore!
[Aug 6] We've submitted our work as a paper for EMNLP 2023. Special thanks to the research team and professors from NCSU, GMU, and CMU for collaboration :)