Post

Agents framework for LLM + AgentGen looks like a nice framework

๐—ก๐—ฒ๐˜„ ๐˜ƒ๐—ฒ๐—ฟ๐˜€๐—ถ๐—ผ๐—ป ๐—ผ๐—ณ ๐—ผ๐˜‚๐—ฟ ๐—”๐—ด๐—ฒ๐—ป๐˜๐˜€ ๐—ณ๐—ฟ๐—ฎ๐—บ๐—ฒ๐˜„๐—ผ๐—ฟ๐—ธ ๐Ÿ™Œ

Agents 2.0 is out, and itโ€™s already the best-performing agent framework using an open model! Top 1 of open models on GAIA, top 4 overall. Iโ€™m really proud to have worked on this ๐Ÿ˜ƒ

Itโ€™s also:

  • โœจ ๐—ฆ๐—ถ๐—บ๐—ฝ๐—น๐—ฒ๐—ฟ: prompt, tools, and attributes are accessible
  • ๐Ÿงฉ ๐— ๐—ผ๐—ฑ๐˜‚๐—น๐—ฎ๐—ฟ: use any LLM. Llama-3-70B-Instruct is ๐Ÿ”ฅ
  • ๐Ÿ’ช ๐—ฃ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐˜ w/ React Agents

You can access it on Hugging Face Transformers โ€˜mainโ€™ branch (v4.41.0 lands this week)

Read the announcement blog post here ๐Ÿ‘‰ https://huggingface.co/blog/agents

If youโ€™ve never played with Agents, the following guide gets you up to speed as to whatโ€™s possible with them: ๐Ÿ‘‰ https://huggingface.co/docs/transformers/main/en/agents


๐Ÿ’ก โ€œPlanningโ€ is super important in agentic pipelines as it determines the entire trajectory of the agents.

 AgentGen

AgentGen looks like a nice framework that can help automate and simplify this.

Paper ๐Ÿ‘‰ https://arxiv.org/abs/2408.00764

โ›ณ What is AgentGen?

AgentGen is a framework designed to automate and improve the planning capabilities of LLM-based agents, it automates the generation of diverse environments and planning tasks, allowing for more effective agent training.

It addresses the limitations of manual environment creation by using automated methods to create a wide variety of scenarios and tasks with varying difficulty levels.

โ›ณ Key Features:

  • ๐Ÿ‘‰ AgentGen uses an inspiration corpus (LIMA dataset) composed of domain-specific text segments to generate a wide range of environment specifications. This approach covers numerous scenarios and domains, enhancing the training landscape for agents.
  • ๐Ÿ‘‰ The authors propose Bidirectional Evolution (BI-EVOL) which creates planning tasks with a smooth difficulty curve by evolving tasks in both simpler and more complex directions. This helps facilitate the LLMsโ€™ gradual acquisition of planning skills.
  • ๐Ÿ‘‰ Uses high-quality trajectory data (sequences of action-observation pairs) to instruction-tune LLMs, improving their decision-making and planning capabilities.

AgentGen-tuned models, such as Llama-3 8B, show significant performance improvements, surpassing models like GPT-3.5 and even outperforming GPT-4 in certain tasks!

This post is licensed under CC BY 4.0 by the author.