Agents#
AutoGen AgentChat provides a set of preset Agents, each with variations in how an agent might respond to messages. All agents share the following attributes and methods:
name
: The unique name of the agent.description
: The description of the agent in text.on_messages()
: Send the agent a sequence ofChatMessage
get aResponse
. It is important to note that agents are expected to be stateful and this method is expected to be called with new messages, not the complete history.on_messages_stream()
: Same ason_messages()
but returns an iterator ofAgentEvent
orChatMessage
followed by aResponse
as the last item.on_reset()
: Reset the agent to its initial state.
See autogen_agentchat.messages
for more information on AgentChat message types.
Assistant Agent#
AssistantAgent
is a built-in agent that
uses a language model and has the ability to use tools.
from autogen_agentchat.agents import AssistantAgent
from autogen_agentchat.messages import TextMessage
from autogen_agentchat.ui import Console
from autogen_core import CancellationToken
from autogen_ext.models.openai import OpenAIChatCompletionClient
# Define a tool that searches the web for information.
async def web_search(query: str) -> str:
"""Find information on the web"""
return "AutoGen is a programming framework for building multi-agent applications."
# Create an agent that uses the OpenAI GPT-4o model.
model_client = OpenAIChatCompletionClient(
model="gpt-4o",
# api_key="YOUR_API_KEY",
)
agent = AssistantAgent(
name="assistant",
model_client=model_client,
tools=[web_search],
system_message="Use tools to solve tasks.",
)
Getting Responses#
We can use the on_messages()
method to get the agent response to a given message.
async def assistant_run() -> None:
response = await agent.on_messages(
[TextMessage(content="Find information on AutoGen", source="user")],
cancellation_token=CancellationToken(),
)
print(response.inner_messages)
print(response.chat_message)
# Use asyncio.run(assistant_run()) when running in a script.
await assistant_run()
[ToolCallRequestEvent(source='assistant', models_usage=RequestUsage(prompt_tokens=61, completion_tokens=15), content=[FunctionCall(id='call_hqVC7UJUPhKaiJwgVKkg66ak', arguments='{"query":"AutoGen"}', name='web_search')]), ToolCallExecutionEvent(source='assistant', models_usage=None, content=[FunctionExecutionResult(content='AutoGen is a programming framework for building multi-agent applications.', call_id='call_hqVC7UJUPhKaiJwgVKkg66ak')])]
source='assistant' models_usage=RequestUsage(prompt_tokens=92, completion_tokens=14) content='AutoGen is a programming framework designed for building multi-agent applications.'
The call to the on_messages()
method
returns a Response
that contains the agent’s final response in the chat_message
attribute,
as well as a list of inner messages in the inner_messages
attribute,
which stores the agent’s “thought process” that led to the final response.
Note
It is important to note that on_messages()
will update the internal state of the agent – it will add the messages to the agent’s
history. So you should call this method with new messages.
You should not repeatedly call this method with the same messages or the complete history.
Note
Unlike in v0.2 AgentChat, the tools are executed by the same agent directly within
the same call to on_messages()
.
By default, the agent will return the result of the tool call as the final response.
Streaming Messages#
We can also stream each message as it is generated by the agent by using the
on_messages_stream()
method,
and use Console
to print the messages
as they appear to the console.
async def assistant_run_stream() -> None:
# Option 1: read each message from the stream (as shown in the previous example).
# async for message in agent.on_messages_stream(
# [TextMessage(content="Find information on AutoGen", source="user")],
# cancellation_token=CancellationToken(),
# ):
# print(message)
# Option 2: use Console to print all messages as they appear.
await Console(
agent.on_messages_stream(
[TextMessage(content="Find information on AutoGen", source="user")],
cancellation_token=CancellationToken(),
)
)
# Use asyncio.run(assistant_run_stream()) when running in a script.
await assistant_run_stream()
---------- assistant ----------
[FunctionCall(id='call_fSp5iTGVm2FKw5NIvfECSqNd', arguments='{"query":"AutoGen information"}', name='web_search')]
[Prompt tokens: 61, Completion tokens: 16]
---------- assistant ----------
[FunctionExecutionResult(content='AutoGen is a programming framework for building multi-agent applications.', call_id='call_fSp5iTGVm2FKw5NIvfECSqNd')]
---------- assistant ----------
AutoGen is a programming framework designed for building multi-agent applications. If you need more detailed information or specific aspects about AutoGen, feel free to ask!
[Prompt tokens: 93, Completion tokens: 32]
---------- Summary ----------
Number of inner messages: 2
Total prompt tokens: 154
Total completion tokens: 48
Duration: 4.30 seconds
The on_messages_stream()
method
returns an asynchronous generator that yields each inner message generated by the agent,
with the final item being the response message in the chat_message
attribute.
From the messages, you can observe that the assistant agent utilized the web_search
tool to
gather information and responded based on the search results.
Using Tools#
Large Language Models (LLMs) are typically limited to generating text or code responses. However, many complex tasks benefit from the ability to use external tools that perform specific actions, such as fetching data from APIs or databases.
To address this limitation, modern LLMs can now accept a list of available tool schemas (descriptions of tools and their arguments) and generate a tool call message. This capability is known as Tool Calling or Function Calling and is becoming a popular pattern in building intelligent agent-based applications. Refer to the documentation from OpenAI and Anthropic for more information about tool calling in LLMs.
In AgentChat, the AssistantAgent
can use tools to perform specific actions.
The web_search
tool is one such tool that allows the assistant agent to search the web for information.
A custom tool can be a Python function or a subclass of the BaseTool
.
By default, when AssistantAgent
executes a tool,
it will return the tool’s output as a string in ToolCallSummaryMessage
in its response.
If your tool does not return a well-formed string in natural language, you
can add a reflection step to have the model summarize the tool’s output,
by setting the reflect_on_tool_use=True
parameter in the AssistantAgent
constructor.
Langchain Tools#
In addition to custom tools, you can also use tools from the Langchain library
by wrapping them in LangChainToolAdapter
.
import pandas as pd
from autogen_ext.tools.langchain import LangChainToolAdapter
from langchain_experimental.tools.python.tool import PythonAstREPLTool
df = pd.read_csv("https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/titanic.csv")
tool = LangChainToolAdapter(PythonAstREPLTool(locals={"df": df}))
model_client = OpenAIChatCompletionClient(model="gpt-4o")
agent = AssistantAgent(
"assistant", tools=[tool], model_client=model_client, system_message="Use the `df` variable to access the dataset."
)
await Console(
agent.on_messages_stream(
[TextMessage(content="What's the average age of the passengers?", source="user")], CancellationToken()
)
)
---------- assistant ----------
[FunctionCall(id='call_BEYRkf53nBS1G2uG60wHP0zf', arguments='{"query":"df[\'Age\'].mean()"}', name='python_repl_ast')]
[Prompt tokens: 111, Completion tokens: 22]
---------- assistant ----------
[FunctionExecutionResult(content='29.69911764705882', call_id='call_BEYRkf53nBS1G2uG60wHP0zf')]
---------- assistant ----------
29.69911764705882
---------- Summary ----------
Number of inner messages: 2
Total prompt tokens: 111
Total completion tokens: 22
Duration: 0.62 seconds
Response(chat_message=ToolCallSummaryMessage(source='assistant', models_usage=None, content='29.69911764705882', type='ToolCallSummaryMessage'), inner_messages=[ToolCallRequestEvent(source='assistant', models_usage=RequestUsage(prompt_tokens=111, completion_tokens=22), content=[FunctionCall(id='call_BEYRkf53nBS1G2uG60wHP0zf', arguments='{"query":"df[\'Age\'].mean()"}', name='python_repl_ast')], type='ToolCallRequestEvent'), ToolCallExecutionEvent(source='assistant', models_usage=None, content=[FunctionExecutionResult(content='29.69911764705882', call_id='call_BEYRkf53nBS1G2uG60wHP0zf')], type='ToolCallExecutionEvent')])
Using Model Context#
AssistantAgent
has a model_context
parameter that can be used to pass in a ChatCompletionContext
object. This allows the agent to use different model contexts, such as
BufferedChatCompletionContext
to
limit the context sent to the model.
By default, AssistantAgent
uses
the UnboundedChatCompletionContext
which sends the full conversation history to the model. To limit the context
to the last n
messages, you can use the BufferedChatCompletionContext
.
from autogen_core.model_context import BufferedChatCompletionContext
# Create an agent that uses only the last 5 messages in the context to generate responses.
agent = AssistantAgent(
name="assistant",
model_client=model_client,
tools=[web_search],
system_message="Use tools to solve tasks.",
model_context=BufferedChatCompletionContext(buffer_size=5), # Only use the last 5 messages in the context.
)
User Proxy Agent#
UserProxyAgent
is a built-in agent that
provides one way for a user to intervene in the process. This agent will put the team in a temporary blocking state, and thus any exceptions or runtime failures while in the blocked state will result in a deadlock. It is strongly advised that this agent be coupled with a timeout mechanic and that all errors and exceptions emanating from it are handled.
from autogen_agentchat.agents import UserProxyAgent
async def user_proxy_run() -> None:
user_proxy_agent = UserProxyAgent("user_proxy")
response = await user_proxy_agent.on_messages(
[TextMessage(content="What is your name? ", source="user")], cancellation_token=CancellationToken()
)
print(f"Your name is {response.chat_message.content}")
# Use asyncio.run(user_proxy_run()) when running in a script.
await user_proxy_run()
The User Proxy agent is ideally used for on-demand human-in-the-loop interactions for scenarios such as Just In Time approvals, human feedback, alerts, etc. For slower user interactions, consider terminating a team using a termination condition and start another one from
run()
or run_stream()
with another message.
Other Preset Agents#
The following preset agents are available:
CodeExecutorAgent
: An agent that can execute code.OpenAIAssistantAgent
: An agent that is backed by an OpenAI Assistant, with ability to use custom tools.MultimodalWebSurfer
: A multi-modal agent that can search the web and visit web pages for information.FileSurfer
: An agent that can search and browse local files for information.VideoSurfer
: An agent that can watch videos for information.
Next Step#
Having explored the usage of the AssistantAgent
, we can now proceed to the next section to learn about the teams feature in AgentChat.