AutoGen multi-agent framework: AutoGen is a framework for creating multi-agent AI applications that can act autonomously or work alongside humans

1428 files in total

cs: 379

py: 368

md: 146

Uploaded 2025-01-21 22:07:38, 21.99 MB ZIP

Archive contents (1428 files):

ATTRIBUTION 1KB

NuGet.config 224B

RestoreInteractive.config 222B

OpenAIMessageTests.cs 33KB

OpenAIMessageTests.cs 30KB

ChatRequestMessageTests.cs 25KB

ChatCompletionClientAgentTests.cs 21KB

FunctionCallTemplate.cs 18KB

GeminiMessageConnector.cs 17KB

OpenAIChatRequestMessageConnector.cs 16KB

GrpcAgentWorker.cs 16KB

OpenAIChatAgentTest.cs 16KB

GeminiMessageTests.cs 15KB

OpenAIChatRequestMessageConnector.cs 15KB

Example07_Dynamic_GroupChat_Calculate_Fibonacci.cs 15KB

RolePlayOrchestratorTests.cs 15KB

OpenAIChatAgentTest.cs 15KB

GrpcGateway.cs 14KB

Agent.cs 14KB

Anthropic_Agent_With_Prompt_Caching.cs 14KB

DistributedApplicationExtension.cs 14KB

MistralChatMessageConnector.cs 13KB

AzureAIInferenceChatRequestMessageConnector.cs 13KB

FunctionCallGenerator.cs 13KB

AnthropicTestUtils.cs 12KB

MistralClientTests.cs 12KB

SemanticKernelAgentTest.cs 11KB

GeminiAgentTests.cs 11KB

AnthropicMessageConnector.cs 11KB

RolePlayToolCallOrchestratorTests.cs 11KB

SemanticKernelChatMessageContentConnector.cs 10KB

DocumentCommentExtension.cs 10KB

GeminiChatAgent.cs 10KB

GPTAgentTest.cs 10KB

MistralClientAgentTests.cs 10KB

Example04_Dynamic_GroupChat_Coding_Task.cs 10KB

GithubService.cs 10KB

AnthropicClientAgentTest.cs 10KB

AnthropicClientTest.cs 10KB

OllamaAgentTests.cs 9KB

FunctionCallMiddleware.cs 9KB

MessageExtension.cs 9KB

MathClassTest.cs 8KB

ChatCompletionRequest.cs 8KB

MathClassTest.cs 8KB

OpenAIChatAgent.cs 8KB

MessageExtension.cs 8KB

InteractiveService.cs 8KB

AgentWorker.cs 8KB

OpenAIChatAgent.cs 8KB

ChatCompletionsClientAgent.cs 8KB

AnthropicClient.cs 8KB

SingleAgentTest.cs 8KB

Example03_Agent_FunctionCall.cs 7KB

GroupChat.cs 7KB

AzureService.cs 7KB

OllamaAgent.cs 7KB

OllamaMessageConnector.cs 7KB

GrpcAgentWorkerHostBuilderExtension.cs 7KB

MiddlewareAgentCodeSnippet.cs 7KB

OllamaMessageTests.cs 7KB

HostBuilderExtensions.cs 7KB

ConversableAgent.cs 7KB

GithubWebHookProcessor.cs 7KB

FSM_Group_Chat.cs 6KB

MistralClient.cs 6KB

Example17_ReActAgent.cs 6KB

MiddlewareExtension.cs 6KB

KernelFunctionMiddlewareTests.cs 6KB

AgentExtension.cs 6KB

DotnetInteractiveFunction.cs 6KB

Example12_TwoAgent_Fill_Application.cs 6KB

FunctionCallCodeSnippet.cs 6KB

MiddlewareTest.cs 6KB

ModelReplyOptions.cs 6KB

OpenAICodeSnippet.cs 6KB

OpenAIChatCompletionService.cs 6KB

ServiceCollectionChatCompletionExtensions.cs 6KB

AgentMessenger.cs 5KB

AgentExtensions.cs 5KB

Example05_Dalle_And_GPT4V.cs 5KB

FunctionAttribute.cs 5KB

Function_Call_With_Gemini.cs 5KB

GroupChatExtension.cs 5KB

MistralClientAgent.cs 5KB

Graph.cs 5KB

CreateAnAgent.cs 5KB

HelloAppHostIntegrationTests.cs 5KB

SemanticKernelHostingExtensions.cs 5KB

RegistryGrain.cs 5KB

GPTAgent.cs 5KB

MiddlewareAgent.cs 4KB

SemanticKernelAgent.cs 4KB

VertexGeminiClientTests.cs 4KB

AspireHostingExtensions.cs 4KB

GoogleGeminiClientTests.cs 4KB

RolePlayToolCallOrchestrator.cs 4KB

SemanticKernelCodeSnippet.cs 4KB

OllamaReplyOptions.cs 4KB

AnthropicClientAgent.cs 4KB


# Magentic-One

> [!IMPORTANT]
> **Note (December 22nd, 2024):** We recommend using the [Magentic-One API](https://github.com/microsoft/autogen/tree/main/python/packages/autogen-ext/src/autogen_ext/teams/magentic_one.py) as the preferred way to interact with Magentic-One. The API provides a more streamlined and robust interface for integrating Magentic-One into your projects.

> [!CAUTION]
> Using Magentic-One involves interacting with a digital world designed for humans, which carries inherent risks. To minimize these risks, consider the following precautions:
>
> 1. **Use Containers**: Run all tasks in Docker containers to isolate the agents and prevent direct system attacks.
> 2. **Virtual Environment**: Use a virtual environment to run the agents and prevent them from accessing sensitive data.
> 3. **Monitor Logs**: Closely monitor logs during and after execution to detect and mitigate risky behavior.
> 4. **Human Oversight**: Run the examples with a human in the loop to supervise the agents and prevent unintended consequences.
> 5. **Limit Access**: Restrict the agents' access to the internet and other resources to prevent unauthorized actions.
> 6. **Safeguard Data**: Ensure that the agents do not have access to sensitive data or resources that could be compromised. Do not share sensitive information with the agents.
>
> Be aware that agents may occasionally attempt risky actions, such as recruiting humans for help or accepting cookie agreements without human involvement. Always ensure agents are monitored and operate within a controlled environment to prevent unintended consequences. Moreover, be aware that Magentic-One may be susceptible to prompt injection attacks from webpages.

> [!NOTE]
> This code is currently being ported to AutoGen AgentChat. If you want to build on top of Magentic-One, we recommend waiting for the port to be completed. In the meantime, you can use this codebase to experiment with Magentic-One.

We are introducing Magentic-One, our new generalist multi-agent system for solving open-ended web and file-based tasks across a variety of domains. Magentic-One represents a significant step towards developing agents that can complete tasks that people encounter in their work and personal lives.

Find additional information about Magentic-One in our [blog post](https://aka.ms/magentic-one-blog) and [technical report](https://arxiv.org/abs/2411.04468).

![](./imgs/autogen-magentic-one-example.png)

> _Example_: The figure above illustrates the Magentic-One multi-agent team completing a complex task from the GAIA benchmark. Magentic-One's Orchestrator agent creates a plan, delegates tasks to other agents, and tracks progress towards the goal, dynamically revising the plan as needed. The Orchestrator can delegate tasks to a FileSurfer agent to read and handle files, a WebSurfer agent to operate a web browser, or a Coder or Computer Terminal agent to write or execute code, respectively.

## Architecture

![](./imgs/autogen-magentic-one-agents.png)

Magentic-One is based on a multi-agent architecture in which a lead Orchestrator agent is responsible for high-level planning, directing other agents, and tracking task progress. The Orchestrator begins by creating a plan to tackle the task, gathering the needed facts and educated guesses in a Task Ledger that it maintains. At each step of its plan, the Orchestrator creates a Progress Ledger in which it self-reflects on task progress and checks whether the task is completed. If the task is not yet completed, it assigns one of Magentic-One's other agents a subtask to complete. After the assigned agent completes its subtask, the Orchestrator updates the Progress Ledger and continues in this way until the task is complete. If the Orchestrator finds that progress is not being made for enough steps, it can update the Task Ledger and create a new plan.
This is illustrated in the figure above; the Orchestrator's work is thus divided into an outer loop, where it updates the Task Ledger, and an inner loop, where it updates the Progress Ledger.

Overall, Magentic-One consists of the following agents:

- Orchestrator: The lead agent responsible for task decomposition and planning, directing other agents in executing subtasks, tracking overall progress, and taking corrective actions as needed.
- WebSurfer: This is an LLM-based agent that is proficient in commanding and managing the state of a Chromium-based web browser. With each incoming request, the WebSurfer performs an action on the browser and then reports on the new state of the web page. The action space of the WebSurfer includes navigation (e.g., visiting a URL, performing a web search); web page actions (e.g., clicking and typing); and reading actions (e.g., summarizing or answering questions). The WebSurfer relies on the accessibility tree of the browser and on set-of-marks prompting to perform its actions.
- FileSurfer: This is an LLM-based agent that commands a markdown-based file preview application to read local files of most types. The FileSurfer can also perform common navigation tasks such as listing the contents of directories and navigating a folder structure.
- Coder: This is an LLM-based agent specialized through its system prompt for writing code, analyzing information collected from the other agents, or creating new artifacts.
- ComputerTerminal: Finally, ComputerTerminal provides the team with access to a console shell where the Coder’s programs can be executed, and where new programming libraries can be installed.

Together, Magentic-One’s agents provide the Orchestrator with the tools and capabilities that it needs to solve a broad variety of open-ended problems, as well as the ability to autonomously adapt to, and act in, dynamic and ever-changing web and file-system environments.

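The two-loop control flow described in this section can be sketched in a few lines of Python. This is only an illustrative sketch, not the Magentic-One implementation: the names `TaskLedger`, `ProgressLedger`, `run_orchestrator`, `make_plan`, `reflect`, `max_stalls`, and `max_steps` are all hypothetical, and the LLM calls are replaced by plain callables supplied by the caller.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class TaskLedger:
    """Outer-loop state (hypothetical): the task, gathered facts, and current plan."""
    task: str
    facts: List[str] = field(default_factory=list)
    plan: List[str] = field(default_factory=list)


@dataclass
class ProgressLedger:
    """Inner-loop state (hypothetical): one self-reflection result per step."""
    is_complete: bool
    is_progressing: bool
    next_agent: str
    instruction: str


def run_orchestrator(
    task: str,
    agents: Dict[str, Callable[[str], str]],
    make_plan: Callable[[TaskLedger, List[str]], List[str]],
    reflect: Callable[[TaskLedger, List[str]], ProgressLedger],
    max_stalls: int = 2,
    max_steps: int = 20,
) -> List[str]:
    """Run a toy version of the outer (re-planning) and inner (delegation) loops."""
    ledger = TaskLedger(task=task)
    history: List[str] = []
    stalls = 0
    needs_plan = True
    for _ in range(max_steps):
        if needs_plan:
            # Outer loop: update the Task Ledger with a fresh plan.
            ledger.plan = make_plan(ledger, history)
            stalls = 0
            needs_plan = False
        # Inner loop: self-reflect and build a new Progress Ledger.
        progress = reflect(ledger, history)
        if progress.is_complete:
            break
        if progress.is_progressing:
            stalls = 0
        else:
            stalls += 1
            if stalls > max_stalls:
                # Too many stalled steps: return to the outer loop.
                needs_plan = True
                continue
        # Delegate the next subtask to the chosen agent.
        result = agents[progress.next_agent](progress.instruction)
        history.append(f"{progress.next_agent}: {result}")
    return history


def demo() -> List[str]:
    """Toy run with stand-in agents; no LLM or browser is involved."""
    agents = {
        "WebSurfer": lambda instruction: f"searched for '{instruction}'",
        "Coder": lambda instruction: f"wrote code for '{instruction}'",
    }

    def make_plan(ledger: TaskLedger, history: List[str]) -> List[str]:
        return ["search the web", "write code"]

    def reflect(ledger: TaskLedger, history: List[str]) -> ProgressLedger:
        if len(history) >= len(ledger.plan):
            return ProgressLedger(True, True, "", "")
        name = "WebSurfer" if not history else "Coder"
        return ProgressLedger(False, True, name, ledger.plan[len(history)])

    return run_orchestrator("toy task", agents, make_plan, reflect)
```

Calling `demo()` delegates one subtask to the stand-in WebSurfer and one to the stand-in Coder before the reflection step reports completion. The `max_steps` bound is a design choice of this sketch so that the loop always terminates, even when every reflection reports a stall.
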
While the default multimodal LLM we use for all agents is GPT-4o, Magentic-One is model agnostic and can incorporate heterogeneous models to support different capabilities or meet different cost requirements when getting tasks done. For example, it can use different LLMs and SLMs, and their specialized versions, to power different agents. We recommend a strong reasoning model for the Orchestrator agent, such as GPT-4o. In a different configuration of Magentic-One, we also experiment with using OpenAI o1-preview for the outer loop of the Orchestrator and for the Coder, while other agents continue to use GPT-4o.

### Logging in Team One Agents

Team One agents can emit several log events that can be consumed by a log handler (see the example log handler in [utils.py](src/autogen_magentic_one/utils.py)). The currently emitted events are:

- OrchestrationEvent: emitted by an [Orchestrator](src/autogen_magentic_one/agents/base_orchestrator.py) agent.
- WebSurferEvent: emitted by a [WebSurfer](src/autogen_magentic_one/agents/multimodal_web_surfer/multimodal_web_surfer.py) agent.

In addition, developers can handle and process logs generated by the AutoGen core library (e.g., LLMCallEvent). See the example log handler in [utils.py](src/autogen_magentic_one/utils.py) for how this can be implemented. By default, the logs are written to a file named `log.jsonl`, which can be configured as a parameter to the defined log handler. These logs can be parsed to retrieve data on agent actions.

# Setup and Usage

You can install the Magentic-One package and then run the example code to see how the agents work together to accomplish a task.

1. Clone the code and install the package:

   The easiest way to install is with the [uv package installer](https://docs.astral.sh/uv/getting-started/installation/), which must be installed separately. Using uv is optional.

   Clone the repo, then use uv to set up and activate a virtual environment:

   ```bash
   git clone https://github.com/microsoft/autogen.git
   cd autogen/python
   uv sync --all-extras
   source .venv/bin/activate
   ```

   For Windows, run `.venv\Scripts\activate` to activate the environment.

2. I
