super-agent-party
⭐ A new generation of agent management mid-platform! Upgrade your LLM API to an Agent API with one click! Supports Windows / macOS / Linux desktop and Docker!




Introduction

🚀 Zero-intrusive and ultra-simple to extend: give any LLM API enterprise-level capabilities without modifying a single line of code. Seamlessly attach knowledge bases, real-time internet access, permanent memory, MCP, A2A, deep-thinking control, in-depth research, and custom tools to your LLM interface, creating a plug-and-play LLM enhancement platform.


Why Choose Us?

  • ✅ Efficient development: supports streaming output, does not slow down the original API's responses, and requires no code changes
  • ✅ Quick integration: avoids integrating multiple service providers for a single feature; ships with adapters for mainstream LLM vendors and agent protocols, compatible with OpenAI / Ollama / MCP / A2A, so you can experience next-generation LLM middleware instantly
  • ✅ High customization: supports custom knowledge bases, real-time web access, permanent memory, code execution tools, MCP, A2A, deep-thinking control, in-depth research, custom tools, and other advanced agent capabilities, creating a pluggable LLM enhancement platform. Customized agents can be saved as snapshots for easy reuse, and snapshotted agents can be called directly through the OpenAI API.
  • ✅ Data security: supports local knowledge bases and local model access, keeping data on-premises and protecting enterprise data. All files are cached locally and never uploaded anywhere.
  • ✅ Team collaboration: supports multi-person sharing of knowledge bases, model services, tools, MCP, A2A, and other resources. Chat records and knowledge-base files and images are stored locally, so the service can double as a local file or image host.

Quick Start

Windows Desktop Installation

👉 Click to download

⭐ Note! Choose to install for the current user only; otherwise, administrator privileges will be required to start.

macOS Desktop Installation (beta)

👉 Click to download

⭐ Note! After downloading, drag the app from the dmg into the /Applications directory. Then open Terminal and run the following command, entering your password when prompted, to remove the quarantine attribute that macOS adds to files downloaded from the internet:

sudo xattr -dr com.apple.quarantine /Applications/Super-Agent-Party.app

Linux Desktop Installation

We provide two mainstream Linux installation package formats for your convenience in different scenarios.

1. Install using .AppImage (Recommended)

.AppImage is a portable Linux application format that requires no installation and runs immediately. It works on most Linux distributions.

👉 Click to download

2. Install using .deb package (Suitable for Ubuntu/Debian systems)

👉 Click to download

Docker Deployment (Recommended)

  • Install this project with two commands:

    docker pull ailm32442/super-agent-party:latest
    docker run -d -p 3456:3456 -v ./super-agent-data:/app/data ailm32442/super-agent-party:latest
  • ⭐ Note! ./super-agent-data can be replaced with any local folder. After Docker starts, all data is cached in that folder and never uploaded anywhere.

  • Plug and play: access http://localhost:3456/
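
As a quick smoke test once the container is running, here is a minimal sketch (assuming the default port mapping above and the third-party requests library; any HTTP client works):

    import requests  # pip install requests

    # The web UI listens on the port mapped in the docker run command (3456 by default).
    resp = requests.get("http://localhost:3456/", timeout=5)
    print("Super Agent Party is up" if resp.ok else f"unexpected status: {resp.status_code}")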

Source Code Deployment

  • Windows:

    git clone https://github.com/heshengtao/super-agent-party.git
    cd super-agent-party
    uv sync
    npm install
    start_with_dev.bat
  • Linux or Mac:

    git clone https://github.com/heshengtao/super-agent-party.git
    cd super-agent-party
    uv sync
    npm install
    chmod +x start_with_dev.sh
    ./start_with_dev.sh

For detailed deployment methods, please refer to the Deployment and Usage Documentation

Usage

  • Desktop: Click the desktop icon to use immediately.

  • Web or docker: Access http://localhost:3456/ after startup.

  • API call: developer-friendly and fully compatible with the OpenAI format; output streams in real time without affecting the original API's response speed, and no calling code needs to change (a streaming variant is sketched after this list):

    from openai import OpenAI

    client = OpenAI(
        api_key="super-secret-key",          # your Super Agent Party API key
        base_url="http://localhost:3456/v1"  # route requests through Super Agent Party
    )
    response = client.chat.completions.create(
        model="super-model",  # model name exposed by Super Agent Party
        messages=[
            {"role": "user", "content": "What is Super Agent Party?"}
        ]
    )
    print(response.choices[0].message.content)
  • MCP call: after starting, you can invoke the local MCP service by adding the following to your MCP client's configuration file (a programmatic check is sketched after this list):

    {
      "mcpServers": {
        "super-agent-party": {
          "url": "http://127.0.0.1:3456/mcp"
        }
      }
    }
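
Because the gateway supports streaming output, the same client can consume tokens incrementally. A minimal sketch using the OpenAI Python SDK's streaming mode, reusing the key, URL, and model name from the example above:

    from openai import OpenAI

    client = OpenAI(api_key="super-secret-key", base_url="http://localhost:3456/v1")
    stream = client.chat.completions.create(
        model="super-model",
        messages=[{"role": "user", "content": "What is Super Agent Party?"}],
        stream=True,  # yield chunks as the agent produces them
    )
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:  # some chunks carry no content (e.g. role or finish markers)
            print(delta, end="", flush=True)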
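
To verify the MCP endpoint programmatically, here is a sketch using the official mcp Python SDK (pip install mcp) over streamable HTTP; the URL is the one from the configuration above, and listing tools is just one example of what a client can do:

    import asyncio

    from mcp import ClientSession
    from mcp.client.streamable_http import streamablehttp_client

    async def main():
        # Open a streamable-HTTP connection to the local MCP endpoint
        async with streamablehttp_client("http://127.0.0.1:3456/mcp") as (read, write, _):
            async with ClientSession(read, write) as session:
                await session.initialize()          # MCP handshake
                tools = await session.list_tools()  # enumerate exposed tools
                print([tool.name for tool in tools.tools])

    asyncio.run(main())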

Features

  1. Open the call methods page from the sidebar to see how to invoke Agent Party through the OpenAI API, the MCP server, Docker, and the web interface. The OpenAI-compatible interface adds the following switch parameters (see the first sketch after this list):
  • enable_thinking: defaults to False; whether to enable thinking mode.
  • enable_deep_research: defaults to False; whether to enable deep-research mode.
  • enable_web_search: defaults to False; whether to enable web search.
  2. Knowledge bases, which let the model answer based on the information they contain (the second sketch after this list illustrates hybrid scoring). Supported capabilities:
  • With multiple knowledge bases, the model actively queries the one relevant to the question.
  • You can choose when retrieval happens: the knowledge base can be queried actively by the model or retrieved passively on every message.
  • Rerank models are supported to improve retrieval quality.
  • Hybrid search is supported, letting you choose the proportion between keyword search and semantic search.
  3. Web access, which lets the model actively look up information online as the question requires. Currently supported:
  • duckduckgo (completely free, but not reachable from mainland China's network environment)
  • searxng (can be deployed locally with Docker)
  • tavily (requires an API key)
  • jina (can be used for web scraping without an API key)
  • crawl4ai (can be deployed locally with Docker; used for web scraping)
  4. MCP services, which the model can actively invoke as the query requires. Four invocation methods are currently supported: standard input/output, server-sent events (SSE), streamable HTTP, and WebSocket.
  5. A2A services, which the model can actively invoke as the query requires.
  6. Deep thinking, which transplants the reasoning ability of a reasoning model into tool-capable or multimodal models, so the model can run a reasoning analysis with the reasoning model before invoking tools. For example, deepseek-V3 can invoke tools but the reasoning model deepseek-R1 cannot; transplanting deepseek-R1's reasoning into deepseek-V3 lets deepseek-V3 reason with deepseek-R1 before each tool invocation.
  7. In-depth research, which converts the user's question into a task, analyzes and reasons step by step, and invokes tools. After producing a result it re-checks whether the task is complete; if not, it keeps analyzing and invoking tools until it is.
  8. Custom LLM tools, which turn LLM interfaces into tools: any project that speaks the Ollama or OpenAI format can be used as a tool.
  9. Visual caching, which lets you configure a separate vision model for recognizing image content. Recognition results are cached to save tokens, and configuring a vision model gives visual capabilities to models that lack them (for example, most reasoning models).
  10. Storage space management, which lets you browse the files and images uploaded in chats. Everything is cached locally, strengthening the software's role as an image and file store.
  11. Memory module, viewable on the tools page:
  • To add new memories, configure a word embedding model; the agent then updates the memory vector database in real time and automatically searches for relevant memories on every answer.
  • The memory module can be enabled and disabled in the memory configuration, and the number of retrieved results can be adjusted so the agent sees more or fewer relevant memories.
  12. Code execution tool, supporting both cloud and local sandboxes (see the third sketch after this list):
  • Cloud: invokes the e2b code sandbox; an API key must be obtained.
  • Local: uses the bytedance/SandboxFusion code sandbox, deployed locally with Docker:
    docker run -it -p 8080:8080 volcengine/sandbox-fusion:server-20241204
    For users in mainland China, the following mirror is provided:
    docker run -it -p 8080:8080 vemlp-cn-beijing.cr.volces.com/preset-images/code-sandbox:server-20241204
  13. Built-in widgets: current time, file/image URL retrieval, pseudo reasoning, Pollinations image generation, enhanced LaTeX rendering, and language tone:
  • Current time: returns the current time.
  • File/image URL retrieval: fetches the content behind a file or image URL.
  • Pseudo reasoning: gives reasoning-style output to a model without native reasoning capabilities.
  • Pollinations image generation: calls the Pollinations image generation API (no API key needed).
  • Enhanced LaTeX rendering: steers the model toward more stable LaTeX formula output.
  • Language tone: steers the model toward a more stable output language and tone.
  14. Custom HTTP requests as agent tools: any HTTP request can now be used as an agent tool, added through the Agent Toolkit interface.
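
How the switch parameters from item 1 can be toggled per request: a minimal sketch with the OpenAI Python SDK, under the assumption that the flags are accepted as extra top-level fields in the JSON request body (the SDK's extra_body merges them in):

    from openai import OpenAI

    client = OpenAI(api_key="super-secret-key", base_url="http://localhost:3456/v1")
    response = client.chat.completions.create(
        model="super-model",
        messages=[{"role": "user", "content": "Summarize this week's AI news."}],
        # assumption: the gateway reads these switches from the request body
        extra_body={
            "enable_thinking": True,
            "enable_deep_research": False,
            "enable_web_search": True,
        },
    )
    print(response.choices[0].message.content)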
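
For intuition about item 2's hybrid search, retrieval systems typically blend a keyword score (e.g. BM25) with a semantic similarity score using a user-chosen weight. A conceptual sketch, not Super Agent Party's internal code:

    # Illustrative only: blend normalized keyword and semantic scores.
    def hybrid_score(keyword_score: float, semantic_score: float, alpha: float = 0.5) -> float:
        """alpha is the keyword-vs-semantic proportion the UI lets you choose."""
        return alpha * keyword_score + (1 - alpha) * semantic_score

    # A chunk that matches keywords strongly but is semantically weaker:
    print(hybrid_score(keyword_score=0.9, semantic_score=0.4, alpha=0.7))  # 0.75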
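
Once the local SandboxFusion container from item 12 is running, code can be executed over HTTP. A sketch against its documented run_code endpoint; the payload shape is an assumption drawn from SandboxFusion's docs, so verify it against that project's README:

    import requests  # pip install requests

    # assumption: SandboxFusion exposes POST /run_code taking {"code", "language"}
    resp = requests.post(
        "http://localhost:8080/run_code",
        json={"code": "print('hello from the sandbox')", "language": "python"},
        timeout=30,
    )
    print(resp.json())  # inspect the returned status/stdout fields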

Disclaimer:

This open-source project and its content (hereinafter, the "project") are provided for reference only and come with no express or implied warranty. The project contributors accept no responsibility for the completeness, accuracy, reliability, or applicability of the project. Any reliance on the project content is at the user's own risk. In no event shall the project contributors be liable for any indirect, special, or incidental loss or damage arising from use of the project content.

License Agreement

This project uses a dual licensing model:

  1. By default, this project is licensed under the GNU Affero General Public License v3.0 (AGPLv3).
  2. To use this project for closed-source commercial purposes, you must obtain a commercial license from the project administrator.

Using this project for closed-source commercial purposes without written authorization is considered a violation of this agreement. The complete text of AGPLv3 can be found in the LICENSE file in the project root directory or at gnu.org/licenses.

Support:


Join the Community

If you have any questions or issues with the project, you are welcome to join our community.

  1. QQ Group: 931057213
  2. WeChat Group: we_glm (add the assistant's WeChat and join the group)
  3. Discord: Discord link

Donate

If my work has brought value to you, please consider buying me a cup of coffee! Your support not only injects vitality into the project but also warms the creator's heart. ☕💖 Every cup counts!
