Skip to main content

AutoGPT - Developer Guide

What's this about?

AutoGPT is an open-source platform to build, deploy and run continuous AI agents that automate multi-step workflows. You assemble an agent visually by wiring together blocks β€” each block is one discrete action β€” into a graph, then run it on demand, on a schedule, or in response to external triggers. This guide is the developer entry point: it explains what AutoGPT is today, how the platform is put together, the vocabulary you need, and where everything lives in the repo.

Checked against primary sources

This guide is based on the official AutoGPT documentation (agpt.co/docs) and the Significant-Gravitas/AutoGPT GitHub repository (dev branch), reviewed on June 25, 2026. Code signatures and file paths are quoted from the repository source; the platform moves fast, so always cross-check against the current dev branch before relying on a specific detail.

1. What AutoGPT is (and what it isn't)​

The name "AutoGPT" covers two distinct projects that live in the same repository. Knowing which one you are looking at saves a lot of confusion.

AutoGPT PlatformAutoGPT Classic
What it isA modern platform to build, deploy and run AI agents as visual workflowsThe original 2023 standalone autonomous agent
How you use itWeb UI (low-code Agent Builder) + a server backendPython toolkit, CLI and a reference agent
StateActively developed, the focus of the projectLegacy / maintenance, kept for reference
Lives inautogpt_platform/classic/ (Forge, benchmark, frontend)
LicensePolyform ShieldMIT

The rest of this guide is about the Platform. Classic is summarised in Β§6.

Mental model

Think of the Platform as a visual programming environment for AI agents. Instead of writing a script that calls an LLM and some APIs, you drop blocks on a canvas and connect them. The runtime executes that graph for you, handles credentials, retries, scheduling and billing of credits, and lets you publish the result for others to run.

2. The two halves of the Platform​

The Platform is split into a frontend (where humans build and operate agents) and a server (where agents actually run).

Frontend​

A Next.js / TypeScript web application. It is where you:

CapabilityWhat it does
Agent BuilderA low-code canvas to design and configure agents by connecting blocks
Workflow managementBuild, modify and optimise the graph; each block performs a single action
Deployment controlsManage an agent's lifecycle from testing to production
Library / Ready-to-use agentsPick a pre-configured agent and run it without building anything
Agent interactionRun your agents and provide their inputs through the UI
Monitoring & analyticsTrack runs, outputs and performance over time

Server​

The backend is the "powerhouse" where agents run. Once deployed, an agent can be triggered by external sources (webhooks, schedules, manual runs) and operate continuously. The server is a set of Python services and contains:

  • Core logic β€” the graph engine and the block library that drive execution.
  • Infrastructure β€” the supporting services (database, queue, cache, auth, file scanning, …).
  • Marketplace β€” a catalogue where you can find and deploy pre-built agents.

Under Docker Compose the backend is not a single process but several cooperating services β€” a REST API server, an executor, a websocket server, a database manager, a scheduler and a notification server β€” plus their dependencies (Postgres via Supabase, Redis/FalkorDB, RabbitMQ, ClamAV). You don't need to think about each one to build agents, but it's useful to know the backend is a distributed set of services when self-hosting (see Self-Hosting).

3. Core concepts​

These five terms appear everywhere in the UI, the docs and the code. Internalise them before going further.

ConceptDefinition
BlockThe smallest unit of functionality β€” one block does one thing (call an LLM, send an email, do maths, read a webpage). Blocks have a typed input schema and a typed output schema.
NodeA specific instance of a block placed on the canvas, with its inputs configured. The same block can appear as many nodes.
LinkA connection from an output pin of one node to an input pin of another. Links are how data flows through the agent.
GraphThe whole agent: a set of nodes connected by links. "Agent" and "graph" are effectively synonyms β€” the graph is the agent's workflow.
AgentA graph you can run, schedule, publish and share. An agent has versions.

Two more terms for distribution:

  • Marketplace β€” the public catalogue of agents others have published; browse, test and deploy them.
  • Library β€” your own collection of agents (built or added from the marketplace), ready to run.
            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”      link       β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     link      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
input ───▢ β”‚ Node A β”‚ ───────────────▢│ Node B │──────────────▢│ Node C β”‚ ───▢ output
β”‚ (Block) β”‚ output β†’ input β”‚ (Block) β”‚ β”‚ (Block) β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
one Graph = one Agent
How data actually moves

A block's run() method is an async generator: it yields one or more (output_name, value) pairs. Each yielded value travels along every link attached to that output pin and lands on the connected input pin of the downstream node. This streaming model is why a single block can emit multiple results over time, not just one return value. The mechanics are covered in Building Blocks.

4. Tech stack​

AreaTechnology
Backend languagePython (~69% of the repo)
Frontend languageTypeScript (~29%)
ContainerisationDocker Engine β‰₯ 20.10, Docker Compose β‰₯ 2.0
DatabasePostgreSQL via Supabase, accessed through Prisma
Queue / cacheRabbitMQ, Redis / FalkorDB
Schema/validationPydantic models for every block's input & output schema
Standard (Classic)Agent Protocol by the AI Engineer Foundation

The use of Pydantic is important for developers: a block's input and output schemas are Pydantic models, which are converted to JSON Schema and stored in the database. That JSON Schema is what the Agent Builder renders as a form, and what the executor validates against at run time.

5. Monorepo layout​

The repository is a monorepo. The two licensing zones map onto two top-level areas:

AutoGPT/
β”œβ”€β”€ autogpt_platform/ # ← the Platform (Polyform Shield License)
β”‚ β”œβ”€β”€ backend/
β”‚ β”‚ └── backend/
β”‚ β”‚ β”œβ”€β”€ blocks/ # the block library β€” _base.py defines the Block class
β”‚ β”‚ β”œβ”€β”€ data/ # block.py, model.py (SchemaField, Credentials, …), graph, execution
β”‚ β”‚ β”œβ”€β”€ sdk/ # the Block SDK (auto-registration, provider/credentials helpers)
β”‚ β”‚ └── integrations/ # provider definitions, OAuth, webhooks
β”‚ β”œβ”€β”€ frontend/ # Next.js web app (Agent Builder, Library, Marketplace)
β”‚ β”œβ”€β”€ docker-compose.yml # the full self-host stack
β”‚ └── .env.default # template you copy to .env
β”‚
β”œβ”€β”€ classic/ # ← AutoGPT Classic (MIT License)
β”‚ β”œβ”€β”€ forge/ # agent-building toolkit
β”‚ β”œβ”€β”€ benchmark/ # agbenchmark
β”‚ └── frontend/ # classic GUI
β”‚
β”œβ”€β”€ docs/ # documentation source
└── run # root CLI (./run setup | agent | benchmark) β€” Classic

The files most relevant to block development β€” backend/data/block.py, backend/blocks/_base.py and backend/data/model.py β€” are dissected in Building Blocks.

6. AutoGPT Classic (legacy)​

Everything outside autogpt_platform/ is the original AutoGPT, under the MIT license. You will rarely need it for new work, but for context:

  • Forge β€” a ready-to-go toolkit that handles the boilerplate of building your own standalone agent. Its components can also be reused individually.
  • agbenchmark β€” a benchmarking harness for any agent that speaks the Agent Protocol; published on PyPI as agbenchmark.
  • Classic GUI β€” a simple frontend that connects to agents via the Agent Protocol.
  • Root CLI β€” ./run setup, ./run agent, ./run benchmark tie these together. Run ./run setup once to install dependencies.

If your goal is to build modern agents, ignore Classic and work in autogpt_platform/.

7. Licensing β€” read this before you build commercially​

AutoGPT uses two licenses, split by folder. This matters if you plan to host or commercialise:

ScopeLicensePractical meaning
Everything in autogpt_platform/Polyform ShieldSource-available. You may use, self-host and modify it, but the Shield license restricts offering it as a competing commercial product/service.
Everything outside autogpt_platform/ (Classic, Forge, agbenchmark, Classic GUI)MITPermissive β€” use freely, including commercially.
Not legal advice

Polyform Shield is not an OSI "open source" license in the strict sense β€” it is source-available with a non-compete restriction. Before building a commercial offering on top of the Platform, read the actual license text in the repo and get your own legal review. The summary above is orientation, not a legal opinion.

8. Where to go next​

You want to…Read
Run the Platform on your own machineSelf-Hosting β€” requirements, Docker setup, ports, troubleshooting
Write your own blockBuilding Blocks β€” the Block class, schemas, run(), credentials and testing
Compare AutoGPT to other agent frameworksAgent comparison overview

9. Sources​

  • Significant-Gravitas/AutoGPT β€” README.md, autogpt_platform/backend/backend/data/block.py, autogpt_platform/backend/backend/blocks/_base.py, autogpt_platform/backend/backend/data/model.py (dev branch, reviewed 2026-06-25)
  • AutoGPT documentation β€” agpt.co/docs
  • Repository β€” github.com/Significant-Gravitas/AutoGPT