How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough
This article examines the MultiAgentPPT project, detailing its multi‑agent workflow, the four core agents that generate outlines, split topics, conduct research, and summarize results, and explains how the system retrieves data via a WeChat crawler and constructs prompts for LLM‑driven PPT creation.
MultiAgentPPT Implementation Overview
The MultiAgentPPT project (https://github.com/johnson7788/MultiAgentPPT) implements an A2A+MCP+ADK multi‑agent system that streams concurrent PPT generation, inspired by https://github.com/allweonedev/presentation-ai.
The workflow is visualized in two diagrams (shown below) and consists of four agents.
Core Agents
Outline Generation Agent creates an initial content outline based on user requirements.
Topic Splitting Agent divides the outline into multiple research topics.
Research Agent runs in parallel, with each sub‑agent performing deep research on an assigned topic.
Summary Agent aggregates the research results and streams the final PPT content back to the frontend.
The backend implementation resides at https://github.com/johnson7788/MultiAgentPPT/tree/main/backend.
Data Source and Retrieval
The system relies on a Retrieval‑Augmented Generation (RAG) pipeline, requiring a knowledge base that is kept up‑to‑date via a web crawler. The crawler code (
weixin_search.py) searches WeChat public accounts, obtains real URLs, and extracts article content.
The extracted WeChat content is then fed into the knowledge base for RAG.
Sub‑Agent Execution Logic
Key modules include
simpleOutline(frontend outline test),
simplePPT(simple PPT test),
slide_outline(outline generation with retrieval), and
slide_agent(PPT generation from outline).
slide_outlineuses RAG to fetch relevant articles, assembles them into a prompt, and sends it to the LLM for generation.
The
slide_agentfurther splits into components:
research_topic,
split_topic, and
summary_writer, each driven by specific prompts.
The
split_topicprompt parses the outline into independent research topics, while
research_topiccreates parallel researchers to gather material. The
SummaryAgentcompiles the results using XML‑style prompts.
References
1. https://github.com/johnson7788/MultiAgentPPT
Architect
Professional architect sharing high‑quality architecture insights. Topics include high‑availability, high‑performance, high‑stability architectures, big data, machine learning, Java, system and distributed architecture, AI, and practical large‑scale architecture case studies. Open to ideas‑driven architects who enjoy sharing and learning.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.