A RAG Agent for orchestrating any API
OrchestrAPI is a RAG agent orchestration framework that interprets complex natural-language queries and executes them against an API. Built on the Cloudflare Agents SDK and a modern serverless stack of Cloudflare Workers, Durable Objects, and the Vercel AI SDK, this project demonstrates a robust, production-ready architecture for creating scalable and intelligent AI agents.
This repository uses the TMDB (The Movie Database) API as a demonstration of the agent's capabilities.
*Demo video: `orchestrapi-demo.mp4`*
- UI Framework: Next.js
- Backend: Cloudflare Workers
- State Management: Cloudflare Durable Objects
- Vector Store: Cloudflare AutoRAG
- Inference Provider: Cloudflare Workers AI
- Agent Framework: Cloudflare Agents SDK
```
.
├── app/
│   ├── api/chat/route.ts              # Next.js API route for the chat UI
│   └── assistant.tsx                  # Main frontend component for the assistant UI
├── cloudflare/
│   └── agent/
│       ├── streaming-orchestrator.ts  # Coordinates the 4-step agent process
│       ├── planning-service.ts        # Creates execution plan from user query
│       ├── tool-execution-service.ts  # Executes API calls based on the plan
│       └── rag-service.ts             # Finds relevant endpoints with AutoRAG
├── lib/
│   └── tmdb-open-api.json             # OpenAPI spec for the TMDB API
└── wrangler.jsonc                     # Configuration for the Cloudflare Worker
```
- Streaming Orchestrator (`streaming-orchestrator.ts`): The core of the agent, managing the flow from query to response.
- Hybrid RAG Search (`rag-service.ts`): Finds relevant API endpoints using Cloudflare AutoRAG.
- LLM Planning (`planning-service.ts`): Generates a multi-step execution plan using an LLM.
- Deterministic Tool Execution (`tool-execution-service.ts`): Executes the plan by calling the correct API tools.
- Frontend (`assistant.tsx`): The main React component for the chat interface.
```mermaid
graph TD
    subgraph "User's Browser"
        A[Chat Interface]
    end
    subgraph "Cloudflare Network"
        B[Next.js Worker/Agent Proxy]
        subgraph "Agent"
            C[Orchestrator] --> D[Planning Service]
            C --> E[Tool Execution Service]
            C --> F[Response Generation]
        end
    end
    subgraph "External Services"
        G["Cloudflare Workers AI <br>(LLM & Vector Search)"]
        H[TMDB API]
    end
    A -- "POST /api/chat" --> B
    B --> C
    D -- "Creates Plan" --> G
    E -- "Executes Tools" --> H
    F -- "Generates Final Answer" --> G
```
The agent uses a 4-step process to answer user queries, coordinated by a central `StreamingOrchestrator`. This provides a transparent, real-time experience for the user.
- Hybrid RAG Search: For any user query, the agent first queries Cloudflare AutoRAG, which performs a semantic search against the OpenAPI specification to find the most relevant API endpoints. Each endpoint in the OpenAPI spec is formatted and summarized to fit the chunk-size limit of the vector database. To make the agent more robust, these results are combined with a curated "safety net" of foundational tools (e.g., `search-company`, `search-person`) that are essential for resolving entities.
- LLM Planning: The combined documentation is then passed to the `PlanningService`, which uses DeepSeek's R1 Distill Qwen 32B model to create a structured, multi-step JSON execution plan. The prompt for this stage is hardened to ensure the LLM generates valid, executable plans that can handle dependencies between steps (e.g., using the ID from step 1 as an input for step 2).
- Deterministic Tool Execution: The `ToolExecutionService` executes the plan. It uses functions derived from the OpenAPI spec and calls them directly with parameters from the plan. This approach is deterministic and avoids unpredictable "AI calling an AI" behavior.
- Response Generation: Finally, the raw JSON results from the API calls are passed to the `ResponseGenerationService`, which uses one last LLM call to synthesize the data into a human-readable answer.
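The four steps above can be sketched as a single orchestration function. Everything here is illustrative: the function names mirror the services described above, but the signatures and stub bodies are assumptions, not the repository's actual code.

```typescript
// Hypothetical sketch of the 4-step orchestration flow.
interface PlanStep {
  tool: string;
  // Parameter values may reference earlier outputs, e.g. "$step1.id".
  params: Record<string, string | number>;
}

// Stubs standing in for the real RAG, planning, execution, and
// response-generation services.
async function ragSearch(query: string): Promise<string[]> {
  // Would query AutoRAG; here we return a fixed "safety net" plus a match.
  return ["search-company", "discover-movie"];
}

async function createPlan(query: string, tools: string[]): Promise<PlanStep[]> {
  // Would call the planning LLM; here we return a canned two-step plan.
  return [
    { tool: "search-company", params: { query: "Pixar" } },
    { tool: "discover-movie", params: { with_companies: "$step1.id" } },
  ];
}

async function executePlan(plan: PlanStep[]): Promise<unknown[]> {
  const results: unknown[] = [];
  for (const step of plan) {
    // Deterministic dispatch: look up the tool function and call it
    // directly with the plan's parameters (no LLM in the loop here).
    results.push({ tool: step.tool, ok: true });
  }
  return results;
}

async function generateResponse(results: unknown[]): Promise<string> {
  // Would be one final LLM call to summarize the raw JSON results.
  return `Executed ${results.length} steps.`;
}

async function orchestrate(query: string): Promise<string> {
  const tools = await ragSearch(query);          // 1. Hybrid RAG search
  const plan = await createPlan(query, tools);   // 2. LLM planning
  const results = await executePlan(plan);       // 3. Deterministic execution
  return generateResponse(results);              // 4. Response generation
}
```

The real orchestrator also streams intermediate reasoning to the UI; this sketch only shows the sequencing of the four stages.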
```mermaid
sequenceDiagram
    participant User
    participant Next.js Frontend
    participant Agent
    participant Workers AI (LLM)
    participant TMDB API

    User->>Next.js Frontend: "What are the most popular movies produced by Pixar?"
    Next.js Frontend->>Agent: POST /api/chat
    Agent->>Agent: 1. Hybrid RAG Search
    Note right of Agent: Combines semantic search results with a 'safety net' of foundational tools (like search-company).
    Agent->>Workers AI (LLM): 2. Plan Execution
    Note right of Agent: "Find Pixar's ID, then find their movies."
    Workers AI (LLM)-->>Agent: Execution Plan (JSON)
    Agent->>TMDB API: 3. Execute Step 1: search-company(query="Pixar")
    TMDB API-->>Agent: Company ID: 3
    Agent->>TMDB API: 3. Execute Step 2: discover-movie(with_companies=3)
    TMDB API-->>Agent: List of Pixar movies
    Agent->>Workers AI (LLM): 4. Generate Response
    Note right of Agent: Synthesize the list of movies into a friendly answer.
    Workers AI (LLM)-->>Agent: Streamed natural language response
    Agent-->>Next.js Frontend: Stream reasoning & final answer
    Next.js Frontend-->>User: Displays the full execution trace and final answer.
```
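One way the step dependency shown above (step 2 consuming the company ID from step 1) could be handled is a simple placeholder substitution over the plan's parameters. The plan shape and the `$stepN.field` syntax below are hypothetical, not necessarily the format the `PlanningService` actually emits:

```typescript
// Hypothetical plan format with cross-step references.
type Params = Record<string, string | number>;

interface PlanStep {
  id: number;
  tool: string;
  params: Params;
}

// Replace "$stepN.field" placeholders with values from earlier results.
function resolveParams(params: Params, results: Record<number, any>): Params {
  const resolved: Params = {};
  for (const [key, value] of Object.entries(params)) {
    if (typeof value === "string") {
      const match = value.match(/^\$step(\d+)\.(\w+)$/);
      if (match) {
        resolved[key] = results[Number(match[1])][match[2]];
        continue;
      }
    }
    resolved[key] = value;
  }
  return resolved;
}

// Example: the Pixar query from the diagram above.
const plan: PlanStep[] = [
  { id: 1, tool: "search-company", params: { query: "Pixar" } },
  { id: 2, tool: "discover-movie", params: { with_companies: "$step1.id" } },
];

// After step 1 returns { id: 3 }, step 2's parameters resolve to real values.
const stepResults: Record<number, any> = { 1: { id: 3, name: "Pixar" } };
const step2Params = resolveParams(plan[1].params, stepResults);
// step2Params is { with_companies: 3 }
```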
- Hybrid RAG: Combines semantic search with a curated list of foundational tools for robust planning.
- Multi-Step Planning: Capable of creating and executing complex, multi-step plans to answer nuanced questions.
- Deterministic Execution: Uses a reliable, direct execution path for tool calls, avoiding unpredictable behavior.
- Streaming UI: Provides a real-time, transparent view into the agent's reasoning process, using the Vercel AI SDK's streaming utilities.
- Serverless & Scalable: Built entirely on Cloudflare's serverless platform (Workers, Durable Objects, AutoRAG), ensuring high scalability and cost-efficiency.
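As a rough illustration of the streaming design, agent events can be flushed to the browser as they occur via a `ReadableStream`. The helper and event format below are assumptions for illustration; the project's actual protocol (built on the Vercel AI SDK's streaming utilities) may differ:

```typescript
// Hypothetical: turn the agent's step-by-step events into a streamed
// HTTP response body so the UI can render reasoning in real time.
function eventsToStream(events: AsyncIterable<string>): ReadableStream<Uint8Array> {
  const encoder = new TextEncoder();
  return new ReadableStream({
    async start(controller) {
      for await (const event of events) {
        // Each agent event (plan step, tool result, answer token) is
        // flushed immediately rather than buffered until completion.
        controller.enqueue(encoder.encode(`data: ${event}\n\n`));
      }
      controller.close();
    },
  });
}

// Illustrative event source standing in for the agent's reasoning trace.
async function* demoEvents() {
  yield "step: Hybrid RAG search";
  yield "step: Planning";
  yield "answer: Toy Story, Up, ...";
}

// In a route handler, this stream would become the response body:
// return new Response(eventsToStream(demoEvents()), {
//   headers: { "Content-Type": "text/event-stream" },
// });
```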
- A Cloudflare account
- A TMDB API Key
- Node.js (v18 or later)
- Clone the repository:

  ```bash
  git clone https://github.com/hasibhassan/orchestrapi.git
  cd orchestrapi
  ```

- Install dependencies:

  ```bash
  npm install
  ```

- Configure Cloudflare:

  - Rename `wrangler.toml.example` to `wrangler.toml` and fill in your Cloudflare account details.
  - Create a D1 database and an AutoRAG index in your Cloudflare dashboard. Add the binding names and IDs to your `wrangler.toml`.
  - Create a `.dev.vars` file in the root of the project and add your TMDB API key (or set it via the Console):

    ```
    TMDB_API_TOKEN="your_tmdb_api_token_here"
    ```
  > [!NOTE]
  > You will need to run a script to parse your `tmdb-open-api.json` and insert the vectors into your Cloudflare AutoRAG index. (The seeding script is not included in this repository; keep in mind the chunking limits in Vectorize.)
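For reference, a minimal sketch of what such a seeding step might look like: each OpenAPI operation is summarized into a text chunk that respects a size limit before indexing. The chunk format, the limit, and the `Operation` shape are assumptions, since the actual script is not in the repository.

```typescript
// Hypothetical chunking for seeding the AutoRAG index.
interface Operation {
  operationId: string;
  summary?: string;
  parameters?: { name: string; description?: string }[];
}

const MAX_CHUNK_CHARS = 1000; // assumed Vectorize-friendly limit

function toChunk(path: string, method: string, op: Operation): string {
  const params = (op.parameters ?? [])
    .map((p) => `${p.name}: ${p.description ?? ""}`)
    .join("; ");
  const text =
    `${method.toUpperCase()} ${path} (${op.operationId}): ` +
    `${op.summary ?? ""}. Params: ${params}`;
  // Truncate rather than fail when an endpoint's docs are too long.
  return text.slice(0, MAX_CHUNK_CHARS);
}

// Example with one TMDB-style endpoint from the spec:
const chunk = toChunk("/search/company", "get", {
  operationId: "search-company",
  summary: "Search for companies by name",
  parameters: [{ name: "query", description: "Company name to search" }],
});
```

A real script would iterate over every path/method pair in `tmdb-open-api.json` and upload the resulting chunks to the AutoRAG index.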
- Run the development server:

  ```bash
  npm run dev
  ```

  To deploy, ensure you have the Cloudflare Wrangler CLI configured properly:

  ```bash
  npm run deploy
  # As of now, the agent-proxy worker must be deployed manually:
  wrangler deploy --config wrangler.agent-worker.jsonc
  ```
This will start the Next.js frontend and the Cloudflare Worker backend simultaneously. You can ask questions like:
- "Find the highest-rated sci-fi movie from 2023"
- "What are the top 5 action movies?"
- "Get details about the movie Inception"