About

Agentic ADK is an Agent application development framework launched by Alibaba International AI Business, based on Google-ADK and Ali-LangEngine.

中文版说明

Agentic ADK is an Agent application development framework launched by Alibaba International AI Business, based on Google-ADK and Ali-LangEngine. It is used for developing, constructing, evaluating, and deploying powerful, flexible, and controllable complex AI Agents. ADK aims to make Agent development simpler and more user-friendly, enabling developers to more easily build, deploy, and orchestrate various Agent applications ranging from simple tasks to complex collaborations.

Features Overview

Based on the Google ADK interface, it strengthens core execution pathways such as streaming interaction and visualization debugging tools, enabling developers to efficiently develop Agent applications.
Seamlessly integrates with Alibaba's International Multimodal Large Language Model, Ovis, to achieve deep alignment and fusion of visual and textual information. This model is characterized by high performance and lightweight design, offering the following advantages for efficient development and deployment of multimodal Agents:
- Outstanding Logical Reasoning: By combining instruction fine-tuning and preference learning, the model's Chain-of-Thought (CoT) reasoning abilities are significantly enhanced, enabling it to better understand and execute complex instructions.
- Precise Cross-Language Understanding and Recognition: Beyond just Chinese and English, the model has improved text recognition (OCR) capabilities in multilingual environments and optimized the accuracy of structured data extraction from complex visual elements such as tables and charts.
Flexible multi-agent framework, supporting various execution modes such as synchronous, asynchronous, streaming, and parallel, and naturally integrating the A2A protocol.
A high-performance workflow engine combined with agents, built on top of Alibaba's SmartEngine workflow engine, utilizes RxJava3 to implement a reactive programming model. It employs a node-based process system to define agent behaviors, supporting synchronous, asynchronous, and bidirectional communication modes, providing a flexible foundation for building complex AI applications.
Offers hundreds of API tools and introduces the MCP integration gateway.
DeepResearch/RAG, ComputerUse, BrowserUse, Sandbox, and other best practices for Agentic AI.
Implementation of context extension for agent conversations, including Session, Memory, Artifact, and more, with built-in short and long-term memory plugins.
Provides prompt automation tuning and security risk control-related agent examples.

Framework Design

Google ADK Interface-Oriented Design

Agentic ADK inherits the excellent design of google-adk and supports the following key features:

LLM

Rich large model selection. Natively compatible with the use of models/vendors including OpenAI, Bailian/Qwen, OpenRouter, Claude, etc.

Component Abstraction	Description
LangEngine	This component supports all compatible third-party `Model/WorkSpace` integrations under the LangEngine ecosystem into the Agent system, including OpenAI, Bailian/Qwen, Idealab, OpenRouter, etc.
DashScopeLlm	Supports integration with OpenAPI interfaces on Alibaba Cloud Bailian

Agent

Highly abstracted Agent definition and flexible Agent orchestration. The framework has built-in LLM/sequential/parallel/loop Agent definitions; supports single Agent and multi-Agent (MAS) architecture design, facilitating the expansion of your agent design patterns and architecture.

Component Abstraction	Description
LlmAgent	A core component in ADK that acts as the "thinking" part of the application. It leverages the powerful capabilities of large language models (LLMs) for reasoning, understanding natural language, making decisions, generating responses, and interacting with tools.
SequentialAgent	A WorkflowAgent that executes its child Agents in the order specified in the list.
LoopAgent	A WorkflowAgent that executes its child Agents in a loop (i.e., iterative) manner. It repeatedly runs a set of agents until a specified iteration count is reached or a termination condition is met.
ParallelAgent	A WorkflowAgent that can execute its child Agents concurrently, significantly speeding up the entire workflow when subtasks can be executed independently.
Other Advanced Concepts	CustomAgents: Custom Agent processing flows can be implemented by inheriting google.adk.agents.BaseAgent. Multi-Agent Systems: Multiple different Agent instances can be combined into a multi-agent system (MAS), supporting the construction of more complex applications

Tool

Rich tool assembly. Easy integration of Function/MCP and any third-party tools.

Component Abstraction	Description
Function Tool	FunctionTool: Function as a tool, any method can be converted into a Tool for Agent to call. LongRunningFunctionTool: Designed for tasks that require significant processing time without blocking Agent execution. AgentTool: By orchestrating other Agents as tools, their capabilities in the system can be fully utilized. This tool allows the current Agent to call another Agent to perform specific tasks, effectively delegating responsibilities.
DashScopeTool	Integration with Alibaba Cloud Bailian tool applications
MCPTool	ADK built-in MCP tool
GoogleSearchTool	ADK built-in Google search tool
GUITaskExecuteTool	ADK built-in GUI task execution tool

Callback

Flexible Callback mechanism. Provides hooks at multiple timing points during Agent execution, making it convenient to implement custom logic before and after LLM/Tool/Agent calls.

Reference: https://google.github.io/adk-docs/callbacks

Best Practices: https://google.github.io/adk-docs/callbacks/design-patterns-and-best-practices/

Debug & Eval

Out-of-the-box debugging and evaluation capabilities. Provides a white-screen Debug page for quick Agent debugging whether locally or remotely.

Integration of High-Performance Dynamic Workflow Engine

Built on Alibaba's SmartEngine workflow engine, it utilizes RxJava3 to implement reactive programming patterns, employs a node-based process system to define agent behavior, and supports synchronous, asynchronous, and bidirectional communication modes, providing a flexible foundation for building complex AI applications.

┌─────────────────────────────────────────────────────────────────────┐
│                          User Application Layer                     │
├─────────────────────────────────────────────────────────────────────┤
│                       Runner (Execution Entry)                      │
├─────────────────────────────────────────────────────────────────────┤
│                    Pipeline Processing Layer                        │
│  ┌─────────────┐  ┌────────────────┐  ┌─────────────────────────┐  │
│  │ Agent       │  │  ...           │  │  Custom Processing      │  │
│  │ Execution   │  │                │  │  Pipeline               │  │
│  │   Pipe      │  │                │  │                         │  │
│  └─────────────┘  └────────────────┘  └─────────────────────────┘  │
├─────────────────────────────────────────────────────────────────────┤
│                    Flow Engine Layer                                │
│  ┌─────────────┐  ┌────────────────┐  ┌─────────────────────────┐  │
│  │ FlowCanvas  │  │                │  │                         │  │
│  │ (Flow       │  │    FlowNode    │  │  DelegationExecutor     │  │
│  │ Container)  │  │                │  │                         │  │
│  └─────────────┘  └────────────────┘  └─────────────────────────┘  │
├─────────────────────────────────────────────────────────────────────┤
│                    AI Capability Abstraction Layer                  │
│  ┌─────────────┐  ┌────────────────┐  ┌─────────────────────────┐  │
│  │  BasicLlm   │  │    BaseTool    │  │        BaseCondition    │  │
│  │ (LLM Model) │  │   (Tool Set)   │  │  (Conditional Judgment) │  │
│  └─────────────┘  └────────────────┘  └─────────────────────────┘  │
├─────────────────────────────────────────────────────────────────────┤
│                    Infrastructure Layer                             │
│  ┌─────────────┐  ┌────────────────┐  ┌─────────────────────────┐  │
│  │ SmartEngine │  │   RxJava3      │  │  Spring Framework       │  │
│  │ (Workflow   │  │ (Reactive      │  │  (Dependency Injection  │  │
│  │ Engine)     │  │ Programming    │  │  Framework)             │  │
│  │             │  │ Framework)     │  │                         │  │
│  └─────────────┘  └────────────────┘  └─────────────────────────┘  │
└─────────────────────────────────────────────────────────────────────┘

Core Component

Flow Engine Components

FlowCanvas: The main container for flow definition, used to build and deploy workflows
FlowNode: The base class for all flow nodes, defining the basic behavior of nodes
Node Types:
- LlmFlowNode: Used for interacting with large language models
- ToolFlowNode: Used for executing external tools
- ConditionalContainer: Used for conditional branching
- ParallelFlowNode: Used for parallel execution
- ReferenceFlowNode: Used for referencing other flows

Execution Components

Runner: The main entry point for flow execution
DelegationExecutor: Handles the execution of delegated tasks
SystemContext: Contains execution context and configuration information
Request/Result: Data structures for requests and responses

AI Capability Components

BasicLlm Interface and Implementations (e.g., DashScopeLlm): Defines and implements interactions with large language models
LlmRequest/LlmResponse: Data structures for large language model interactions
BaseTool Interface and Implementations (e.g., DashScopeTools): Defines and implements external tool calls

Pipeline System

PipeInterface: Interface for pipeline components
AgentExecutePipe: Main implementation of the execution pipeline
PipelineUtil: Utility class for pipeline execution

Execution Modes

The framework supports three execution modes:

SYNC (Synchronous Mode): Sequential execution, waiting for each node to complete before executing the next
ASYNC (Asynchronous Mode): Asynchronous execution, can process multiple tasks in parallel
BIDI (Bidirectional Mode): Supports bidirectional communication, can dynamically receive input

Usage Guide and Examples

Detailed Usage Guide

DeepSearchAgent Code Example

License

This project is licensed under Apache License Version 2 (https://www.apache.org/licenses/LICENSE-2.0.txt, SPDX-License-identifier: Apache-2.0).

Name		Name	Last commit message	Last commit date
Latest commit History 180 Commits
ali-agentic-adk-core		ali-agentic-adk-core
ali-agentic-adk-extension		ali-agentic-adk-extension
ali-langengine-community		ali-langengine-community
ali-langengine-core		ali-langengine-core
ali-langengine-demo		ali-langengine-demo
ali-langengine-frontend		ali-langengine-frontend
ali-langengine-infrastructure		ali-langengine-infrastructure
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README-langengine.md		README-langengine.md
README-langengine_CN.md		README-langengine_CN.md
README-mcp-en.md		README-mcp-en.md
README-mcp.md		README-mcp.md
README.md		README.md
README_CN.md		README_CN.md
dependency.sh		dependency.sh
deploy.sh		deploy.sh
install.sh		install.sh
mvnw		mvnw
mvnw.cmd		mvnw.cmd
package.sh		package.sh
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Framework Design

Google ADK Interface-Oriented Design

LLM

Agent

Tool

Callback

Debug & Eval

Integration of High-Performance Dynamic Workflow Engine

Core Component

Flow Engine Components

Execution Components

AI Capability Components

Pipeline System

Execution Modes

Usage Guide and Examples

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 16

Uh oh!

Languages

License

AIDC-AI/Agentic-ADK

Folders and files

Latest commit

History

Repository files navigation

About

Framework Design

Google ADK Interface-Oriented Design

LLM

Agent

Tool

Callback

Debug & Eval

Integration of High-Performance Dynamic Workflow Engine

Core Component

Flow Engine Components

Execution Components

AI Capability Components

Pipeline System

Execution Modes

Usage Guide and Examples

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 16

Uh oh!

Languages

Packages