Interactive playground for testing and comparing nine AI agent memory optimization strategies.

## Overview

This project implements nine memory optimization techniques for AI agents, giving you a practical toolkit for managing conversation history and context in production AI systems. Each strategy is implemented as a modular, plug-and-play class behind a unified interface.
## Why Memory Optimization?

- **Token Cost Reduction**: keep LLM API costs from ballooning as conversations grow
- **Context Preservation**: maintain relevant information across conversations
- **Scalability**: handle long conversations efficiently
- **Performance**: optimize response times and memory usage
## The Nine Strategies

1. **Sequential Memory** - complete conversation history storage
2. **Sliding Window Memory** - fixed-size window over the most recent turns (see the sketch after this list)
3. **Summarization Memory** - LLM-based conversation compression
4. **Retrieval Memory (RAG)** - vector similarity search for semantic retrieval
5. **Memory-Augmented Memory** - persistent memory tokens combined with a sliding window
6. **Hierarchical Memory** - multi-layered working memory plus long-term memory
7. **Graph Memory** - knowledge graph with entity relationships
8. **Compression Memory** - intelligent compression with importance scoring
9. **OS-like Memory** - RAM/disk simulation with paging mechanisms
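To give a feel for how small these strategies can be, here is a minimal sketch of the sliding-window idea. It is illustrative only: the class below is hypothetical, and the real implementations (with their exact signatures) live in `memory_strategies.py`.

```python
from collections import deque

class SlidingWindowSketch:
    """Illustrative stand-in for a sliding-window memory strategy."""

    def __init__(self, window_size: int = 4):
        # Two messages per turn (user + assistant); maxlen silently evicts the oldest.
        self.window = deque(maxlen=window_size * 2)

    def add_message(self, user_text: str, ai_text: str) -> None:
        self.window.append(("user", user_text))
        self.window.append(("assistant", ai_text))

    def get_context(self) -> str:
        # The context handed to the LLM stays bounded no matter how long the chat runs.
        return "\n".join(f"{role}: {text}" for role, text in self.window)

    def clear(self) -> None:
        self.window.clear()
```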
## Features

- **Modular Architecture** - strategy pattern for easy swapping
- **Interactive Playground** - Streamlit web interface for testing
- **Performance Analytics** - token usage and response time tracking
- **Batch Comparison** - test multiple strategies simultaneously
- **Production Ready** - FastAPI endpoints for deployment
- **Real-time Metrics** - memory statistics and performance monitoring
## Prerequisites

- Python 3.10+
- An OpenAI API key

## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/AIAnytime/Agent-Memory-Playground.git
   cd Agent-Memory-Playground
   ```

2. Install the dependencies:

   ```bash
   pip install -r requirements.txt
   ```

3. Configure your environment:

   ```bash
   # Create a .env file holding your key
   echo "OPENAI_API_KEY=your_openai_api_key_here" > .env
   ```
## Quick Start

### Streamlit Playground

```bash
streamlit run streamlit_playground.py
```

1. Open http://localhost:8501 in your browser
2. Enter your OpenAI API key in the sidebar
3. Select a memory strategy and start testing!

### REST API Server

```bash
uvicorn api:app --reload
```

- API documentation: http://localhost:8000/docs
- Create sessions, chat, and monitor performance via the REST API

### Command-Line Example

```bash
python example_usage.py
```

- Interactive CLI for testing all memory strategies
- Detailed memory statistics and performance metrics
## Usage Examples

### Basic Usage

```python
from memory_strategies import SequentialMemory, AIAgent

# Initialize a memory strategy and hand it to the agent
memory = SequentialMemory()
agent = AIAgent(memory_strategy=memory)

# Chat with the agent
response = agent.chat("Hello! My name is Alex.")
print(response["ai_response"])

# Memory is automatically preserved for the next interaction
response = agent.chat("What's my name?")
print(response["ai_response"])  # Will remember "Alex"
```
### Retrieval Memory (RAG)

```python
from memory_strategies import RetrievalMemory, AIAgent

# Initialize RAG-based memory
memory = RetrievalMemory(k=3)  # retrieve the top 3 similar conversations
agent = AIAgent(memory_strategy=memory)

# Build up some conversation history
agent.chat("I'm a software engineer working on ML projects")
agent.chat("I prefer Python and love coffee")
agent.chat("I'm building a recommendation system")

# Query with semantic similarity
response = agent.chat("What do you know about my work?")
# Retrieves the relevant context about ML, Python, and recommendation systems
```
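Conceptually, retrieval memory embeds every stored turn and ranks the stored turns against the query embedding. The project uses FAISS for this; the snippet below is only a NumPy sketch of the same top-k cosine-similarity ranking, not the repo's actual index code.

```python
import numpy as np

def top_k_similar(query_vec: np.ndarray, stored_vecs: np.ndarray, k: int = 3) -> np.ndarray:
    """Indices of the k stored embeddings most similar to the query (cosine similarity)."""
    # Normalize so a plain dot product equals cosine similarity.
    q = query_vec / np.linalg.norm(query_vec)
    s = stored_vecs / np.linalg.norm(stored_vecs, axis=1, keepdims=True)
    return np.argsort(s @ q)[::-1][:k]
```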
### REST API

```bash
# Create a session with hierarchical memory
curl -X POST "http://localhost:8000/sessions" \
  -H "Content-Type: application/json" \
  -d '{
    "strategy_type": "hierarchical",
    "system_prompt": "You are a helpful AI assistant.",
    "api_key": "your_openai_key"
  }'

# Chat within the session
curl -X POST "http://localhost:8000/sessions/{session_id}/chat" \
  -H "Content-Type: application/json" \
  -d '{
    "message": "Remember that I prefer concise responses",
    "api_key": "your_openai_key"
  }'
```
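The same two calls from Python with `requests`. The request fields mirror the curl examples above, but the key used to read the session id from the response is an assumption; check http://localhost:8000/docs for the actual schema.

```python
import requests

BASE = "http://localhost:8000"

# Create a session backed by hierarchical memory
session = requests.post(f"{BASE}/sessions", json={
    "strategy_type": "hierarchical",
    "system_prompt": "You are a helpful AI assistant.",
    "api_key": "your_openai_key",
}).json()

session_id = session["session_id"]  # assumed field name; verify against /docs

# Chat within that session
reply = requests.post(f"{BASE}/sessions/{session_id}/chat", json={
    "message": "Remember that I prefer concise responses",
    "api_key": "your_openai_key",
}).json()
print(reply)
```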
## Strategy Comparison

| Strategy | Token Efficiency | Retrieval Speed | Memory Usage | Best For |
|---|---|---|---|---|
| Sequential | Low | Instant | High | Short conversations |
| Sliding Window | High | Instant | Constant | Real-time chat |
| Retrieval (RAG) | High | Fast | Medium | Production systems |
| Hierarchical | Very High | Fast | Medium | Complex applications |
| Graph Memory | Medium | Slow | High | Knowledge systems |
## Architecture

```
AIAgent
├── BaseMemoryStrategy (Abstract)
│   ├── add_message()
│   ├── get_context()
│   └── clear()
├── SequentialMemory
├── SlidingWindowMemory
├── RetrievalMemory
└── ... (6 more strategies)
```
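In code, that contract amounts to a small abstract base class. The sketch below is a plausible reading of the method names shown above, not a copy of the repo's actual base class; the exact signatures may differ in `memory_strategies.py`.

```python
from abc import ABC, abstractmethod

class BaseMemoryStrategy(ABC):
    """Interface every memory strategy fulfils, so AIAgent can swap them freely."""

    @abstractmethod
    def add_message(self, user_text: str, ai_text: str) -> None:
        """Persist one conversation turn."""

    @abstractmethod
    def get_context(self, query: str) -> str:
        """Build the context string injected into the next LLM prompt."""

    @abstractmethod
    def clear(self) -> None:
        """Reset all stored memory."""
```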
### Components

- **Memory Strategies**: modular memory implementations
- **AI Agent**: core agent built on the strategy pattern
- **Utilities**: token counting, embeddings, LLM integration
- **API Layer**: FastAPI endpoints for production use
- **Playground**: Streamlit interface for testing
## Performance Monitoring

Track the essential performance metrics:

```python
{
    "total_content_tokens": 1250,    # raw conversation data
    "total_prompt_tokens": 4800,     # actual LLM costs
    "average_retrieval_time": 0.15,  # memory access speed
    "memory_efficiency": 0.73,       # compression ratio
    "context_relevance_score": 0.89  # quality of retrieved context
}
```
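A stats dict like this makes strategies easy to compare side by side. Here is a small helper for printing the headline numbers; the dict layout is taken from the example above, and how a given strategy actually exposes its stats may differ.

```python
def report(stats: dict) -> None:
    """One-line summary of a strategy's memory stats."""
    overhead = stats["total_prompt_tokens"] - stats["total_content_tokens"]
    print(
        f"prompt tokens: {stats['total_prompt_tokens']} "
        f"(+{overhead} over raw content), "
        f"avg retrieval time: {stats['average_retrieval_time']}, "
        f"relevance: {stats['context_relevance_score']}"
    )

report({
    "total_content_tokens": 1250,
    "total_prompt_tokens": 4800,
    "average_retrieval_time": 0.15,
    "memory_efficiency": 0.73,
    "context_relevance_score": 0.89,
})
# -> prompt tokens: 4800 (+3550 over raw content), avg retrieval time: 0.15, relevance: 0.89
```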
## Configuration Examples

**Sliding Window Memory**

```python
SlidingWindowMemory(window_size=4)  # keep the last 4 conversation turns
```

**Retrieval Memory (RAG)**

```python
RetrievalMemory(k=3)  # retrieve the top 3 similar conversations
```

**Hierarchical Memory**

```python
HierarchicalMemory(
    window_size=2,  # working memory size
    k=3             # long-term retrieval count
)
```
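Any of these drops straight into the agent, for example (same `AIAgent` interface as in the usage examples above):

```python
from memory_strategies import HierarchicalMemory, AIAgent

# Small working window for the immediate exchange, RAG-style recall for older turns
agent = AIAgent(memory_strategy=HierarchicalMemory(window_size=2, k=3))
agent.chat("I deploy everything to Kubernetes.")
print(agent.chat("What do I deploy to?")["ai_response"])
```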
## Deployment

### Docker

```dockerfile
# Base image matches the Python 3.10+ requirement stated above
FROM python:3.10-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install -r requirements.txt
COPY . .
EXPOSE 8000
CMD ["uvicorn", "api:app", "--host", "0.0.0.0", "--port", "8000"]
```
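Build and run it like so (the image tag is a placeholder; the `.env` file is the one created during installation):

```bash
docker build -t agent-memory-playground .
docker run -p 8000:8000 --env-file .env agent-memory-playground
```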
### Environment Variables

```env
OPENAI_API_KEY=your_openai_api_key
OPENAI_MODEL=gpt-4o-mini
EMBEDDING_MODEL=text-embedding-3-small
```
## Testing

Run the test suite:

```bash
python -m pytest tests/
```

Run the performance benchmarks:

```bash
python benchmark.py
```
## Documentation

- **Technical Guide** - comprehensive implementation details
- **API Documentation** - FastAPI interactive docs
- **Strategy Comparison** - performance analysis
- **Production Guide** - deployment best practices
## Contributing

We welcome contributions! Please see our Contributing Guidelines for details.

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
## License

This project is licensed under the Apache 2.0 License; see the LICENSE file for details.
## Acknowledgments

- **OpenAI** for providing the GPT models and embeddings
- **Streamlit** for the amazing web framework
- **FastAPI** for the high-performance API framework
- **FAISS** for efficient vector similarity search
## Connect

- **Website**: aianytime.net
- **Creator Portfolio**: sonukumar.site
- **YouTube**: @AIAnytime
- **Issues**: GitHub Issues
Built with ❤️ by AI Anytime

Star this repo if you find it helpful!