
Vexa Contextual RAG

A hybrid search system that combines semantic search (Qdrant) and text search (Elasticsearch BM25) for enhanced retrieval-augmented generation (RAG), optimized for processing meeting data from the Vexa API.

What are Contextual Embeddings?

Contextual embeddings, pioneered by Anthropic, address a critical limitation in traditional RAG systems: semantic isolation of chunks.

The Problem with Traditional RAG

In conventional RAG implementations, documents are divided into chunks for vector storage and retrieval. However, this chunking process often results in the loss of contextual information, making it difficult for the retriever to determine relevance accurately. For example, a chunk containing "the function returns an error" loses crucial context about which function, what type of error, and under what conditions.

How Contextual Embeddings Solve This

Contextual embeddings enhance document chunks by prepending relevant context from the full document before embedding or indexing (see the code sketch after the list below). This approach:

  1. Preserves Semantic Context: Each chunk retains information about its broader document context
  2. Improves Retrieval Accuracy: Better matching between queries and relevant content
  3. Reduces Retrieval Errors: More precise document selection for RAG applications
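
The idea can be sketched in a few lines. The prompt, model choice, and function names below are illustrative assumptions rather than this repository's exact implementation; the sketch assumes the OpenAI chat API for context generation and Voyage AI for embeddings, matching the API keys listed under Prerequisites:

    # Sketch: prepend LLM-generated document context to a chunk before embedding.
    # Assumes OPENAI_API_KEY and VOYAGE_API_KEY are set in the environment.
    from openai import OpenAI
    import voyageai

    openai_client = OpenAI()
    voyage_client = voyageai.Client()

    def contextualize_chunk(document, chunk):
        """Ask an LLM to situate the chunk within the whole document."""
        prompt = (
            "Here is a document:\n" + document
            + "\n\nHere is a chunk from it:\n" + chunk
            + "\n\nWrite a short context that situates this chunk in the document."
        )
        response = openai_client.chat.completions.create(
            model="gpt-4o-mini",  # illustrative model choice
            messages=[{"role": "user", "content": prompt}],
        )
        # Prepend the generated context so the chunk carries its document context.
        return response.choices[0].message.content + "\n\n" + chunk

    def embed_contextualized(document, chunk):
        enriched = contextualize_chunk(document, chunk)
        return voyage_client.embed([enriched], model="voyage-2").embeddings[0]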

Vexa API Integration

This implementation is specifically optimized to process meeting data accessed from the Vexa API. The system includes:

  • Meeting Transcript Processing: Handles Vexa's meeting transcript format with segments, speakers, and timestamps
  • Speaker-Aware Chunking: Groups consecutive messages by speaker and topic for better context preservation (see the sketch after this list)
  • Contextual Processing Pipeline: Preserves conversation context and speaker information
  • Optimized Indexing: Efficient processing of meeting data with metadata preservation
  • Filtering: Enables filtering by specific speakers and meeting IDs for targeted retrieval
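
A minimal sketch of the speaker-aware chunking idea, assuming a simplified segment shape (dicts with speaker, start, and text keys); the real Vexa transcript payload may carry more fields:

    # Sketch: merge consecutive transcript segments from the same speaker into
    # one chunk; start a new chunk on speaker change or when a size cap is hit.
    def chunk_by_speaker(segments, max_chars=1000):
        chunks, current = [], None
        for seg in segments:
            same_speaker = current is not None and current["speaker"] == seg["speaker"]
            fits = current is not None and len(current["text"]) + len(seg["text"]) < max_chars
            if same_speaker and fits:
                current["text"] += " " + seg["text"]
            else:
                if current is not None:
                    chunks.append(current)
                current = {"speaker": seg["speaker"], "start": seg.get("start"), "text": seg["text"]}
        if current is not None:
            chunks.append(current)
        return chunks

    segments = [
        {"speaker": "Alice", "start": 0.0, "text": "Let's review the roadmap."},
        {"speaker": "Alice", "start": 4.2, "text": "First, the Q3 milestones."},
        {"speaker": "Bob", "start": 9.8, "text": "The API migration is on track."},
    ]
    # Two chunks: Alice's consecutive segments merge, Bob starts a new chunk.
    print(chunk_by_speaker(segments))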

Quick Start

Prerequisites

  • Docker and Docker Compose
  • Python 3.8+
  • OpenAI API key
  • Voyage API key (for embeddings)
  • Access to Vexa API (optional, for meeting data processing)

Setup

  1. Clone the repository

    git clone https://github.com/Vexa-ai/vexa-contextual-rag
    cd vexa-contextual-rag
  2. Set up environment variables

    cp .env.example .env
    # Edit .env with your API keys (an illustrative layout follows these steps)
  3. Start the services

    docker-compose up -d
  4. Install Python dependencies

    pip install -r requirements.txt
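
For step 2, a typical .env layout might look like the following. The variable names here are illustrative guesses; .env.example in the repository is the authoritative list:

    # Illustrative .env layout; confirm the actual names in .env.example
    OPENAI_API_KEY=your-openai-api-key
    VOYAGE_API_KEY=your-voyage-api-key
    VEXA_API_KEY=your-vexa-api-key   # optional, only for Vexa meeting data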

Usage

See usage.ipynb for the complete indexing and querying walkthrough.
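
As a rough sketch of the hybrid query path, the snippet below queries both stores and merges the two rankings with reciprocal rank fusion. The collection and index names, the embedding model, and the fusion step are assumptions for illustration; the notebook's actual API may differ:

    # Sketch: run the semantic leg (Qdrant) and the lexical leg (Elasticsearch
    # BM25) for the same query, then merge rankings with reciprocal rank fusion.
    from elasticsearch import Elasticsearch
    from qdrant_client import QdrantClient
    import voyageai

    qdrant = QdrantClient("localhost", port=6333)
    es = Elasticsearch("http://localhost:9200")
    voyage = voyageai.Client()

    def hybrid_search(query, top_k=5):
        # Semantic leg: embed the query and search the Qdrant collection.
        query_vector = voyage.embed([query], model="voyage-2").embeddings[0]
        semantic_hits = qdrant.search(
            collection_name="meeting_chunks",  # assumed collection name
            query_vector=query_vector,
            limit=top_k,
        )
        # Lexical leg: BM25 match query against the Elasticsearch index.
        bm25_hits = es.search(
            index="meeting_chunks",  # assumed index name
            query={"match": {"text": query}},
            size=top_k,
        )["hits"]["hits"]

        # Reciprocal rank fusion: each leg contributes 1 / (k + rank).
        k, scores = 60, {}
        for rank, hit in enumerate(semantic_hits):
            scores[str(hit.id)] = scores.get(str(hit.id), 0.0) + 1.0 / (k + rank + 1)
        for rank, hit in enumerate(bm25_hits):
            scores[hit["_id"]] = scores.get(hit["_id"], 0.0) + 1.0 / (k + rank + 1)
        return sorted(scores, key=scores.get, reverse=True)[:top_k]

    print(hybrid_search("what did Alice say about the Q3 milestones?"))

Filtering by speaker or meeting ID would map to a Qdrant payload filter and an Elasticsearch bool query on the corresponding metadata fields.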

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests if applicable
  5. Submit a pull request

License

MIT

Support

For issues and questions, please open an issue on the GitHub repository.
