- Install dependencies:

  ```
  pip3 install -r requirements.txt
  ```

- Set your `OPENAI_API_KEY` in the environment, if necessary.
- Prepare the docs in Markdown format. Split longer documents into sections, so that each document covers one specific concept.
- Prepare the metadata file. Use the following JSON Lines format:

  ```
  {"url": "relative/path/to/doc.md", "category": "edgedb_general"}
  ```

  The current list of categories is:

  - `edgeql_and_sdl`
  - `ddl`
  - `integrations`
  - `edgedb_general`
  - `other`
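  For illustration, a metadata file covering a few documents might look like this (the paths below are made up):

  ```
  {"url": "edgeql/insert.md", "category": "edgeql_and_sdl"}
  {"url": "integrations/client_typescript.md", "category": "integrations"}
  {"url": "overview.md", "category": "edgedb_general"}
  ```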
- See `build_rag.ipynb` for a step-by-step configuration guide.
- See `demo.py` for an example chatbot implementation with answer streaming.
The index consists of two components:
- vectorstore to store document embeddings and perform vector search.
- docstore to store full documents.
To build the index, we need the documents stored as Markdown files, as well as the metadata file. For every document listed in the metadata file, the index builder will (as sketched below):

- Load the document.
- Calculate an embedding and store it in the vectorstore.
- Store the original full document in the docstore (represented by a JSON file).
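Roughly, the build loop amounts to the following sketch (`vectorstore` and `docstore` here stand for the two components described above and are assumed to be initialized already; `Index.from_metadata`, shown below, wraps these steps):

```python
import json
from pathlib import Path

metadata_path = Path("resources/doc_metadata.jsonl")
lib_path = Path("../docs_md")

for line in metadata_path.read_text().splitlines():
    meta = json.loads(line)                      # {"url": ..., "category": ...}
    text = (lib_path / meta["url"]).read_text()  # load the Markdown document
    # embed the document and store the vector alongside its metadata
    vectorstore.add_texts(
        [text],
        metadatas=[{"source": meta["url"], "category": meta["category"]}],
    )
    # keep the full document in the docstore for the generation stage
    docstore.mset([(meta["url"], text)])
```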
This is the backbone of the chatbot application. It is represented by a LangChain chain that takes each user message through multiple processing stages before generating a response.
The stages include:
1. Contextualization: turning a message into a search query using chat history.
2. Query analysis: breaking down the search query into a similarity search part and a filter.
3. Retrieval: using the vectorstore to retrieve relevant documents with similarity search.
4. Generation: producing the final answer based on the documents and chat history.
Stages 1, 2 and 4 perform one LLM request each, with stage 4 doing the heavy lifting of synthesizing the answer.
- Build the index:

  ```python
  index = Index.from_metadata(
      metadata_path=Path("resources/doc_metadata.jsonl"),
      lib_path=Path("../docs_md"),
      persist_path=persist_path,
      embedding_function=embedding_function,
  )
  ```
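  In the call above, `persist_path` and `embedding_function` are assumed to be defined beforehand; a minimal sketch, assuming OpenAI embeddings via the `langchain-openai` package, could be:

  ```python
  from pathlib import Path
  from langchain_openai import OpenAIEmbeddings

  persist_path = Path("resources/index")   # example location for the persisted index
  embedding_function = OpenAIEmbeddings()  # reads OPENAI_API_KEY from the environment
  ```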
- Use the index to build a retriever:

  ```python
  retriever = build_retriever(
      llm=llm,
      vectorstore=index.vectorstore,
      docstore=index.docstore,
  )
  ```
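  Here, `llm` is a LangChain chat model set up elsewhere; for example, assuming the OpenAI integration:

  ```python
  from langchain_openai import ChatOpenAI

  # The model name is only an example; use whichever chat model the project is configured for.
  llm = ChatOpenAI(model="gpt-4o", temperature=0)
  ```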
- Set up a history callable. The RAG itself does not store or handle chat history in any way. Instead, it calls this callable with arguments specified in the config to get relevant chat history for every generation.

  ```python
  # Import locations may differ slightly across LangChain versions.
  from typing import List

  from langchain_community.chat_message_histories import ChatMessageHistory
  from langchain_core.runnables import ConfigurableFieldSpec


  def parse_history(raw_history):
      # Parses message history from a list of pairs of strings.
      history = ChatMessageHistory()
      for human, ai in raw_history:
          history.add_user_message(human)
          history.add_ai_message(ai)
      return history


  # Description of parse_history's arguments.
  history_factory_config = [
      ConfigurableFieldSpec(
          id="raw_history",
          annotation=List,
          name="Raw chat message history",
          description="List of messages coming from frontend",
          default=[],
          is_shared=True,
      ),
  ]

  # Example call with this setup:
  response = generator.stream(
      {"input": question},
      config={"configurable": {"raw_history": history}},  # attach the raw history coming from the frontend
  )
  ```
- Use the retriever and the history callable to build the generator:

  ```python
  generator = build_generator(
      llm=llm,
      retriever=retriever,
      get_session_history_callable=parse_history,
      history_factory_config=history_factory_config,
  )
  ```
- Use LangChain's `stream` to get a streaming response.
  - Make sure to pass in the newest user message, as well as the previous chat history.
  - Using chat history enables the chatbot to handle followup questions and have multi-turn conversations.
  - The chatbot searches for documents relevant to the latest message every time before generating the answer.

  ```python
  response = generator.stream(
      {"input": question},
      config={"configurable": {"raw_history": history}},
  )
  ```
- Iterate over the response:

  ```python
  for segment in response:
      ...  # deal with each streamed segment of the answer
  ```
The overall structure of the response is this:
```python
{
    'input': 'In TypeScript, how do I do an insert?',
    'chat_history': [],
    'retrieval_result': {
        'input': 'In TypeScript, how do I do an insert?',
        'search_terms': Search(
            query='insert in TypeScript',
            category='integrations'
        ),
        'documents': [
            Document(
                page_content='Full text of the doc',
                metadata={
                    'source': 'path/to/doc.md',
                    'category': 'integrations'
                }
            ),
        ]
    },
    'answer': QuotedAnswer(
        answer='To perform an insert in TypeScript using EdgeDB...',
        citations=[
            Citation(
                source_id=0,
                quote='Verbatim quote from the doc'
            )
        ]
    )
}
```
Note the Pydantic types used by LangChain under the hood:

```python
from langchain_core.documents import Document
from src.core.retriever import Search
from src.core.generator import Citation, QuotedAnswer
```
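If you do not need streaming, the generator, like other LangChain runnables, should also support `invoke`, which would return the dictionary above in one piece. A sketch based on the structure shown:

```python
result = generator.invoke(
    {"input": question},
    config={"configurable": {"raw_history": history}},
)

print(result["answer"].answer)               # the synthesized answer text
for citation in result["answer"].citations:  # citations pointing back to the retrieved docs
    print(citation.source_id, citation.quote)
```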
However, when streaming, you are not going to get this entire dictionary all at once. LangChain streams the output of the query analysis first, then proceeds to stream the answer.
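A minimal way to consume the stream might look like the sketch below; the exact shape of each streamed segment depends on the LangChain version, so treat this as illustrative only.

```python
for segment in generator.stream(
    {"input": question},
    config={"configurable": {"raw_history": history}},
):
    # Each segment is a partial piece of the output dictionary shown above.
    if "retrieval_result" in segment:
        docs = segment["retrieval_result"].get("documents", [])
        print(f"retrieved {len(docs)} documents")
    if "answer" in segment:
        # Answer content arrives incrementally; accumulate it or forward it to the UI.
        print(segment["answer"], end="", flush=True)
```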
For an example of handling LangChain streaming output, see `demo.py`.