TaxBot is a Retrieval-Augmented Generation (RAG) application designed to assist users with queries related to Indian taxation. It combines state-of-the-art machine learning models and retrieval techniques to deliver accurate and context-aware responses.
- RAG Architecture: Combines retrieval and generation to enhance response accuracy.
- Domain-Specific Expertise: Focused on Indian taxation laws and policies.
- Technologies Used: Built using LangChain, ChromaDB, and Zephyr-7b-beta model from Hugging Face.
- Efficient Knowledge Retrieval: Integrates a custom-built knowledge base for precise information extraction.
- LangChain: Framework for building applications powered by language models.
- ChromaDB: Vector database for managing and retrieving embeddings.
- Hugging Face Zephyr-7b-beta: A fine-tuned large language model optimized for Indian taxation.
- Python 3.8+
- Pip package manager
- Virtual environment (optional but recommended)
- Clone the repository:
git clone https://github.com/pratikroy311/TAXBot.git cd taxbot
- Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # For Linux/macOS venv\Scripts\activate # For Windows
- Install dependencies:
pip install -r requirements.txt
- Set up your environment variables:
- Create a
.env
file in the root directory. - Add the required API keys and configuration settings. Example:
OPENAI_API_KEY=your_openai_api_key CHROMADB_PATH=./data/chromadb
- Create a
- Run the application:
python app.py
- Open your browser and navigate to the local server URL (e.g.,
http://127.0.0.1:5000
). - Input your taxation-related query, and TaxBot will provide detailed responses.
TaxBot/
├── app.py # Main application entry point
├── requirements.txt # Python dependencies
├── data/ # ChromaDB data storage
├── models/ # Model-related files
├── utils/ # Helper functions and utilities
└── README.md # Project documentation
- Expand the knowledge base to include more domains beyond taxation.
- Enhance multi-language support for Indian regional languages.
- Integrate with external APIs for live updates on taxation rules.
Contributions are welcome! Please fork the repository and create a pull request with your changes.
This project is licensed under the MIT License.
- Hugging Face for the Zephyr-7b-beta model.
- LangChain for the framework support.
- ChromaDB for efficient vector database management.