AI-Powered Pharmacovigilance via Literature Monitoring

This repository contains the full implementation of an AI-powered pharmacovigilance system that automates the extraction of Adverse Event Reports (AERs) and generates detailed narrative case reports from unstructured pharmaceutical literature.

The system combines traditional NLP, biomedical entity extraction, LLM-based summarization, and full-stack deployment components, providing a complete solution for regulatory safety reporting.

Features

Literature Ingestion: Handles pharmaceutical documents (PDF/HTML) and extracts clean text using OCR and parsing.
AER Entity Extraction: Extracts structured data like drug name, dosage, reaction, etc., using BioBERT/SciSpacy + rule-based pipelines.
Vault-compliant JSON Generation: Formats data into standard regulatory JSON schema for downstream use.
Narrative Generation: Uses Claude (via AWS Bedrock) to generate fluent case narratives from structured AER data.
AER Insights Analyzer: Enables users to upload multiple AER JSON case files and receive a dynamic, visual and textual insights of data.
REST API Backend: Exposes core functionalities through a FastAPI server with endpoints for file upload, JSON output, and feedback submission.
Streamlit Frontend: Interactive interface for uploading literature and viewing extracted reports in real time.
Containerized Deployment: Deployed with Docker, NGINX (HTTPS), and AWS EC2.

Folder Structure

pharmacovigilance/
├── aer_entity_extraction/
│   ├── __pycache__/
│   ├── __init__.py
│   ├── ner_pipeline.py
│   ├── rule_extractors.py
│   └── testrun.py
│
├── case_data_construction/
│   ├── __pycache__/
│   ├── __init__.py
│   ├── json_generator.py
│   └── testrun2.py
│
├── literature_ingestion/
│   ├── __pycache__/
│   ├── __init__.py
│   └── text_extraction.py
│
├── narrative_generation/
│   ├── __pycache__/
│   ├── __init__.py
│   ├── narrative_generator.py
│   └── prompt_builder.py
│
├── case_insights_analysis/
│   ├── __pycache__/
│   ├── __init__.py
│   ├── insights_api.py
│   └── sample_cases/
|
├── nginx/
│   ├── certs/
│   ├── Dockerfile
│   └── nginx.conf
│
├── rest_api/
│   ├── __pycache__/
│   ├── __init__.py
│   ├── .dockerignore
│   ├── Dockerfile
│   ├── main.py
│   └── requirements.txt
│
├── streamlit/
│   ├── .dockerignore
│   ├── app.py
│   ├── Dockerfile
│   └── requirements.txt
│
├── docker-compose.yml
└── requirements.txt

Setup & Deployment

1. Clone the Repository

git clone https://github.com/Nidhish-Balasubramanya/AI-Powered-Pharmacovigilance-via-Literature-Monitoring
cd pharmacovigilance-app

2. Local Development

Python 3.10+
Create virtual env:

python -m venv venv && source venv/bin/activate
pip install -r requirements.txt

3. Run via Docker Compose

docker-compose up --build

This starts:

REST API backend on http://localhost:8000
Streamlit frontend on http://localhost:8501
HTTPS via NGINX reverse proxy (certificates must be configured)

REST API Endpoints (FastAPI)

1 `POST /upload`

Description: Upload a pharmaceutical document (.pdf, .txt, or image) to extract AER entities and construct a Vault-compatible JSON.

Request: multipart/form-data
- file: The literature document to upload
Response:

{
  "case_id": "a1b2c3d4...",
  "message": "Case data extracted successfully.",
  "case_json": { }
}

Errors: 400 (Unsupported file type), 500 (Processing failed)

2 `GET /case/{case_id}`

Description: Fetch the JSON-structured AER case generated from the uploaded literature.

Path Param: case_id – Unique ID of the case
Response: JSON content of the AER case
Errors: 404 if not found

3 `POST /narrative`

Description: Generate a narrative from a previously extracted case.

Query Param: case_id
Response:

{
  "case_id": "a1b2c3d4...",
  "narrative": "Patient experienced..."
}

Errors: 404 if case not found

4 `GET /download/case/{case_id}`

Description: Download the structured AER JSON file.

Path Param: case_id
Response: Attachment (.json) as application/json
Errors: 404 if case not found

5 `GET /download/narrative/{case_id}`

Description: Download the generated narrative as a .txt file.

Path Param: case_id
Response: Attachment (.txt) as text/plain
Errors: 404 if narrative not found

6 `POST /validate`

Description: Submit validation feedback for a specific case.

Form Params:
- case_id: ID of the case being reviewed
- feedback: Free-text feedback message
Response:

{
  "message": "Feedback received. Thank you!"
}

7 `GET /health`

Description: Simple health check for uptime and monitoring.

Response:

{
  "status": "ok"
}

Frontend (Streamlit)

The Streamlit UI allows:

Uploading documents
Viewing extracted JSON
Triggering narrative generation
Displaying full case report

Accessible via: https://pharmacovigilence.com/

Tech Stack

Python 3.10, FastAPI, Streamlit
SciSpacy, BioBERT/ClinicalBERT, AWS Bedrock (Claude)
Docker, NGINX, AWS EC2
Vault JSON Schema, OCR, Regex/Ruled NER

License

This work is licensed under CC BY-NC-ND 4.0.

Contact

Nidhish Balasubramanya - nidhishbalasubramanya@gmail.com
For queries or feedback, feel free to open an issue or contact via the email.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
1_literature_ingestion		1_literature_ingestion
2_aer_entity_extraction		2_aer_entity_extraction
3_case_data_construction		3_case_data_construction
4_narrative_generation		4_narrative_generation
5_rest_api		5_rest_api
6_streamlit		6_streamlit
7_nginx		7_nginx
8_case_insights_analysis		8_case_insights_analysis
LICENSE		LICENSE
README.md		README.md
completion_report.md		completion_report.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-Powered Pharmacovigilance via Literature Monitoring

Features

Folder Structure

Setup & Deployment

1. Clone the Repository

2. Local Development

3. Run via Docker Compose

REST API Endpoints (FastAPI)

1 `POST /upload`

2 `GET /case/{case_id}`

3 `POST /narrative`

4 `GET /download/case/{case_id}`

5 `GET /download/narrative/{case_id}`

6 `POST /validate`

7 `GET /health`

Frontend (Streamlit)

Tech Stack

License

Contact

About

Uh oh!

Releases

Packages

Languages

License

Nidhish-Balasubramanya/AI-Powered-Pharmacovigilance-via-Literature-Monitoring

Folders and files

Latest commit

History

Repository files navigation

AI-Powered Pharmacovigilance via Literature Monitoring

Features

Folder Structure

Setup & Deployment

1. Clone the Repository

2. Local Development

3. Run via Docker Compose

REST API Endpoints (FastAPI)

1 POST /upload

2 GET /case/{case_id}

3 POST /narrative

4 GET /download/case/{case_id}

5 GET /download/narrative/{case_id}

6 POST /validate

7 GET /health

Frontend (Streamlit)

Tech Stack

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1 `POST /upload`

2 `GET /case/{case_id}`

3 `POST /narrative`

4 `GET /download/case/{case_id}`

5 `GET /download/narrative/{case_id}`

6 `POST /validate`

7 `GET /health`

Packages