Extracts key information from 1,025 historical NOAA Form 17-4 (Initial Report On Weather Modification Activities) PDF files using LLM integration. Saves extracted structured information into CSV file.
- Navigate to
code/
- Install required Python dependencies
pip install requirements.txt
- Obtain your own OpenAI and LLM Whisperer credentials and save your API keys in
.env
- Run
python llm-extractor.py
to generate the dataset. This will take about 2.5 hours. - View the generated dataset in
dataset/final/