RAG pipeline for climate literature using open models (Llama 3), local embeddings (FAISS), and semantic HTML chunking. Built as part of my BA thesis.
See notebook header.
Make sure you have:
- a compatible Python environment (Python 3.10+ recommended)
- access to the KISSKI API (or replace it with a local Llama3 endpoint)
- your input files stored in the correct directory structure
an
.env
file
This code is licensed for noncommercial use only.
You may use, share, and adapt it freely for research, academic, or personal projects.
Commercial use is strictly prohibited without written permission.
See LICENSE for full terms.