A Large Language Model (LLM)-powered Anime Recommender System built with LangChain, the GROQ API, and ChromaDB, deployed via Kubernetes on a GCP VM using Docker & Minikube.
- Personalized anime recommendations powered by GROQ LLM
- Retrieval-Augmented Generation (RAG) using Chroma vector DB
- Prompt-driven contextual search over anime metadata
- Built-in Streamlit app UI
- End-to-end containerized deployment using Kubernetes on GCP
- Monitoring with Grafana + Helm
```
.
├── app/
│   └── app.py                # Streamlit frontend app
├── config/
│   └── config.py             # Loads .env vars (e.g., GROQ API key)
├── pipeline/
│   ├── build_pipeline.py     # Builds the vector DB from the anime dataset
│   └── pipeline.py           # Runs the LLM inference pipeline
├── src/
│   ├── data_loader.py        # Reads and processes raw anime CSV data
│   ├── vector_store.py       # Embeds + stores vectors into ChromaDB
│   ├── prompt_template.py    # Custom prompt template for the LLM
│   └── recommender.py        # GROQ LLM + LangChain RetrievalQA logic
├── utils/
│   ├── custom_exception.py   # Centralized error tracking
│   └── logger.py             # File-based logging support
├── Dockerfile                # Docker build config
├── llmops-k8s.yaml           # Kubernetes Deployment & Service spec
├── .env                      # Environment secrets (GROQ API key, etc.)
└── README.md                 # You're here!
```
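For reference, a minimal sketch of what the `utils` helpers might look like (names and details here are assumptions; the real implementations live in `utils/`):

```python
# utils/logger.py (sketch)
import logging

def get_logger(name: str, log_file: str = "app.log") -> logging.Logger:
    """Return a logger that writes timestamped records to a file."""
    logger = logging.getLogger(name)
    if not logger.handlers:  # avoid duplicate handlers on repeated imports
        handler = logging.FileHandler(log_file)
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger


# utils/custom_exception.py (sketch)
class CustomException(Exception):
    """Wraps an original error with context for centralized tracking."""

    def __init__(self, message: str, original: Exception | None = None):
        super().__init__(f"{message}: {original!r}" if original else message)
```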
**Data Processing** (`data_loader.py`)
- Reads the raw anime CSV
- Combines metadata into a text corpus
- Saves the cleaned data
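A minimal sketch of this step, assuming pandas; the column names `Name`, `Genres`, and `Synopsis` are placeholders for whatever the real dataset uses:

```python
import pandas as pd

def load_and_process(csv_path: str, out_path: str) -> pd.DataFrame:
    """Read the raw anime CSV, merge metadata into one text field, save it."""
    df = pd.read_csv(csv_path).dropna(subset=["Name"])
    # Column names are placeholders; adjust to the actual dataset.
    df["combined_info"] = (
        "Title: " + df["Name"].astype(str)
        + " | Genres: " + df["Genres"].astype(str)
        + " | Overview: " + df["Synopsis"].astype(str)
    )
    df[["combined_info"]].to_csv(out_path, index=False)
    return df
```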
**Embedding & Storage** (`vector_store.py`, `build_pipeline.py`)
- Splits the text into chunks
- Converts chunks into embeddings using HuggingFace
- Saves them in the Chroma vector DB
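A sketch using LangChain's community integrations; the embedding model and chunk sizes are assumptions:

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.document_loaders import CSVLoader
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import Chroma

def build_vector_store(csv_path: str, persist_dir: str = "chroma_db") -> Chroma:
    """Chunk the processed corpus, embed it, and persist it to Chroma."""
    docs = CSVLoader(file_path=csv_path).load()
    chunks = RecursiveCharacterTextSplitter(
        chunk_size=1000, chunk_overlap=100
    ).split_documents(docs)
    # Embedding model is an assumption; any sentence-transformers model works.
    embeddings = HuggingFaceEmbeddings(model_name="all-MiniLM-L6-v2")
    return Chroma.from_documents(chunks, embeddings, persist_directory=persist_dir)
```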
**Prompt Engineering** (`prompt_template.py`)
- Creates a custom prompt template that instructs the LLM to recommend anime titles
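A sketch of such a template; the exact wording is hypothetical:

```python
from langchain.prompts import PromptTemplate

# Hypothetical wording; the real template lives in src/prompt_template.py.
ANIME_PROMPT = PromptTemplate(
    input_variables=["context", "question"],
    template=(
        "You are an expert anime recommender. Using only the context below,\n"
        "suggest three anime titles that match the user's request, each with\n"
        "a short reason.\n\n"
        "Context:\n{context}\n\n"
        "Request: {question}\n\n"
        "Recommendations:"
    ),
)
```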
**LLM-powered Retrieval** (`recommender.py`, `pipeline.py`)
- Uses LangChain RetrievalQA with the GROQ LLM
- Fetches the top relevant chunks from ChromaDB
- Produces three anime recommendations
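A minimal sketch wiring the pieces together; the `k` value and temperature are assumptions, and `build_vector_store` / `ANIME_PROMPT` refer to the sketches above:

```python
from langchain.chains import RetrievalQA
from langchain_groq import ChatGroq

def build_qa_chain(vector_store, prompt) -> RetrievalQA:
    """Retrieve the top chunks from Chroma and have the GROQ LLM answer."""
    llm = ChatGroq(model="llama3-8b-8192", temperature=0)
    retriever = vector_store.as_retriever(search_kwargs={"k": 4})
    return RetrievalQA.from_chain_type(
        llm=llm,
        retriever=retriever,
        chain_type="stuff",
        chain_type_kwargs={"prompt": prompt},
    )

# Usage:
# chain = build_qa_chain(build_vector_store("data/processed.csv"), ANIME_PROMPT)
# print(chain.invoke({"query": "anime like Attack on Titan"})["result"])
```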
**Frontend Interface** (`app.py`)
- Simple Streamlit UI for entering queries
- Returns recommendations instantly
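What the Streamlit layer might look like, reusing the helpers sketched above (the CSV path is a placeholder):

```python
import streamlit as st

@st.cache_resource  # build the chain once per session, not on every rerun
def get_chain():
    # Assumed helpers from the sketches above; the path is a placeholder.
    store = build_vector_store("data/processed_anime.csv")
    return build_qa_chain(store, ANIME_PROMPT)

st.title("Anime Recommender")
query = st.text_input("What kind of anime are you in the mood for?")
if query:
    with st.spinner("Finding recommendations..."):
        answer = get_chain().invoke({"query": query})["result"]
    st.markdown(answer)
```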
**Create a GCP VM instance**
- Install: Docker → Minikube → kubectl
- Setup: `git clone` your repo onto the VM

**Docker & Minikube**
```bash
docker build -t anime-recommender .
minikube start
```
**Secrets**
- Store `.env` securely
- Create a Kubernetes Secret from the `.env` file (e.g. `kubectl create secret generic <secret-name> --from-env-file=.env`)
**Deploy on K8s**
```bash
kubectl apply -f llmops-k8s.yaml
minikube tunnel   # run in a separate terminal to expose the LoadBalancer
```
| Type | Use Case | Access |
|---|---|---|
| ClusterIP | Internal microservices only | Not exposed outside the cluster |
| NodePort | Expose on the VM IP + a specific port | `http://<VM-IP>:<NodePort>` |
| LoadBalancer | Internet-facing deployment | GCP auto-assigns an external IP address |

> **Production tip:** use `LoadBalancer` for real deployments.
**Monitoring (Grafana + Helm)**
- Create the namespace: `kubectl create namespace monitoring`
- Install Helm & Grafana
- Set up Grafana Cloud, get a token, and configure observability
Create a `.env` file in the project root with:

```
GROQ_API_KEY=your_groq_api_key_here
MODEL_NAME=groq/llama3-8b-8192
```
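For reference, a minimal sketch of how `config/config.py` might read these values (assuming the `python-dotenv` package):

```python
# config/config.py (sketch) -- assumes python-dotenv is installed
import os
from dotenv import load_dotenv

load_dotenv()  # read key=value pairs from .env into the environment

GROQ_API_KEY = os.getenv("GROQ_API_KEY")
MODEL_NAME = os.getenv("MODEL_NAME", "groq/llama3-8b-8192")

if not GROQ_API_KEY:
    raise RuntimeError("GROQ_API_KEY is not set; add it to your .env file")
```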
Ask questions like:
- "Suggest some action-packed anime like Attack on Titan"
- "What are the best anime series for beginners?"
- Python, LangChain, GROQ LLM
- HuggingFace, ChromaDB
- Streamlit, Docker, Kubernetes, Minikube
- GCP VM, Grafana, Helm