This is a project developed in the "End-to-End AI Engineering Bootcamp," in which I enrolled in August 2025.
Company Y is a professional networking platform used by job seekers and recruiters. The company currently uses a rule-based system that flags job postings as potentially fraudulent; it also receives complaints from users of the website about suspicious postings. Fraud analysts at the company determine the validity of these flags to ensure they are not false positives. Occasionally, ticket volume is too high for the team to handle.
Create an AI assistant that the fraud analyst team can leverage to resolve tickets and complaints efficiently.
- End-users: Fraud Analysts
- Purpose: Identify whether a job posting is real or fake (fraudulent) and perform text analysis to identify possible fraudulent indicators.
Real/Fake Job Posting Dataset
Based on a Kaggle dataset of ~18K postings, of which around 600 are fraudulent. It consists of 18 columns, including the following text fields:
- Job title
- Salary range
- Company profile
- Job description
- Location of posting
Source: https://www.kaggle.com/datasets/shivamb/real-or-fake-fake-jobposting-prediction/data
Prompt Category | Example |
---|---|
Classification | "Is this job posting real or fake? [Job posting text]" |
Explanation | "Why do you think this job posting is fake (or real)?" |
Feature Extraction | "What features make this job posting fake?" |
Comparison | "Which of these two postings is more likely to be a scam? [Job 1] vs. [Job 2]" |
Step-by-Step | "Evaluate this posting for fraud step by step. [Job posting text]" |
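The prompt categories above could be driven by a small template helper. A minimal sketch in plain Python; the template wording and the `build_prompt` function are illustrative assumptions, not the project's actual implementation:

```python
# Hypothetical prompt builder for the analyst assistant's prompt categories.
# Template strings mirror the examples in the table above.
PROMPT_TEMPLATES = {
    "classification": "Is this job posting real or fake?\n\n{posting}",
    "explanation": "Why do you think this job posting is fake (or real)?\n\n{posting}",
    "feature_extraction": "What features make this job posting fake?\n\n{posting}",
    "comparison": ("Which of these two postings is more likely to be a scam?\n\n"
                   "[Job 1]\n{posting}\n\n[Job 2]\n{posting_b}"),
    "step_by_step": "Evaluate this posting for fraud step by step.\n\n{posting}",
}

def build_prompt(category: str, posting: str, posting_b: str = "") -> str:
    """Fill the template for a prompt category with the job posting text."""
    template = PROMPT_TEMPLATES[category]
    return template.format(posting=posting, posting_b=posting_b)
```

Keeping templates in one dictionary makes it easy to version and A/B-test prompt wording per category during the prompt-engineering sprint.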
- Director of IT/Fraud
- Director of Data Science
- Fraud Analyst
- Data Scientist
- Retrieval quality
- End-to-End system performance (response time)
- Final answer scoring
- Task inference from user input
- Reasoning steps, tool use, and final responses
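Retrieval quality could be scored with simple rank metrics such as recall@k. A hedged sketch of what such a check might look like; the function names and metric choice are assumptions, not the project's actual evaluation harness:

```python
def recall_at_k(retrieved_ids, relevant_ids, k=5):
    """Fraction of relevant documents that appear in the top-k retrieved results."""
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & set(relevant_ids)) / len(relevant_ids)

def mean_recall_at_k(runs, k=5):
    """Average recall@k over a list of (retrieved_ids, relevant_ids) query pairs."""
    scores = [recall_at_k(retrieved, relevant, k) for retrieved, relevant in runs]
    return sum(scores) / len(scores) if scores else 0.0
```

Wrapping each labeled query in a pair like this also makes it straightforward to log per-query scores alongside end-to-end latency.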
- Knowledge Base Preparation - collect and preprocess data appropriate for the RAG system. Chunk data into manageable pieces and upload to vector database.
- API Integration - utilize FastAPI to create context-aware endpoints with robust error handling. Implement feedback loops within workflows for continuous learning and system improvement.
- Testing and Validation - create RAG-specific testing for retrieval accuracy, relevance, and latency.
- Deployment Pipeline - Create automated deployment pipelines for code updates.
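The knowledge-base preparation step above (chunking postings before upload to the vector database) might look like this word-window sketch. The chunk size, overlap, and function name are assumptions chosen for illustration:

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into overlapping word windows suitable for embedding.

    Overlap preserves context that would otherwise be cut at chunk boundaries.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # final window already covers the tail of the text
    return chunks
```

Each chunk would then be embedded and upserted into the vector database along with metadata such as the posting ID and its fraud label.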
- Sprint 1: RAG Prototyping - embedding models & vector DB
- Sprint 2: Retrieval Quality & Prompt engineering
- Sprint 3: Autonomous agents
- Sprint 4: Agents & Agentic System
- Sprint 5: From Basic to Agentic RAG
- Sprint 6: Multi-Agent Systems
- Sprint 7: Deployment, Optimization and Reliability