Skip to content

A chat bot to talk to your multiple PDFs (vectorized via text embedding models) at once; provides grounded answers to your queries to the GPT-4o powered bot

Notifications You must be signed in to change notification settings

ImmrBhattarai/PDF-Chatbot-AI-Foundary

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

PDF-Chatbot-AI-Foundary | DIO Project

A chat bot to talk to your multiple PDFs (vectorized via text embedding models) at once; provides grounded answers to your queries to the GPT-4o powered bot

Read my detailed article to see how you can do this as well: Medium Article

Challenge Completed in Azure Foundry Hub

What I did:

  1. Created an Azure Foundry Project (Via Azure portal)

  2. Created a Hub for the project

image image

  1. Model + Endpoints | Manage Deployments of your Models and services

image

3.1. Created a chat completion model (GPT-4o)

image image image

3.2. Created a text embedding model (text-embedding-3-large)

image image

After completing both 3.1 and 3,2, you will see something like this:

image

  1. Set the tone of the chatbot to define my use cases. Set the system instruction.

"You indentiy yourself as a visa counselor for the USA Visa related queries, either for pre-processing or post-processing stages. You only help with visa for the USA and particularly for Student Visa, nothing more. In case someone asks you anything beyond, silently deny that. You go above and beyond only to offer help in your subject as a Visa Counselor."

image image

  1. Add Data Sources and configure them

image image image image

Instead of scientic paper research, I opted for visa guide. I collected public PDFs from the internet for this purpose.

5.1 Select Appropriate VM size

image

5.2 Ingestion steps

image image

  1. Test the chat playground and observe model responses

image image image image

  1. Deploy your model

image

  1. Utilized Endpoint in localhost

What I did in this project (Summary)

  • Upload PDF files containing information relevant to your study or project.
  • Implement a vector search system to index and retrieve information from PDFs.
  • Use artificial intelligence to generate responses based on the content of uploaded documents.
  • Develop an interactive chat where it is possible to ask questions and get contextual answers based on the files.

What I learned in this project (Summary)

  • What is Azure Foundry Hub
  • Model Catalog and various models to explore
  • Creating chat models and embedding models
  • Loading Data Sources and content filter
  • Creating Azure AI Search and Foundry resources
  • Model Parameters and ways to deploy model
  • Using system instructions to narrow down the scope of chatbot
  • Insights on other available Azure OpenAI services
  • Indexing, semantic search and model groundness
  • Deploying chatbot endpoints
  • Using endpoints in local host for a live chat environnment andexperience.

What was my experience with this project?

Trying out the hands-on Azure Foundry for the very first time felt really confusing, but navigating every other settings, section, options, menus, etc made me feel comfortable in no time. The DIO challenge added new confidence in me to try out the new agentic chatbot (still in preview).

About

A chat bot to talk to your multiple PDFs (vectorized via text embedding models) at once; provides grounded answers to your queries to the GPT-4o powered bot

Topics

Resources

Stars

Watchers

Forks