Powered by the OpenAI API and LangChain.

Any uploaded document is parsed and upserted into the Pinecone vector DB.

The contextual chat uses LangChain LLM chain methods that retrieve vectorized documents from the Pinecone DB and then use OpenAI embeddings to process the conversation.

The tech stack includes LangChain, Pinecone, TypeScript, OpenAI, and Next.js. LangChain is a framework that makes it easier to build scalable AI/LLM apps and chatbots. Pinecone is a vector store for storing embeddings and your PDF text, so that similar documents can be retrieved later.
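Under the hood, a vector store like Pinecone ranks stored document embeddings by their similarity to the query embedding, typically using cosine similarity. As a rough illustration (not the actual Pinecone client API), the ranking step looks like this:

```typescript
// Sketch of cosine-similarity ranking, as performed internally by a
// vector store. This is illustrative only; Pinecone does this server-side.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0,
    normA = 0,
    normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Rank document embeddings against a query embedding, highest score first.
function rankBySimilarity(
  query: number[],
  docs: { id: string; vector: number[] }[]
): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.vector) }))
    .sort((x, y) => y.score - x.score);
}
```

The top-ranked documents are then passed to the LLM as context for answering the question.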
- Clone the repo or download the ZIP:

  ```
  git clone [github https url]
  ```

- Install packages:

  ```
  npm install
  ```
- Set up your `.env` file: copy `.env.example` into `.env`. Your `.env` file should look like this:

  ```
  OPENAI_API_KEY=

  PINECONE_API_KEY=
  PINECONE_ENVIRONMENT=
  PINECONE_INDEX_NAME=
  ```
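A quick way to catch a misconfigured `.env` early is to check for the four required variables at startup. This is a hypothetical helper, not something the repo ships:

```typescript
// Hypothetical startup check that the four variables from .env are set;
// the repo itself may validate these elsewhere.
const REQUIRED_ENV_VARS = [
  "OPENAI_API_KEY",
  "PINECONE_API_KEY",
  "PINECONE_ENVIRONMENT",
  "PINECONE_INDEX_NAME",
];

// Returns the names of any required variables that are unset or empty.
function missingEnvVars(env: Record<string, string | undefined>): string[] {
  return REQUIRED_ENV_VARS.filter((name) => !env[name]);
}
```

Calling `missingEnvVars(process.env)` before starting the app lets you fail fast with a clear message instead of a cryptic API error later.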
- Visit OpenAI to retrieve your API key and insert it into your `.env` file.
- Visit Pinecone to create and retrieve your API key, and also retrieve your environment and index name from the dashboard.
- In the `utils/makechain.ts` chain, change the `QA_PROMPT` for your own use case. Change `modelName` in `new OpenAI` to `gpt-4` if you have access to the `gpt-4` API. Please verify outside this repo that you have access to the `gpt-4` API, otherwise the application will not work.
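For illustration, a QA prompt in the style of the one in `utils/makechain.ts` might look like the following. The wording here is hypothetical, and `formatPrompt` is a simplified stand-in for LangChain's `PromptTemplate` substitution:

```typescript
// Hypothetical QA prompt template; the exact wording in utils/makechain.ts
// may differ. LangChain fills {context} with retrieved document chunks
// and {question} with the user's query.
const QA_PROMPT = `You are a helpful AI assistant. Use the following context to answer the question.
If you don't know the answer, say you don't know; do not make one up.

{context}

Question: {question}
Helpful answer:`;

// Simplified variable substitution, similar in spirit to what
// LangChain's PromptTemplate does.
function formatPrompt(template: string, vars: Record<string, string>): string {
  return template.replace(/\{(\w+)\}/g, (_, key) => vars[key] ?? "");
}
```

Editing `QA_PROMPT` is how you tailor the assistant's tone and answer format to your own documents.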
- Make sure the `environment` and `index` in your Pinecone dashboard match the ones in the `pinecone.ts` and `.env` files.
- Check that you've set the vector dimensions to `1536`.
- Make sure your Pinecone namespace is lowercase.
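These two constraints (1536 dimensions, which matches OpenAI's embedding output size, and a lowercase namespace) can be checked up front with a small helper. This is a hypothetical pre-flight check, not part of the repo:

```typescript
// Hypothetical pre-flight checks mirroring the README requirements:
// OpenAI embeddings used here are 1536-dimensional, and the Pinecone
// namespace must be lowercase.
const EXPECTED_DIMENSION = 1536;

// Returns a list of configuration errors; an empty list means the
// settings satisfy both constraints.
function validatePineconeConfig(namespace: string, vectorDimension: number): string[] {
  const errors: string[] = [];
  if (namespace !== namespace.toLowerCase()) {
    errors.push("namespace must be lowercase");
  }
  if (vectorDimension !== EXPECTED_DIMENSION) {
    errors.push(`vector dimension must be ${EXPECTED_DIMENSION}`);
  }
  return errors;
}
```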
- Pinecone indexes on the Starter (free) plan are deleted after 7 days of inactivity. To prevent this, send an API request to Pinecone to reset the counter before the 7 days elapse.
- If all else fails, retry from scratch with a new Pinecone project, index, and freshly cloned repo.
This project was mostly based on the Maayooear project. I implemented the parsing & upload of any PDF feature and some extra refactoring.