Ask PDF

Ask questions to a PDF file using Retrieval-Augmented Generation with pgvector and Heroku Managed Inference and Agents.
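
As a rough illustration of the retrieval step in Retrieval-Augmented Generation, the sketch below ranks text chunks by cosine similarity to a query vector and keeps the best matches as context for the model. It is not this repository's code: the Chunk type, the cosineSimilarity and topK functions, and the toy 3-dimensional vectors are all assumptions made for the example. In the app, pgvector is meant to do this kind of similarity search inside PostgreSQL, over real embeddings.

```typescript
// Illustrative only: a tiny in-memory version of the retrieval step.
// All names here are hypothetical and not taken from this repository.
interface Chunk {
  text: string;
  embedding: number[];
}

// Cosine similarity: 1 for the same direction, 0 for orthogonal vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) {
    return 0; // a zero vector has no direction to compare
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the k chunks most similar to the query, best match first.
function topK(query: number[], chunks: Chunk[], k: number): Chunk[] {
  return [...chunks]
    .sort(
      (x, y) =>
        cosineSimilarity(query, y.embedding) -
        cosineSimilarity(query, x.embedding),
    )
    .slice(0, k);
}

// Toy 3-dimensional vectors stand in for real embeddings.
const chunks: Chunk[] = [
  { text: "Invoices are due within 30 days.", embedding: [0.9, 0.1, 0.0] },
  { text: "The office is closed on Sundays.", embedding: [0.0, 0.2, 0.9] },
  { text: "Late payments incur a 2% fee.", embedding: [0.8, 0.3, 0.1] },
];

// The selected chunk texts would be added to the prompt as context.
const context: string[] = topK([1, 0, 0], chunks, 2).map((c) => c.text);
console.log(context);
```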

Requirements

Node.js LTS (>v20.9.0) - We recommend using Volta as a version manager.
A Heroku account
Heroku CLI
PostgreSQL psql client
AWS Command Line Interface
pnpm

Installation

Install dependencies by running:

pnpm install

Create a Heroku application with:

heroku create <app-name>

Provision the Heroku Postgres add-on, which supports pgvector:

heroku addons:create heroku-postgresql:essential-0

Provision the Heroku Managed Inference and Agents add-ons, Claude 4 Sonnet for inference and Cohere Embed Multilingual for embeddings:

heroku ai:models:create claude-4-sonnet --as INFERENCE

heroku ai:models:create cohere-embed-multilingual --as EMBEDDING

Provision the Bucketeer add-on:

heroku addons:create bucketeer:hobbyist

Once the PostgreSQL database is created, set up the database schema with:

heroku pg:psql -f data/database.sql

Set up the Bucketeer public access policy. Replace <bucket-name> and run:

aws s3api put-public-access-block --bucket <bucket-name> --public-access-block-configuration BlockPublicAcls=FALSE,IgnorePublicAcls=FALSE,BlockPublicPolicy=FALSE,RestrictPublicBuckets=FALSE

Create a policy.json file with the following content, replacing <bucket-name>:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "PublicReadGetObject",
      "Effect": "Allow",
      "Principal": "*",
      "Action": ["s3:GetObject"],
      "Resource": ["arn:aws:s3:::<bucket-name>/public/*"]
    }
  ]
}

Then run the following, replacing <bucket-name>:

aws s3api put-bucket-policy --bucket <bucket-name> --policy file://policy.json
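
If you would rather generate policy.json than edit the placeholder by hand, a sketch like the following builds the same policy document for a given bucket. The buildPublicReadPolicy function and the "my-bucket" name are hypothetical and not part of this repository. Substitute your own Bucketeer bucket name.

```typescript
// Hypothetical helper (not part of this repository): builds the public-read
// bucket policy described above for a given bucket name.
interface PolicyStatement {
  Sid: string;
  Effect: "Allow" | "Deny";
  Principal: string;
  Action: string[];
  Resource: string[];
}

interface BucketPolicy {
  Version: string;
  Statement: PolicyStatement[];
}

function buildPublicReadPolicy(bucketName: string): BucketPolicy {
  if (bucketName.trim() === "") {
    throw new Error("bucketName must not be empty");
  }
  return {
    Version: "2012-10-17",
    Statement: [
      {
        Sid: "PublicReadGetObject",
        Effect: "Allow",
        Principal: "*",
        Action: ["s3:GetObject"],
        // Only objects under the public/ prefix become world-readable.
        Resource: [`arn:aws:s3:::${bucketName}/public/*`],
      },
    ],
  };
}

// "my-bucket" is a placeholder; serialize with two-space indentation so the
// output can be saved directly as policy.json.
const policyJson: string = JSON.stringify(
  buildPublicReadPolicy("my-bucket"),
  null,
  2,
);
console.log(policyJson);
```

Redirecting the printed output to policy.json gives a file that can be passed to the put-bucket-policy command.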

Note: To run the aws commands you need to configure your credentials first by running:

aws configure

Run in Development

Create a .env file with the following variables. You can use .env.sample as a template:

BUCKETEER_AWS_ACCESS_KEY_ID=<value>
BUCKETEER_AWS_REGION=us-east-1
BUCKETEER_AWS_SECRET_ACCESS_KEY=<value>
BUCKETEER_BUCKET_NAME=<value>
DATABASE_URL=<value>
EMBEDDING_KEY=<value>
EMBEDDING_MODEL_ID=cohere-embed-multilingual
EMBEDDING_URL='https://us.inference.heroku.com'
INFERENCE_KEY=<value>
INFERENCE_MODEL_ID=claude-4-sonnet
INFERENCE_URL='https://us.inference.heroku.com'

Note: These configuration variables can be fetched from Heroku using:

heroku config --shell > .env
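
Once the .env file is in place, a small startup check can catch a missing variable before the app fails at runtime. The sketch below is an assumption about how such a check might look, not code from this repository. REQUIRED_ENV_VARS and findMissingEnvVars are hypothetical names, and the variable list is simply the one from the .env example.

```typescript
// Hypothetical startup check (not part of this repository).
// The variable names are the ones listed in the .env example.
const REQUIRED_ENV_VARS = [
  "BUCKETEER_AWS_ACCESS_KEY_ID",
  "BUCKETEER_AWS_REGION",
  "BUCKETEER_AWS_SECRET_ACCESS_KEY",
  "BUCKETEER_BUCKET_NAME",
  "DATABASE_URL",
  "EMBEDDING_KEY",
  "EMBEDDING_MODEL_ID",
  "EMBEDDING_URL",
  "INFERENCE_KEY",
  "INFERENCE_MODEL_ID",
  "INFERENCE_URL",
] as const;

type Env = Record<string, string | undefined>;

// Returns the names of required variables that are unset or blank.
function findMissingEnvVars(env: Env): string[] {
  return REQUIRED_ENV_VARS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Example with a deliberately incomplete environment object.
const missing: string[] = findMissingEnvVars({
  DATABASE_URL: "postgres://localhost/askpdf",
});
console.log(missing.length); // 10: every variable except DATABASE_URL
```

In a Node.js process you would pass process.env instead of a literal object and exit early when the returned list is not empty.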

Run the project locally with:

pnpm run dev

Manual Deployment

To deploy manually to Heroku, run:

git push heroku main
