PDF Voice Assistant

A web application that lets users fill out PDF forms by speaking. The app captures the user’s voice, transcribes it to text, uses a local language model (e.g. Mistral via Ollama) to extract structured data, maps that data to the PDF’s form fields, and returns the filled-out document. It works with any fillable PDF and adapts to forms with many or non-standard fields based on the user’s input.
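The mapping step described above (extracted data to PDF form fields) could look roughly like the sketch below. It is illustrative only: `normalizeKey` and `mapToFields` are hypothetical names, and matching by normalized label is an assumption, not necessarily how the project matches fields.

```typescript
// Hypothetical helpers for illustration; these names do not come from the project.
type Extracted = Record<string, string>;

// Reduce a label to lowercase letters and digits so that
// "First_Name", "first name" and "FirstName" all compare equal.
function normalizeKey(key: string): string {
  return key.toLowerCase().replace(/[^a-z0-9]/g, "");
}

// Match the model's extracted keys against the PDF's field names.
// Fields with no matching key are omitted so the caller can leave them blank.
function mapToFields(extracted: Extracted, fieldNames: string[]): Extracted {
  const byNormalizedKey: Extracted = {};
  for (const key of Object.keys(extracted)) {
    byNormalizedKey[normalizeKey(key)] = extracted[key];
  }

  const result: Extracted = {};
  for (const name of fieldNames) {
    const lookup = normalizeKey(name);
    if (Object.prototype.hasOwnProperty.call(byNormalizedKey, lookup)) {
      result[name] = byNormalizedKey[lookup];
    }
  }
  return result;
}

// Example: two of the three form fields can be filled from the spoken input.
const filled = mapToFields(
  { "first name": "Ada", email: "ada@example.com" },
  ["First_Name", "Email", "Phone"],
);
console.log(filled);
```

The result is keyed by the PDF's own field names, so it could be handed to whatever code writes values into the form (pdf-lib in this stack).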

Tech Stack:

● Frontend: Vite, React, React Router DOM, TypeScript, TailwindCSS, MicRecorder, File upload

● Backend: Node.js (Express), FastAPI Python microservices (Whisper), and Ollama (Mistral)

● Libraries: pdf-lib, fs, multer.js, ollama
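To make the backend's extraction step concrete, here is a minimal sketch of how the Node.js server might build a request for Ollama's `/api/generate` endpoint and read the reply. The prompt wording and the function names `buildExtractionRequest` and `parseExtraction` are assumptions for illustration; the project's actual prompt and client code are not shown in this README.

```typescript
// Request body for Ollama's /api/generate endpoint (non-streaming, JSON output).
interface GenerateRequest {
  model: string;
  prompt: string;
  stream: boolean;
  format: "json";
}

// Hypothetical prompt builder: asks the model for one JSON object keyed by field name.
function buildExtractionRequest(transcript: string, fieldNames: string[]): GenerateRequest {
  const prompt = [
    "Extract values for these form fields from the transcript.",
    "Reply with a single JSON object keyed by field name; omit unknown fields.",
    "Fields: " + fieldNames.join(", "),
    "Transcript: " + transcript,
  ].join("\n");
  return { model: "mistral", prompt, stream: false, format: "json" };
}

// With stream: false, Ollama returns the generated text in a `response` string.
// Keep only string values; fall back to an empty object on malformed output.
function parseExtraction(body: { response: string }): Record<string, string> {
  let parsed: unknown;
  try {
    parsed = JSON.parse(body.response);
  } catch {
    return {};
  }
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    return {};
  }
  const source = parsed as Record<string, unknown>;
  const out: Record<string, string> = {};
  for (const key of Object.keys(source)) {
    const value = source[key];
    if (typeof value === "string") out[key] = value;
  }
  return out;
}

const request = buildExtractionRequest("My name is Ada Lovelace", ["Name", "Email"]);
console.log(JSON.stringify(request, null, 2));
```

In a running setup the request body would be sent as a JSON POST to the local Ollama server (by default `http://localhost:11434/api/generate`). Keeping the builder and parser as pure functions makes them easy to test without a model running.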

Status: work in progress (WIP).

Repository: emhgit/pdf-voice-assistant