OCR Application using Gemma-3

This project leverages Gemma-3 vision capabilities and Streamlit to create a 100% locally running computer vision app that can perform both OCR and extract structured text from the image.

Installation and setup

Set virtual environment:

python -m venv ocr-gemma3-env

On macOS and Linux:

source ocr-gemma3-env/bin/activate

On Windows:

ocr-gemma3-env\Scripts\activate

Install Dependencies: Ensure you have Python 3.11 or later installed.

pip install -r requirements.txt

Setup Ollama:

# setup ollama on linux 
curl -fsSL https://ollama.com/install.sh | sh

# pull gemma-3 vision model
ollama run gemma3:12b

Run the Streamlit app:

streamlit run streamlit_app.py

Made with ❤️

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
assets		assets
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
streamlit_app.py		streamlit_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OCR Application using Gemma-3

Installation and setup

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

klaushajdaraj/OCR

Folders and files

Latest commit

History

Repository files navigation

OCR Application using Gemma-3

Installation and setup

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages