The goal of the project is to develop an OCR application that extracts information from questionnaires, codes the data, and outputs them in tabular format.
- Extracts text and data from scanned questionnaires using OCR.
- Supports template-based recognition for improved accuracy.
- Detects form fields such as text inputs, checkboxes, and radio buttons.
- Exports extracted data to Excel format.
- React frontend for capturing images and displaying results.
- Python Django backend for processing images and managing templates.
- Backend: Python with Django, OpenCV, and Tesseract OCR.
- Frontend: React with Axios for API communication.
For detailed setup instructions, refer to the Setup Guide.
- Launch the React app to capture or upload images.
- Use the app to process images and extract data from forms.
- View the extracted data and export it to Excel if needed.
Contributions are welcome! To contribute:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Commit your changes and push them to your fork.
- Submit a pull request with a detailed description of your changes.
This project is licensed under the MIT License.