In the Computer Vision module, we scan the document, identify the location of the text, and finally extract the text from the image. Then, in the Natural Language Processing module, we extract the entities from the text, do the necessary text cleaning, and parse the entities from the text.
- Pytesseract
- OpenCV
- spaCy
- Pandas
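
The two stages described above could be sketched roughly as follows with the listed libraries. This is a minimal illustration, not the project's actual code: the function names `locate_words`, `clean_text`, and `extract_entities` are hypothetical, and the cleaning rule (stripping punctuation from token edges) is an assumption about what "necessary text cleaning" means here.

```python
import string


def locate_words(image_path):
    """Computer vision stage: return a DataFrame of text boxes found by Tesseract."""
    # Imported lazily so the text helpers below run without the OCR stack installed.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")
    # Convert to grayscale before OCR (a common preprocessing choice, assumed here).
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # One row per detected element, including left/top/width/height/conf/text columns.
    # Output.DATAFRAME requires pandas to be installed.
    data = pytesseract.image_to_data(gray, output_type=pytesseract.Output.DATAFRAME)
    return data.dropna(subset=["text"])


def clean_text(text):
    """Collapse whitespace and strip punctuation from the edges of each token."""
    tokens = (token.strip(string.punctuation) for token in text.split())
    return " ".join(token for token in tokens if token)


def extract_entities(text, nlp):
    """NLP stage: return (entity text, label) pairs from a loaded spaCy pipeline."""
    doc = nlp(text)
    return [(ent.text, ent.label_) for ent in doc.ents]


# Example usage (assumes a hypothetical image "document.jpg" and that the
# en_core_web_sm model is installed; the project may use its own trained model):
#   import spacy
#   words = locate_words("document.jpg")
#   text = clean_text(" ".join(words["text"].astype(str)))
#   entities = extract_entities(text, spacy.load("en_core_web_sm"))
```

Passing the loaded pipeline in as `nlp` keeps the entity step independent of which spaCy model is used.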
pip install -r requirements_app.txt
Download and install the Tesseract OCR engine, which Pytesseract requires:
https://tesseract-ocr.github.io/tessdoc/Installation.html
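
After installing, a quick check like the one below can confirm that the `tesseract` executable is visible on the system PATH. This is a small sketch using only the standard library; the helper names `find_tesseract` and `describe_tesseract` are hypothetical.

```python
import shutil


def find_tesseract():
    """Return the path of the tesseract executable on PATH, or None if it is missing."""
    return shutil.which("tesseract")


def describe_tesseract():
    """Return a short human-readable status message about the installation."""
    path = find_tesseract()
    if path is None:
        return "tesseract not found on PATH"
    return f"tesseract found at {path}"


print(describe_tesseract())
```

If the executable is installed but not on PATH (common on Windows), Pytesseract can be pointed at it by setting `pytesseract.pytesseract.tesseract_cmd` to the full path of the executable.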