Fishbony/ocr_PTA_GPPH

OCR of PTA (Pure Tone Audiometry) PDF files

Update 2020.03.20

Update PTA_transfer.py

Background

In our hospital, PTA results are delivered in PDF format.

Staff have to type in the result at each frequency by hand, so this project automates that data entry. Tesseract was chosen as the OCR engine for this project. It is available at https://github.com/tesseract-ocr/tesseract.
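As a rough illustration of how Tesseract can read a single value from a page image, here is a minimal sketch. It assumes the `pytesseract` wrapper and Pillow, which this README does not confirm are what OCR.py uses. The function names, the crop box, and the digit whitelist are hypothetical.

```python
import re
from typing import Optional, Tuple


def parse_threshold(raw_text: str) -> Optional[int]:
    """Turn raw OCR output such as ' 45\n' into an integer, or None if no number is found."""
    match = re.search(r"-?\d+", raw_text)
    return int(match.group()) if match else None


def ocr_threshold(image_path: str, box: Tuple[int, int, int, int]) -> Optional[int]:
    """OCR one cropped region of a page image; box is (left, upper, right, lower) in pixels."""
    # Imported lazily so parse_threshold works even without the OCR stack installed.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as page:
        region = page.crop(box)
        # --psm 7 treats the crop as a single text line. The whitelist is an
        # assumption that the values contain only digits and a minus sign.
        text = pytesseract.image_to_string(
            region, config="--psm 7 -c tessedit_char_whitelist=0123456789-"
        )
    return parse_threshold(text)
```

Keeping the text cleanup in its own function makes it possible to check the parsing step without Tesseract installed.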

This project does not include a GUI application.

TODO

  • Complete the rename function.
  • Make it easy to enter the locations of the different frequencies and save them to a separate file.
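One possible shape for the second TODO item is sketched below: the frequency locations are kept in a JSON file instead of in the script. This is only an assumption about how it could be done, not the project's implementation. The file layout (frequency in Hz mapped to a pixel crop box) and the names `save_boxes` and `load_boxes` are hypothetical.

```python
import json
from pathlib import Path
from typing import Dict, List

# Hypothetical layout: frequency in Hz -> crop box [left, upper, right, lower] in pixels.
FrequencyBoxes = Dict[int, List[int]]


def save_boxes(boxes: FrequencyBoxes, path: str) -> None:
    """Write the frequency locations to a JSON file."""
    # JSON object keys must be strings, so frequencies are converted on write.
    payload = {str(freq): list(box) for freq, box in sorted(boxes.items())}
    Path(path).write_text(json.dumps(payload, indent=2), encoding="utf-8")


def load_boxes(path: str) -> FrequencyBoxes:
    """Read the frequency locations back, restoring integer keys."""
    payload = json.loads(Path(path).read_text(encoding="utf-8"))
    boxes: FrequencyBoxes = {}
    for freq, box in payload.items():
        if len(box) != 4:
            raise ValueError(f"box for {freq} Hz must have 4 values, got {len(box)}")
        boxes[int(freq)] = [int(value) for value in box]
    return boxes
```

With a file like this, adjusting a location for a different report layout would mean editing the JSON file only.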

File list

./test
./pics
./README.md
./PTA_rename.py (incomplete)
./PTA_transfer.py
./OCR.py

How to run file

Windows platform

  • File preparation (getting ready for OCR)

    • Rename your PTA PDF files. PTA_rename.py is meant to rename the files using OCR, but it is incomplete. DO NOT run this file. If your PDF files are already renamed, skip this step.

      python PTA_rename.py
    • Convert your PDF files to JPG.

      python PTA_transfer.py
  • OCR

    • Run OCR on your images to get a CSV file.

      python OCR.py
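The two working steps above (PDF to JPG, then OCR to CSV) can be sketched roughly as follows. This is not the code in PTA_transfer.py or OCR.py. It assumes the `pdf2image` package (which needs Poppler installed) for the conversion, and the frequency list, file naming, and function names are hypothetical.

```python
import csv
from pathlib import Path
from typing import Dict, Iterable, List, Optional, Tuple

# Assumed column set; the frequencies on your reports may differ.
FREQUENCIES_HZ = [250, 500, 1000, 2000, 4000, 8000]


def pdf_to_jpg(pdf_path: str, out_dir: str, dpi: int = 300) -> List[Path]:
    """Render each page of a PDF to a JPEG file and return the saved paths."""
    from pdf2image import convert_from_path  # lazy import: optional dependency

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    for number, page in enumerate(convert_from_path(pdf_path, dpi=dpi), start=1):
        target = out / f"{Path(pdf_path).stem}_p{number}.jpg"
        page.save(target, "JPEG")
        saved.append(target)
    return saved


def write_csv(
    rows: Iterable[Tuple[str, Dict[int, Optional[int]]]], csv_path: str
) -> None:
    """Write one CSV line per image: the file name, then one value per frequency."""
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["file"] + [f"{freq}Hz" for freq in FREQUENCIES_HZ])
        for name, values in rows:
            # Missing or unreadable values are left as empty cells for manual review.
            cells = [values.get(freq) for freq in FREQUENCIES_HZ]
            writer.writerow([name] + ["" if cell is None else cell for cell in cells])
```

Leaving unreadable values blank, rather than guessing, makes it easy to spot the cells that still need to be checked by hand.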

Support

Authors & Contributors List

Name: Fishbony

Affiliation: Postgraduate student at Shantou University Medical College and resident at Guangdong Provincial People's Hospital

Email: czhangent@163.com
