GitHub - DhruvKikan/Neo-Tech-PDFParser

Deployed and tested live on AWS EC2.

DON'T FORK OR TRY TO RUN LOCALLY.

The main PDF processing requires Tesseract and OpenCV for text extraction.

After text extraction, basic pre-processing is done using regular expressions. The pre-processed text is then fed into an LLM, which extracts the information and returns it in a parsed form based on the template. Support for uploading job descriptions (JDs) and scoring candidates against a JD has also been added.
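As a rough illustration of the pre-processing step described above, the sketch below cleans OCR output with regular expressions and builds a prompt from a list of template fields. It is not taken from this repository: the function names `preprocess` and `build_prompt`, the specific cleanup rules, and the prompt wording are all assumptions, and the actual LLM call is left out because the source does not say which model or API is used.

```python
import re


def preprocess(raw_text: str) -> str:
    """Clean OCR output before it is sent to an LLM (illustrative rules only)."""
    # Normalize Windows and old Mac line endings to "\n".
    text = raw_text.replace("\r\n", "\n").replace("\r", "\n")
    # Rejoin words that were hyphenated across a line break.
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Replace control characters (form feeds etc.), keeping tabs and newlines.
    text = re.sub(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]", " ", text)
    # Collapse runs of spaces and tabs into a single space.
    text = re.sub(r"[ \t]+", " ", text)
    # Trim leading and trailing whitespace on every line.
    text = "\n".join(line.strip() for line in text.split("\n"))
    # Collapse three or more newlines into a single blank line.
    text = re.sub(r"\n{3,}", "\n\n", text)
    return text.strip()


def build_prompt(clean_text: str, fields: list[str]) -> str:
    """Build a hypothetical extraction prompt from the template's field names."""
    field_list = ", ".join(fields)
    return (
        "Extract the following fields from the document text and reply as JSON "
        f"with exactly these keys: {field_list}.\n\n"
        f"Document text:\n{clean_text}"
    )


if __name__ == "__main__":
    sample = "John  Doe\r\nExperi-\nence:\t5 years\n\n\n\nSkills: Python\x0c"
    cleaned = preprocess(sample)
    print(build_prompt(cleaned, ["name", "experience", "skills"]))
```

Keeping the cleanup separate from prompt construction makes it easy to unit-test the regular expressions without calling any model.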

Folders and files (50 commits):
instructions
neotech_reader
results
README.md
