Resume Parser & Job Matching

A project that extracts and summarizes PDF resumes using both code-based logic (regex, dictionaries) and LLM refinement (Groq LLM). It then compares the finalized resume data against a user-provided job description using lemma-based and semantic matching approaches, yielding detailed matching scores.

Live Demo

Project Deployment Link: Resume Parser App

Video Demonstration

Check out the demo video here: Demo Video

Features

PDF Resume Upload:
Upload any PDF resume file.
Code-Based Parsing:
- Regex for phone and email.
- List-based skill detection from known keywords.
LLM Refinement:
- A Groq LLM verifies partial parsed data, removing incorrect items and producing a final summary in subpoints (Education, Experience, Skills, etc.).
- Fallback text appears if the LLM or API key is unavailable.
Job Description Matching:
- Lemma-Based (Jaccard / lexical overlap).
- Semantic (using sentence-transformers to measure embedding similarity).
- Combined final score.
Detailed Outputs:
- Raw vs. Cleaned Resume Text
- Matched & Unmatched Tokens / Sentences
- Final Summaries / Bullet Points

How It Works

User Interaction:
- Paste a job description.
- Upload a PDF resume.
Partial Parsing:
- Regex extracts phone/email.
- Keyword detection finds known skills (e.g., "Python", "Java").
LLM Finalization:
- The partial parse plus the raw text is fed into Groq LLM.
- The LLM verifies or removes incorrect fields, then produces subheading-based summaries.
Matching with JD:
1. Lemma-Based: Jaccard overlap between lemmatized tokens from the JD and resume.
2. Semantic: Overall text similarity plus line-by-line JD comparisons.
3. Combined: Weighted average, default 50/50.

Local Deployment

Clone or Download the repository.
Install the dependencies (pinned in requirements.txt):
```
pip install -r requirements.txt
```
Run the Streamlit app:
```
streamlit run app.py
```
Open the URL provided by Streamlit.
Interact with the UI to upload a PDF resume and paste a job description.

Project Flow

app.py:
- Presents the Streamlit UI.
- Handles user input (resume + JD).
- Shows partial parse, LLM finalization, and matching outputs.
requirements.txt:
- Lists pinned versions for stable deployment.
Code & LLM synergy:
- The code-based approach ensures partial data extraction without relying solely on the LLM.
- The LLM refines that data, producing a final bullet-point summary.

Demo Links

Live App: Resume Parser App
Demo Video: Watch on Google Drive

Known Limitations

If the Groq LLM is unavailable or the key is invalid, you'll see a fallback message.
The code-based parse is minimal (phone/email regex, skill dictionary). Extend these methods for deeper extraction.
Torch-based dependencies can occasionally cause environment conflicts. See requirements.txt for pinned versions.

Contributing

Fork this repository.
Create a new branch for your features/fixes.
Open a Pull Request with a clear explanation.

License

This project is open-source under the MIT License. Feel free to use and adapt it.

Acknowledgements

Streamlit for interactive UI.
PyPDF2 for PDF text extraction.
Groq for LLM inference.
NLTK & sentence-transformers for textual processing & semantic matching.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
Kushal_Patel.pdf		Kushal_Patel.pdf
LICENSE		LICENSE
README.md		README.md
Resume_Parser.ipynb		Resume_Parser.ipynb
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Resume Parser & Job Matching

Live Demo

Video Demonstration

Features

How It Works

Local Deployment

Project Flow

Demo Links

Known Limitations

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

kushalpatel0265/Resume-Parser

Folders and files

Latest commit

History

Repository files navigation

Resume Parser & Job Matching

Live Demo

Video Demonstration

Features

How It Works

Local Deployment

Project Flow

Demo Links

Known Limitations

Contributing

License

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages