Human Resources Analytics

Data analysis project of the Data Mining course held at University of Pisa (Master's degree in Computer Science, fall/winter 2017). Equally relevant contributions to this project were made by noobcode and anferico.

The projects consists in the application of Data Mining techniques to an HR dataset about people working in a certain company. The goal of this project is to identify those employees that left their job and define their typical profile (based on parameters like level of satisfaction, promotions obtained, number of projects and so on).

Main modules of the project

This work includes four main parts:

Data understanding: data visualization, statistics and cleaning. Relative Jupyter notebooks can be found under the folders data_understanding and More on data understanding.
Clustering: application of the K-Means, DBSCAN and Hierarchical clustering algorithms to identify patterns among employees. The Jupyter notebooks for this task are all stored inside the clustering folder.
Pattern mining and association rules: frequent patterns and association rules extraction through the Apriori algorithm. Some of the rules are also used in the definition of a rule-based classifier. Source code can be found under the two folders more_on_association_rules and association-rules-and-pattern-mining.
Binary classification: definition and evaluation of different decision-tree models for the binary classification of employees (whether or not they left their job at the company). Implementations can be inspected by reading the Jupyter notebooks inside the HR_classification folder.

Written report

A written report of the project is available in Italian (report/Relazione_Unita.pdf).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Human Resources Analytics

Main modules of the project

Written report

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
HR_classification		HR_classification
More on data understanding		More on data understanding
association-rules-and-pattern-mining		association-rules-and-pattern-mining
clustering		clustering
data		data
data_understanding		data_understanding
images		images
more_on_association_rules		more_on_association_rules
report		report
README.md		README.md

leqo-c/data-mining-2017

Folders and files

Latest commit

History

Repository files navigation

Human Resources Analytics

Main modules of the project

Written report

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages