Skip to content

farisawal/Job-Hiring-Based-on-Personality-Education-Experience-and-Age

Repository files navigation

Job-Hiring-Based-on-Personality-Education-Experience-and-Age

Supervised Learning: Decision Tree Classifier on topic Job Hiring Based on Personality, Education, Experience and Age

Google Drive link: python notebook, dataset, group details

Introduction

Employers are continually seeking for methods to streamline their hiring procedures and find the best applicants for their organisations in today's fast-paced and ever-changing employment market. Our AI model makes use of the most recent developments in machine learning and natural language processing to analyse a candidate's personality traits, educational history, professional experience, and age and offer insightful information about their fit for a given post.

Employers may analyse a big pool of candidates fast and correctly using our AI model, and then decide who to invite for interviews or offer employment openings based on data. As the AI model employs objective criteria to evaluate each candidate, this not only saves time and money but also assures that the recruiting process is fair and impartial.

Our AI model, which was trained to find patterns and correlations between many elements that affect work performance, is based on a vast dataset of information about jobs. Our AI model can give you useful insights to help you reach your objectives, whether you're an employer trying to recruit the top applicants or a job seeker looking to find the ideal employment match.

Problem Statement

  1. The traditional hiring process is time-consuming and expensive, resulting in a slow hiring process and a higher hiring price.
  2. Hiring is also a process that is subject to bias and inconsistency because recruiters are human with vulnerability and rely on subjective judgments.
  3. During interviews, candidates may misrepresent themselves due to nervousness. It is therefore difficult for the recruiter to determine their true personality traits and characteristics.

Objectives

  1. To develop an A.I. Model to accurately predict job hiring based on personality test results
  2. To reduce the time and cost associated with the hiring process by automating the personality testing and job hiring probability prediction process
  3. To minimize the bias and inconsistencies in job recruitment process by using objectives measure of personality traits and characteristics.

Research Background

Before the project delves into the dataset and the model, we begin our research by highlighting and investigating all the necessary components toward a successful interview as it led to a higher chance of getting a job offer. We scoured the internet and found 5 major tenets of successful interview which are interview preparation, honesty, openness, comfortability during the interview and agreeableness. These are the results of our preliminary research.

  1. "The Art of the Interview: 6 Tips to Help You Impress Your Next Employer": This article from Forbes provides tips on how to ace the interview process, including how to prepare beforehand, how to dress, and how to answer common interview questions. https://www.forbes.com/sites/ashleystahl/2019/06/26/the-art-of-the-interview-6-tips-to-help-you-impress-your-next-employer/

  2. "Why Honesty is Essential in the Workplace": This article from Business News Daily explains why honesty is a critical characteristic for employees and how it can impact a company's success. It also provides tips on how to cultivate honesty in the workplace. https://www.businessnewsdaily.com/6549-honesty-workplace.html

  3. "The Importance of Openness in the Workplace": This article from Harvard Business Review explores the benefits of creating a culture of openness in the workplace, including increased collaboration and innovation. It also provides tips on how to encourage openness among employees. https://hbr.org/2018/09/the-importance-of-openness-in-the-workplace

  4. "The Benefits of Comfortable Employees": This article from Inc. Magazine discusses the importance of employee comfort in the workplace and how it can impact productivity and morale. It also provides tips on how to create a comfortable work environment. https://www.inc.com/john-rampton/the-benefits-of-comfortable-employees.html

  5. "Why Agreeableness is Key to Building a Positive Workplace Culture": This article from Psychology Today explains why agreeableness is an essential characteristic for employees and how it can impact teamwork, communication, and morale. It also provides tips on how to hire agreeable employees and how to foster a culture of agreeableness in the workplace. https://www.psychologytoday.com/us/blog/brain-babble/201511/why-agreeableness-is-key-building-positive-workplace-culture

5 Major Tenets For Successful Interview

Interview: The interview process is an opportunity for employers to assess a candidate's communication skills, professionalism, and overall fit for the role and company culture. A successful interview can demonstrate that the candidate has the necessary skills and experience to succeed in the role, as well as a positive attitude and a willingness to learn.

Honesty: Employers value honesty because it is an important component of building trust in the workplace. Honest employees are more likely to take responsibility for their mistakes, communicate openly with their colleagues, and maintain the integrity of the company's policies and procedures.

Openness: Openness refers to a willingness to consider different perspectives and ideas. Employees who are open to feedback and new approaches are more likely to be adaptable and innovative, which can help the company stay competitive in a rapidly changing marketplace.

Comfort: Employers want employees who are comfortable in their role and can manage stress effectively. When employees feel comfortable, they are more likely to be productive, engaged, and committed to their work.

Agreeableness: Agreeable employees are easy to work with and get along well with their colleagues. They are more likely to collaborate effectively, resolve conflicts constructively, and contribute to a positive workplace culture.

How to Deploy Model

To deploy the Decision Tree Classifier model for job hiring based on personality, education, experience, and age, you will need to prepare the environment, load the dataset, split the data, train the model, test the model, save the model, and deploy the model in a way that suits your requirements.

  1. Prepare Environment – For our model, we use Google Collab as IDE.

alt text

  1. Load the datasets – Import the necessary libraries and load the datasets that have been downloaded and read it using pd.read_excel(file_path).

alt text

  1. Data Preparation

    1. Drop unwanted columns

    alt text

    1. Feature Selection

    alt text

  2. Define hyperparameter for search over to increase the model accuracy

alt text

  1. Split the data – Split the dataset into training and testing data using the train_test_split() function from the scikit-learn library.

    alt text

    alt text

  2. Train the model – Create an instance of the DecisionTreeClassifier class from the scikit-learn library and fit the training data to the model using the fit() function.

alt text

  1. Test the model – Test the trained model on the testing data using the predict() function and evaluate the model's performance using metrics such as accuracy, precision, recall, and F1-score.

alt text

Result:

alt text

Used dataset

Our model used dataset from Mendeley Data named the data of HEXACO personality traits and job search outcomes that you can find it in https://data.mendeley.com/datasets/p9hwfmfcz5/3 . The data size is collected from 773 Vietnamese university undergraduate. The dataset consists of 93 columns, each representing a different feature. The features are:

  1. Gender: 1 for Male and 2 for Female.
  2. Married: 1 for Yes and 2 for No
  3. Age: 1 for between 20-21, 2 for between 22-23, 3 for between 24-25, 5 for above 25
  4. Qualifications: 1 for finance, 2 for business, 3 for accounting, 4 for engineering, 5 for other.
  5. Experience: 1 for under 1 year, 2 for 1-2 years, 3 for over 2 years.
  6. Honesty1-Honesty10: Survey questions asking the applicant to rate themselves on 1-5 scale how honest they are.
  7. Emotion1-Emotion8: Survey questions asking the applicant to rate themselves on 1-5 scale how good they are in handling their emotion.
  8. Extravert2-Extravert10: Survey questions asking the applicant to rate themselves on 1-5 scale how extroverted they are.
  9. Agreeable1-Agreeable10: Survey questions asking the applicant to rate themselves on 1-5 scale how agreeable they are.
  10. Conscientious1-Conscientious8: Survey question asking the applicant to rate themselves on 1-8 scale how careful attention to detail, thoroughness, and a strong work ethic they are
  11. Openness1-Openness10: Survey questions asking the applicant to rate themselves on a 1-5 scale on how open-minded they are
  12. Clarity1-Clarity4: Survey questions asking the applicant to rate themselves on a 1-5 scale on how clear their communication is
  13. Size1-Size4: Survey questions asking the applicant to rate themselves on a 1-5 scale on how well they work in different team sizes
  14. Strength1-Strength3: Survey questions asking the applicant to rate themselves on a 1-5 scale on their personal strengths
  15. Comfort1-Comfort6: Survey questions asking the applicant to rate themselves on a 1-5 scale on how comfortable they are in different situations
  16. Proactive1-Proactive10: Survey questions asking the applicant to rate themselves on a 1-5 scale on how proactive they are
  17. Behavior1-Behavior5: Survey questions asking the applicant to rate themselves on a 1-5 scale on how they would behave in different situations
  18. Interview1-Interview2: Survey questions asking the applicant to rate how they felt about the interview process
  19. Offer1-Offer2: Recruiter will give candidates scale from 1-5 how likely they will get a job offer.

Scale:

  • 1: Strongly Disagree
  • 2: Disagree
  • 3: Neutral
  • 4: Agree
  • 5: Strongly Agree

The target variable is Offer1-Offer2, indicating whether or not the applicant was offered a job. The decision tree classifier will use the other 92 features to predict whether or not an applicant will be offered a job based on their personality, education, experience, and age

Data Preparation

The dataset then being checked for any null or wrong format cell.

alt text

Figure above shows the data is complete, no null value, every cell is in integer value, and after thorough data view by researchers every cell holds a value that are in scale of the description from data collector.

Researchers then find the average or mean for each category of personality and change and new columns for average of each category in the dataset. The new column are such.

  1. Ahonesty: Average of Honesty1-Honesty10
  2. Aemotion: Average of Emotion1-Emotion8
  3. Aextravert: Average of Extravert2-Extravert10
  4. Aagreeable: Average of Agreeable1-Agreeable10
  5. Aconscientious: Average of Conscientious1-Conscientious8
  6. Aopenness: Average of Openness1-Openness10
  7. Aclarity: Average of Clarity1-Clarity4
  8. Asize: Average of Size1-Size4
  9. Astrength: Average of Strength1-Strength3
  10. Acomfort: Average of Comfort1-Comfort6
  11. Aproactive: Average of Proactive1-Proactive10
  12. Abehavior: Average of Behavior1-Behavior5
  13. Ainterview: Average of Interview1-Interview2
  14. Aoffer: Average of Offer1-Offer2
  15. Classification : If Aoffer is 4 and above then Classification set as High, else Low. Meaning high probability of being offer a job and low probability of being offer a job.

Tools

Google collab as our IDE in running model

Programming Language

Python.

Why we use Python than others?.

First off, Python is a very flexible language with a vast selection of modules and frameworks made expressly for AI and machine learning development. For instance, widely used frameworks like TensorFlow, PyTorch, and Keras make it simple to create and train sophisticated neural networks, while libraries like NumPy, Pandas, and SciPy offer strong data processing and manipulation capabilities.

Additionally, Python has robust support for data science and visualisation, both of which are essential for creating powerful AI models. With the use of libraries like Matplotlib and Seaborn, developers can quickly produce visualisations that aid in the comprehension and analysis of data as well as the dissemination of insights to stakeholders.

In conclusion, Python is a well-liked option for building AI models because of its adaptability, simplicity, strong community support, and potent data science and visualisation skills.

Result Analysis

Permutation Importance

image

Permutation Importance is defined as the decrease of model score when a single feature value is randomly shuffled. In essence, all the feature value will determine the outcome to a certain degree whether it is small or large. All features have certain expected ranges value and every feature have relationship value therefore by inputting the value in a single feature in random shuffle will cause the model to be less accurate as the feature become less predictable and less dependable on that feature.

By observing the graph shown above, we can observe that average interview process (Ainterview) has the highest dependency for the model to determine the outcome of the job offer scale followed by average emotion (AEmotion), average honesty (AHonesty) and so on. Later we observe the feature selection ranking and determine whether these 3 features will become the most impactful feature on the dataset.

Feature Selection Ranking

image

Here we can observe the feature ranking from the most importance to least importance. AHonesty, AEmotion, Aopenness, Aproactive and Aagreeable are all tied to being the number one most important feature that will influence the model. This is accurate to the assumption as these five feature are also have some of the permutation importance. This indicate that these feature value heavily impacted the probability of job offer and should be prioritized by the interviewee to ensure the highest possible job offer probability.

Correlation Matrix

image

Correlation matrix is utilized in the analysis to determine the linear relationship strength between two features by calculating its coefficient value. The value of a positive or higher correlation coefficient value showcases that when the scale of one’s feature increases, then the other feature scale value also increases. However, when we observe the correlation matrix in this graph, the correlation coefficient value barely reached 0.5 percent and the highest correlation coefficient value are between Aagreeable and Aproactive with around 0.4 coefficient value. This indicates that most of the features aren’t strongly dependent on other features to determine the overall trend value. What we can gather is that while on surface, all these features are related to each skillset of interview skills and opinion, they are mostly non-dependent of one another suggesting that there no centralized feature that ties the dataset and feature together but mostly impacted by each individual feature to determine the job offer probability.

ScatterPlot Comparison

image

AAgreaable against Aproactive

image

AEmotion against AHonesty

These are scatterplot showcasing two features (AAgreaable against Aproactive) with the highest correlation coefficient against the lowest correlation coefficient (AEmotion against AHonesty). We can observe that the highest correlation coefficient scatterplot is more clumped together meanwhile AEmotion against AHonesty scatterplot are much more spread out. This indicates a trend that AAgreable feature value will have a more significant impact on Aproactive and indicates that bear the highest dependency on one another.

Conclusion

In conclusion, employers trying to find the best applicants for their companies can benefit from using an AI model for recruiting jobs based on personality, education, experience, and age. This AI model can rapidly and correctly assess a huge pool of candidates and make data-driven judgements about who to invite for interviews or offer employment openings by utilising the most recent developments in machine learning and natural language processing.

Additionally, the AI model makes sure that the recruiting procedure is impartial and fair by evaluating each candidate according to objective standards. This ensures that candidates are only assessed based on their qualifications and appropriateness for the position, helping to avoid unconscious prejudices.

Due to its adaptability, simplicity, strong community support, and potent data science and visualisation capabilities, Python is one of the most widely used programming languages for developing AI models. Python enables developers to quickly prototype and experiment with new models while having easy access to a large choice of libraries and frameworks created expressly for machine learning and AI development.

Generally speaking, an AI model for recruiting based on personality, education, experience, and age is a useful tool for both employers and job seekers alike, aiding in streamlining the hiring process, removing prejudice, and identifying the best applicant for the position.

About

Supervised Learning: Decision Tree Classifier on topic Job Hiring Based on Personality, Education, Experience and Age

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •