- Problem statement This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course.
- Data Collection Dataset Source - https://www.kaggle.com/datasets/spscientist/students-performance-in-exams?datasetId=74977 The data consists of 8 column and 1000 rows.
This dataset contains information about student performance in exams along with various demographic and background attributes. The dataset includes the following attributes:
- Gender: Sex of students (Male/Female)
- Race/Ethnicity: Ethnicity of students (Group A, B, C, D, E)
- Parental Level of Education: Parents' final education (Bachelor's Degree, Some College, Master's Degree, Associate's Degree, High School)
- Lunch: Whether the student has standard or free/reduced lunch
- Test Preparation Course: Whether the student completed a test preparation course before the exam
- Math Score
- Reading Score
- Writing Score
The data was collected from [source link or description of data source].
Before using the dataset, some preprocessing steps may be necessary, including handling missing values, encoding categorical variables, and scaling numerical features.
**
Check for missing values in the dataset.
Check for duplicate rows in the dataset.
Ensure correct data types for each column in the dataset.
Check the number of unique values for each column in the dat
This dataset can be used for various analyses and machine learning tasks, including but not limited to:
- Exploratory Data Analysis (EDA) to understand the distribution and relationships between different variables.
- Predictive modeling to predict student performance based on demographic and background attributes.
- Identifying factors influencing student performance and providing insights for educational interventions.
Please check the license of the dataset before usage.
If you use this dataset in your research or work, please cite the original source appropriately.
Acknowledgments to individuals or organizations who collected or provided the dataset.
List any references or related work that might be helpful for understanding or using the dataset.