Clean data and build a model to predict the salaries for the job postings contained in test_features.csv (1 million rows). Anaylze model accuracy.
The output of is a CSV file entitled test_salaries.csv
- train_features.csv : Each row represents metadata for an individual job. The “jobId” column represents a unique identifier for the job posting. The remaining columns describe features of the job.
- train_salaries.csv : Each row associates a “jobId” with a “salary”.
- test_features.csv : Similar to train_features.csv , each row represents metadata for an individual job.