Business Problem
- It is desired to develop a machine learning model that can predict whether people have diabetes or not when their characteristics are determined. You are expected to perform the necessary data analysis and feature engineering steps before developing the model.
Dataset Story
- The data set is part of a large data set held at the National Institutes of Diabetes-Digestive-Kidney Diseases in the USA. Data used in diabetes studies on Pima Indian women aged 21 and over living in Phoenix, the 5th largest city in the US State of Arizona. The target variable is specified as "outcome"; 1 indicates a positive diabetes test result and 0 indicates a negative.