Project

Prediction of Diabetes Occurrence

In this project, we aim to predict the occurrence of diabetes within the PIMA Native American group. We implemented the Decision Tree algorithm in Python. The data contains the following columns:

times_pregnant: Number of times pregnant
plasma_glucose: Plasma glucose concentration in a 2-hour oral glucose tolerance test
diastolic_blood_pressure: Measured in mmHg
tricep_skin_fold_thickness: Measured in mm
serum_insulin: 2-hour serum insulin concentration, measured in mu U/ml
body_mass_index: Weight in kg / (height in m)^2
diabetes_pedigree_function: Function that assigns probability …
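To give a feel for what a decision tree does at each node, here is a minimal sketch of how a single split could be chosen by minimizing Gini impurity. It is not the project's actual code: the helper names `gini` and `best_split` are hypothetical, the toy rows are made up for illustration (they are not taken from the PIMA data), and a full tree would apply this step recursively to each side of the split.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the node is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_split(rows, labels):
    """Return (feature_index, threshold, weighted_gini) for the best binary split.

    Rows with value <= threshold go left, the rest go right. If no split
    improves on the unsplit node, feature_index and threshold stay None.
    """
    n = len(rows)
    best = (None, None, gini(labels))
    for j in range(len(rows[0])):
        values = sorted({row[j] for row in rows})
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for row, y in zip(rows, labels) if row[j] <= t]
            right = [y for row, y in zip(rows, labels) if row[j] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best[2]:
                best = (j, t, score)
    return best


# Hypothetical toy rows: (plasma_glucose, body_mass_index). Illustrative only.
toy_rows = [(90, 24.0), (95, 27.5), (110, 22.0), (140, 35.0), (160, 31.0), (175, 29.5)]
toy_labels = [0, 0, 0, 1, 1, 1]

feature, threshold, impurity = best_split(toy_rows, toy_labels)
print(feature, threshold, impurity)
```

On this toy sample the chosen split is on the first feature, since a single glucose threshold separates the two classes completely. In practice a library implementation such as scikit-learn's decision tree classifier would handle the recursion, stopping rules, and pruning.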

Prediction Of Iris Species

The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper "The use of multiple measurements in taxonomic problems". It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica, …

Prediction Of Startups Profit

In this project, we aim to predict the profit of 50 startups. We implemented Multiple Linear Regression in Python. The data contains the following columns: R&D Spend, Administration, Marketing Spend, State, and Profit. Let's get our environment ready with the libraries we'll need and then import the data! Check out the Data. EDA: Let's create some simple plots to check out the data! Training a Linear Regression Model: Let's now begin to train out …
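As a rough illustration of what fitting a multiple linear regression involves, the sketch below solves the least-squares normal equations in plain Python. It is not the project's code: `fit_ols` and `predict` are hypothetical helper names, the numbers are invented so that the fit is exact, only two numeric features are used, and a categorical column such as State would first need to be encoded as numbers.

```python
def fit_ols(X, y):
    """Fit y ~ b0 + b1*x1 + ... + bk*xk by solving (X^T X) b = X^T y."""
    rows = [[1.0] + [float(v) for v in r] for r in X]  # prepend intercept column
    k = len(rows[0])
    # Augmented matrix [X^T X | X^T y].
    A = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        if abs(A[col][col]) < 1e-12:
            raise ValueError("features are collinear; the system is singular")
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


def predict(coefs, row):
    """Apply the fitted coefficients (intercept first) to one feature row."""
    return coefs[0] + sum(c * v for c, v in zip(coefs[1:], row))


# Hypothetical toy data: (rd_spend, marketing_spend) -> profit.
# Generated from profit = 10 + 0.8 * rd_spend + 0.1 * marketing_spend.
X = [(10, 20), (20, 10), (30, 40), (40, 30), (50, 60)]
y = [20.0, 27.0, 38.0, 45.0, 56.0]

coefs = fit_ols(X, y)
print([round(c, 6) for c in coefs])
print(round(predict(coefs, (60, 50)), 6))
```

Because the toy targets were generated from a known linear rule, the fit recovers that rule up to floating-point error. Real data would leave residuals, and a library routine would normally be preferred over hand-rolled elimination for numerical stability.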