Project

Prediction of Tumor Severity

In this project, we aim to predict whether a tumor is benign or malignant. We implemented KNN in Python. The data contains the following columns: BI_RADS_assessment: definitely benign (1) to highly suggestive of malignancy (5); Age: patient’s age in years; Shape: mass shape, round=1, oval=2, lobular=3, irregular=4 (nominal); Margin: mass margin, circumscribed=1, microlobulated=2, obscured=3, ill-defined=4, spiculated=5 (nominal); Density: mass density, high=1, iso=2, low=3, fat-containing=4 (ordinal); Severity: the class to predict, benign=0 or malignant=1. Let’s get our environment …
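As a rough illustration of the KNN idea mentioned above, here is a minimal standard-library sketch. It is not the project's own code: the function name `knn_predict` and the rows in `train_X` are made up for illustration, and plain Euclidean distance on raw values is a simplifying assumption (Shape and Margin are nominal codes, and Age would dominate the distance without scaling).

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    # Keep the labels of the k closest points and return the most common one.
    top_labels = [label for _, label in distances[:k]]
    return Counter(top_labels).most_common(1)[0][0]

# Hypothetical rows, for illustration only: (Age, Shape, Margin, Density)
train_X = [(35, 1, 1, 3), (42, 2, 1, 3), (67, 4, 5, 1), (71, 4, 4, 2), (58, 3, 4, 3)]
train_y = [0, 0, 1, 1, 1]  # Severity: benign=0, malignant=1

older_case = knn_predict(train_X, train_y, (65, 4, 5, 3), k=3)
younger_case = knn_predict(train_X, train_y, (38, 1, 1, 3), k=3)
```

In practice the features would usually be scaled before computing distances, and `k` would be chosen by validation rather than fixed at 3.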

Prediction of Tomorrow Rain in Australia

In this project, we aim to predict whether or not it will rain tomorrow by training a Random Forest classification model on the target RainTomorrow. The dataset contains daily weather observations from numerous Australian weather stations. The data contains the following columns: Date: the date of observation; Location: the common name of the location of the weather station; MinTemp: the minimum temperature in degrees Celsius; MaxTemp: the maximum temperature in degrees Celsius; Rainfall: the amount of rainfall recorded …

Gender Recognition By Voice

Logistic regression is a machine learning classification algorithm used to predict the probability of a categorical dependent variable. In logistic regression, the dependent variable is a binary variable that contains data coded as 1 (yes, success, etc.) or 0 (no, failure, etc.). We can use logistic regression to predict whether an email is spam (1) or not (0), whether a tumor is malignant (1) or not (0), or whether a voice/face belongs to a man (1) …
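To make the idea of predicting a probability for a 0/1 outcome concrete, the following is a minimal sketch of one-feature logistic regression fitted by plain gradient descent. The names `sigmoid` and `fit_logistic`, the learning rate, the epoch count, and the toy data are all assumptions chosen for illustration; this is not the code used in the project.

```python
import math

def sigmoid(z):
    """Map any real number to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, epochs=5000):
    """Fit p(y=1 | x) = sigmoid(w * x + b) by gradient descent on the log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradient of the average log loss with respect to w and b.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Toy data, made up for illustration: small x is labelled 0, large x is labelled 1.
xs = [0.5, 1.0, 1.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 1, 1]

w, b = fit_logistic(xs, ys)
p_low = sigmoid(w * 1.0 + b)   # estimated probability of class 1 at x = 1.0
p_high = sigmoid(w * 3.5 + b)  # estimated probability of class 1 at x = 3.5
```

A prediction is then obtained by thresholding the probability, commonly at 0.5.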

Public Transport Access Points in the UK

In this project, we are going to visualise and analyse all the public transport access points in Manchester. The project focuses on spatial data, using geopandas and shapely in Python. We will use the Counties and Unitary Authorities (December 2016) Super Generalised Clipped Boundaries in England and Wales dataset and the National Public Transport Access Nodes dataset, both provided by the UK government. Check out the links below to download the datasets: Dataset1 Dataset2. Let’s import …

Prediction of Pulsar Stars

The HTRU2 dataset describes a sample of pulsar candidates collected during the High Time Resolution Universe Survey. Pulsars are a rare type of neutron star that produce radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the interstellar medium, and states of matter. As pulsars rotate, their emission beam sweeps across the sky, and when this crosses our line of sight, produces a detectable pattern of broadband …

Prediction of Breast Cancer Diagnosis

In this project, we aim to predict whether a tumor is benign or malignant by training a Random Forest classification model and a K-Means clustering model on the target diagnosis. The data contains the following columns: id: ID number; diagnosis: the diagnosis of breast tissues (M = malignant, B = benign); radius_mean: mean of distances from center to points on the perimeter; texture_mean: standard deviation of gray-scale values; perimeter_mean: mean size of the core tumor; area_mean: mean area size of …

Prediction of Shopping Behaviour

Stores are looking for new ways to promote sales and increase income. One approach that is increasingly used is cross-selling: “an action or practice of selling an additional product or service to an existing customer”. It is important to understand how products and services should be combined to increase their sales. This is the subject of a technique called Market Basket Analysis (MBA), or product association analysis. Market Basket …
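As a hedged illustration of what product association analysis measures, the sketch below computes three measures commonly used in Market Basket Analysis: support, confidence, and lift. The excerpt above does not say which measures the project uses, so treat this as an assumption; the helper names and the `transactions` list are hypothetical.

```python
# Hypothetical shopping baskets, for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "butter", "eggs"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Estimated probability of `consequent` given that `antecedent` was bought."""
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)

def lift(antecedent, consequent, transactions):
    """How much more often the items co-occur than if they were independent."""
    return confidence(antecedent, consequent, transactions) / support(consequent, transactions)

bread_support = support({"bread"}, transactions)
butter_to_bread_conf = confidence({"butter"}, {"bread"}, transactions)
butter_to_bread_lift = lift({"butter"}, {"bread"}, transactions)
```

A lift above 1 suggests the two items are bought together more often than chance would predict, which is the kind of pairing a cross-selling offer might target.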

Prediction of Diabetes Occurrence

In this project, we aim to predict the occurrence of diabetes within the Pima Native American group. We implemented the Decision Tree algorithm in Python. The data contains the following columns: times_pregnant: number of times pregnant; plasma_glucose: concentration of plasma glucose in a 2-hour oral glucose tolerance test; diastolic_blood_pressure: measured in mmHg; tricep_skin_fold_thickness: measured in mm; serum_insulin: insulin concentration in serum over a 2-hour period, measured in mu U/ml; body_mass_index: weight in kg / (height in m)^2; diabetes_pedigree_function: function that assigns probability …