Tag Archives: Machine Learning

Prediction of Pulsar Stars

The HTRU2 dataset describes a sample of pulsar candidates collected during the High Time Resolution Universe Survey. Pulsars are a rare type of Neutron star that produce radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the inter-stellar medium, and states of matter . As pulsars rotate, their emission beam sweeps across the sky, and when this crosses our line of sight, produces a detectable pattern of broadband …

Prediction of Breast Cancer Diagnosis

In this project we aim to Predict if tumor is benign or malignant by training a Random Forest classification and K-Means clustering model on target Diagnosis. The data contains the following columns: id: ID number diagnosis: The diagnosis of breast tissues (M = malignant, B = benign) radius_mean: mean of distances from center to points on the perimeter texture_mean: standard deviation of gray-scale values perimeter_mean: mean size of the core tumor area_mean: mean area size of …

Prediction of Shopping Behaviour

Stores are looking for new ways to promote their sale and increase their income. An increase can be found in cross-selling these days. Cross-selling is “an action or practice of selling an additional product or service to an existing customer”. It is important to understand how the products and services should be combined to increase their sale. It is the subject of a technique called Market Basket Analysis (MBA) or product association analysis.  Market Basket …

Prediction of Diabetes Occurrence​

In this project, we aim to predict the occurrence of diabetes within the PIMA Native American Group. We implemented the Decision Tree algorithm on Python. The data contains the following columns: times_pregnant: Number of times pregnant plasma_glucose: Concentration of plasma glucose in a 2 hour oral glucose tolerance test diastolic_blood_pressure: Measured in mmHg tricep_skin_fold_thickness: Measured in mm serum_insulin: Insulin concentration in serum in 2-hour period. Measured in (mu U/ml) body_mass_index: Weight in kg/height in (m^2) diabetes_pedigree_function: Function that assigns probability …

Prediction Of Iris Species

The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson’s Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris Setosa, Iris virginica, …

Recommender Systems

What is Recommendation System : The recommendation systems are defined as a software tools to give suggestions for items to the users in which they might be interested. The suggestions might be related to decision-making processes which can be within these – what movie / TV series to watch, what playlist to listen, what items to buy, what news to read, or what videos to watch and many more in this list. The main objective …