One company prepared 10 great version of the same Ad. they are not very sure which Ad is suitable to put it on the social network. They want to put the Ad that will get the maximum clicks and lead to the best Conversion Rates. The company would like to hire a data scientist to find the best strategy to find out which version of the Ad is the best for the user and which …
You have a supermarket mall and through membership cards, you have some basic data about your customers like Customer ID, age, gender, annual income and spending score. Spending Score is something you assign to the customer based on your defined parameters like customer behavior and purchasing data. By the end of this case study, you would be able to answer the below questions. 1- How to achieve customer segmentation using Machine Learning algorithm (Hierarchical Clustering) …
In this project, we aim to predict whether a tumor is benign or malignant. we implemented KNN on Python. The data contains the following columns: BI_RADS_assessment: Definitely benign(1) to Highly suggestive of malignancy (5) Age: patient’s age in years Shape: mass shape: round=1 oval=2 lobular=3 irregular=4 (nominal) Margin: mass margin: circumscribed=1 microlobulated=2 obscured=3 ill-defined=4 spiculated=5 (nominal) Density: mass density high=1 iso=2 low=3 fat-containing=4 (ordinal) Severity: Predictor Class: benign=0 or malignant=1 . let’s get our environment …
In this project, we aim to predict whether or not it will rain tomorrow by training a Random Forest classification model on target RainTomorrow. This dataset contains daily weather observations from numerous Australian weather stations. The data contains the following columns: Date: The date of observation Location: The common name of the location of the weather station MinTemp: The minimum temperature in degrees celsius MaxTemp: The maximum temperature in degrees celsius Rainfall: The amount of rainfall recorded …
Logistic Regression is a Machine Learning classification algorithm that is used to predict the probability of a categorical dependent variable. In logistic regression, the dependent variable is a binary variable that contains data coded as 1 (yes, success, etc.) or 0 (no, failure, etc.). We use Logistic Regression To predict whether an email is spam (1) or not (0), Whether the tumor is malignant (1) or not (0), To predict whether a voice/face man (1) …
The HTRU2 dataset describes a sample of pulsar candidates collected during the High Time Resolution Universe Survey. Pulsars are a rare type of Neutron star that produce radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the inter-stellar medium, and states of matter . As pulsars rotate, their emission beam sweeps across the sky, and when this crosses our line of sight, produces a detectable pattern of broadband …
In this project we aim to Predict if tumor is benign or malignant by training a Random Forest classification and K-Means clustering model on target Diagnosis. The data contains the following columns: id: ID number diagnosis: The diagnosis of breast tissues (M = malignant, B = benign) radius_mean: mean of distances from center to points on the perimeter texture_mean: standard deviation of gray-scale values perimeter_mean: mean size of the core tumor area_mean: mean area size of …
Stores are looking for new ways to promote their sale and increase their income. An increase can be found in cross-selling these days. Cross-selling is “an action or practice of selling an additional product or service to an existing customer”. It is important to understand how the products and services should be combined to increase their sale. It is the subject of a technique called Market Basket Analysis (MBA) or product association analysis. Market Basket …