In this project, we aim to predict whether the message is spam or ham. we implemented Natural Language Processing, TF-IDF and SVM on Python. The data contains the following columns: Message: text message Category: Spam or Ham . let’s get our environment ready with the libraries we’ll need and then import the data! Check out the Data . Exploratory Data Analysis Let’s use describe by Category, this way we can begin to think about the …
The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson’s Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris Setosa, Iris virginica, …