Predicting Iris Flower Species with Machine Learning

Photo Credits:
Christina Brinza.

About the Dataset

Exploratory Data Analysis

From this graph below we can see that setosa species can be linearly separated but versicolor and virginica have overlap.

Density Plot

Density Plot shows the distribution of observations for sepal lengths.

Clustering Using K-means

Decision Tree

From the confusion matrix we see that 6 observations are incorrectly classified.

Decision Tree gives us 85% accuracy on unknown data.


Becker, R. A., Chambers, J. M. and Wilks, A. R. (1988) The New S Language. Wadsworth & Brooks/Cole. (has iris3 as iris.)



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Poonam Rao

Exec Director StratEx - I bring to the table blend of data science, finance and strategy management skills with 20+ years of experience in insurance & fintech.