Predicting Penguin Species

ML project:- supervised learning

Made by:- Koustubh Sinha

This project uses the penguin dataset to predict the species of a penguin from a set of features, using an appropriate model.

About penguin dataset

It is a great introductory dataset for data exploration and visualization.

The dataset consists of 7 columns:

- species: penguin species (Chinstrap, Adélie, or Gentoo)
- culmen_length_mm: culmen length (mm)
- culmen_depth_mm: culmen depth (mm)
- flipper_length_mm: flipper length (mm)
- body_mass_g: body mass (g)
- island: island name (Dream, Torgersen, or Biscoe) in the Palmer Archipelago (Antarctica)
- sex: penguin sex

What are culmen length & depth?

What are flippers?

The description of the dataset and a detailed exploratory analysis of it are in the predicting_penguins.ipynb file. Please go through it.

Getting started

Workflow:-

* Step 1:- Data cleaning and visualization

The data is cleaned and then visualized through a variety of plots.

* Step 2:- Exploratory data analysis and data preprocessing

Understand the data and prepare the dataset for model fitting (label encoding, normalization).
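
The two preprocessing ideas named above can be sketched in plain Python. This is only an illustration under stated assumptions: the helper names `label_encode` and `min_max_scale` are hypothetical, min-max scaling is assumed as the normalization (the notebook may use a different scaler), and the sample values are made up rather than taken from penguins_size.csv.

```python
def label_encode(values):
    """Map each distinct label to an integer code, in sorted label order."""
    classes = sorted(set(values))
    mapping = {label: code for code, label in enumerate(classes)}
    return [mapping[v] for v in values], classes


def min_max_scale(column):
    """Rescale numeric values linearly into the range [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:  # constant column: avoid division by zero
        return [0.0 for _ in column]
    return [(x - lo) / (hi - lo) for x in column]


# Made-up illustrative values, not rows from penguins_size.csv.
islands = ["Torgersen", "Biscoe", "Dream", "Biscoe"]
flipper_length_mm = [180.0, 210.0, 195.0, 220.0]

island_codes, island_classes = label_encode(islands)
flipper_scaled = min_max_scale(flipper_length_mm)

print(island_classes)   # ['Biscoe', 'Dream', 'Torgersen']
print(island_codes)     # [2, 0, 1, 0]
print(flipper_scaled)   # [0.0, 0.75, 0.375, 1.0]
```

Encoding turns text columns such as island and sex into numbers a model can consume, and scaling keeps a large-valued column such as body_mass_g from dominating distance-based models.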

* Step 3:- Model Training and selection

As seen from step 2, KNeighborsClassifier (KNC) is the model best suited to our dataset.

About KNeighborsClassifier

KNeighborsClassifier (KNC) is one of the simplest machine learning algorithms and is based on the supervised learning technique.
The KNC algorithm assumes similarity between a new case and the available cases, and puts the new case into the category whose cases it most resembles.
The KNC algorithm stores all the available data and classifies a new data point based on its similarity to the stored points. This means that when new data appears, it can be easily classified into a well-suited category.
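
The idea described above, storing the data and classifying a new point by its most similar stored points, can be sketched from scratch as follows. This is a minimal illustration, not scikit-learn's KNeighborsClassifier implementation: the function name `knn_predict`, the choice of Euclidean distance, the value `k=3`, and the sample points are all assumptions made for the example.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every stored training point.
    distances = [
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    ]
    # Keep the k closest points and count their labels.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Made-up, already-normalized feature vectors (two features per penguin);
# these are illustrative values, not rows from penguins_size.csv.
train_X = [
    (0.10, 0.80), (0.15, 0.75), (0.20, 0.85),  # labelled "Adelie"
    (0.80, 0.20), (0.85, 0.25), (0.90, 0.15),  # labelled "Gentoo"
]
train_y = ["Adelie", "Adelie", "Adelie", "Gentoo", "Gentoo", "Gentoo"]

# Classify two new data points, one close to each group.
print(knn_predict(train_X, train_y, (0.12, 0.78)))
print(knn_predict(train_X, train_y, (0.88, 0.22)))
```

Because there is no training step beyond storing the points, all the work happens at prediction time, which is why features should be on comparable scales before distances are computed.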

* Step 4:- Testing new data points

About

Releases

Packages

Contributors 2

Languages

koustubh1317/penguinspecies

Folders and files

Latest commit

History

Repository files navigation

Predicting Penguins species

ML project:- supervised learning

Made by:-Koustubh sinha

Getting started

* Step 1 :- Data Cleaning and visualization

* Step 2:- Exploratory data analysis and data preprocessing

* Step 3:- Model Training and selection

About KNeighborsClassifier

* Step 4:- Testing new data points

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages