Python ML Algorithm

The document provides information about various machine learning algorithms and concepts: - It discusses supervised algorithms like logistic regression, KNN, naive Bayes, random forest, and support vector machines (SVM). Unsupervised algorithms like K-means and mean shift clustering are also covered. - Example datasets like iris flowers, breast cancer tumors, and social network ads are used to demonstrate how different algorithms can be applied. - Key concepts around kernels, centroids, and iterative processes in algorithms are introduced.


About Python Software

- Processor: 32 or 64 bit
- Windows OS: 7, 8 or 10
- Python: 2.7 or 3.7
- Python IDLE
- OpenCV: OpenCV 2.4, opencv-contrib 3.4
- Anaconda3: 4.4
- Database: MySQL-Front
Basic Points About Python
- Numbers are mainly of two types: integers and floats.
- There is no separate long type; the int type can be an integer of any size.
- Strings look like 'This is a string' or "It's a string!".
- You can specify strings using single quotes, such as 'Quote me on this'.
- Strings in double quotes work exactly the same way as strings in single quotes. An example is "What's your name?" (a short sketch follows).
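A minimal sketch (Python 3) illustrating these points:

# Numbers: integers and floats are the two main numeric types (no separate long type).
count = 12345678901234567890      # an int can be arbitrarily large
price = 3.14                      # a float

# Strings: single and double quotes work exactly the same way.
s1 = 'Quote me on this'
s2 = "What's your name?"          # double quotes make the apostrophe easy
print(type(count), type(price))
print(s1)
print(s2)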
Basic Points About Python
- There is no separate char data type in Python.
- Python is strongly object-oriented in the sense that everything is an object, including numbers, strings and functions.
- The Python for loop is radically different from the C/C++ for loop.
- There is no switch statement in Python; you can use an if..elif..else statement instead.
- Variables are created by simply assigning them a value; no declaration or data type definition is needed (a short sketch follows).
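A minimal sketch illustrating these points:

# Variables are created simply by assignment; no declaration is needed.
x = 10            # x refers to an int
x = "ten"         # the same name can later refer to a string

# The Python for loop iterates over a sequence, unlike the C/C++ counter loop.
for fruit in ["apple", "banana", "cherry"]:
    print(fruit)

# There is no switch statement; an if..elif..else chain is used instead.
marks = 75
if marks >= 80:
    print("Distinction")
elif marks >= 40:
    print("Pass")
else:
    print("Fail")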
Types of Machine Learning Algorithms
- Supervised machine learning algorithms
- Unsupervised machine learning algorithms
- Reinforcement machine learning algorithms
Supervised – Logistic Regression
The logistic regression model is a member of the supervised classification family of algorithms.
Logistic regression measures the relationship between the dependent variable (y) and the independent variables (x) by estimating probabilities using a logistic function (the sigmoid curve).
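For reference, a minimal sketch of the sigmoid function itself, which squashes any real-valued input into a probability between 0 and 1 (NumPy is used here):

import numpy as np

def sigmoid(z):
    # Maps any real number z into the range (0, 1), so the output
    # can be read as a probability.
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0))    # 0.5
print(sigmoid(4))    # close to 1
print(sigmoid(-4))   # close to 0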
Linear & Logistic Regression
Example – Marriage: linear regression would predict in which year the marriage will happen, while logistic regression would predict whether the person marries or not.
Independent variables: age, sex, family, job, salary, friends, etc.
Supervised – Logistic Regression
Dataset – Social_Network_Ads
The dataset contains categories such as id, gender, age, etc. Based on these categories we train the machine and predict purchases. Here the independent variables are 'age' and 'estimated salary', and the dependent variable is 'purchased'. The logistic regression algorithm is used to predict purchases from the existing data.
Logistic regression produces results in a binary format. The usual outputs of logistic regression are:
- Yes and No
- True and False
- High and Low
- Pass and Fail
Supervised – Logistic Regression
Dataset – Social_Network_Ads
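A minimal code sketch, assuming a Social_Network_Ads.csv file with 'Age', 'EstimatedSalary' and 'Purchased' columns (the file name and exact column names are assumptions):

# A minimal sketch; the CSV file name and column names are assumed.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, accuracy_score

data = pd.read_csv('Social_Network_Ads.csv')
X = data[['Age', 'EstimatedSalary']].values     # independent variables
y = data['Purchased'].values                    # dependent variable (0 or 1)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Scale the features because age and salary have very different ranges.
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

classifier = LogisticRegression()
classifier.fit(X_train, y_train)

y_pred = classifier.predict(X_test)
print(confusion_matrix(y_test, y_pred))
print('Accuracy:', accuracy_score(y_test, y_pred))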
KNN – K Nearest Neighbor
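A minimal sketch of a K-Nearest Neighbor classifier, shown here on scikit-learn's built-in iris dataset (the dataset choice, the split ratio and k = 5 are assumptions, not taken from the slides):

# KNN: each prediction is the majority class among the k nearest training points.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=0)

knn = KNeighborsClassifier(n_neighbors=5)   # k = 5 neighbors (assumed)
knn.fit(X_train, y_train)

print('Accuracy:', accuracy_score(y_test, knn.predict(X_test)))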
Supervised – Gaussian Naïve Bayes
Naïve Bayes Classifier: there are three types of Naïve Bayes models available under the scikit-learn package, named Gaussian, Multinomial and Bernoulli.
Naïve Bayes is a classification technique used to build a classifier using Bayes' theorem.
Naive Bayes can be extended to real-valued attributes, most commonly by assuming a Gaussian distribution. This extension of Naive Bayes is called Gaussian Naive Bayes.
The Gaussian (or normal) distribution is the easiest to work with because you only need to estimate the mean and the standard deviation from your training data.
Supervised – Gaussian Naïve Bayes
Dataset – Breast Cancer Tumors
Dataset: the Breast Cancer Wisconsin Diagnostic Database.
The dataset includes various information about breast cancer tumors, as well as classification labels of malignant or benign. It has 569 instances, or data points, on 569 tumors and includes information on 30 attributes, or features, such as the radius of the tumor, texture, smoothness, and area. We can import this dataset from the sklearn package.
Naïve Bayes
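A minimal sketch using scikit-learn's built-in copy of this dataset (the train/test split ratio is an assumption):

# Gaussian Naive Bayes on the Breast Cancer Wisconsin dataset.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

data = load_breast_cancer()
X, y = data.data, data.target        # 569 instances, 30 features, labels 0/1

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

gnb = GaussianNB()                   # estimates a mean and std per feature per class
gnb.fit(X_train, y_train)

print('Accuracy:', accuracy_score(y_test, gnb.predict(X_test)))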
Supervised – Random Forest Classifier
Dataset – Breast Cancer Tumors
Random forest can be used both for classification and regression. It is also one of the most flexible and easy-to-use algorithms.
A forest is comprised of trees, and the more trees it has, the more robust the forest is said to be. Random forest creates decision trees on randomly selected data samples, gets a prediction from each tree and selects the best solution by means of voting.
It also provides a pretty good indicator of feature importance.
The random forest algorithm is an ensemble classification algorithm; an ensemble classifier means a group of classifiers.
Supervised – Random Forest Classifier
Example: you want to go on a trip and would like to travel to a place you will enjoy. To find such a place you could search online, read reviews on travel blogs and portals, or ask your friends.
Suppose you decide to ask your friends and talk with them about their past travel experience to various places. You will get some recommendations from every friend.
Now you make a list of those recommended places. Then you ask your friends to vote (select the one best place for the trip) from the list of recommended places you made. The place with the highest number of votes will be your final choice for the trip.
In the above decision process there are two parts. The first is asking your friends about their individual travel experience and getting one recommendation out of the multiple places they have visited. This part is like using the decision tree algorithm: each friend makes a selection from the places he or she has visited so far.
The second part, after collecting all the recommendations, is the voting procedure for selecting the best place from the list of recommendations. This whole process of getting recommendations from friends and voting on them to find the best place is known as the random forest algorithm.
The collection of decision tree classifiers is also known as the forest. The individual decision trees are generated using an attribute selection indicator such as information gain, gain ratio or Gini index for each attribute. Each tree depends on an independent random sample. In a classification problem each tree votes and the most popular class is chosen as the final result. In the case of regression, the average of all the tree outputs is taken as the final result.
Supervised – Random Forest Classifier
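A minimal sketch of a random forest classifier on the same breast cancer dataset (the number of trees and the split ratio are assumptions):

# Random forest: many decision trees trained on random samples, voting on the class.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.3, random_state=42)

forest = RandomForestClassifier(n_estimators=100, random_state=42)
forest.fit(X_train, y_train)

print('Accuracy:', accuracy_score(y_test, forest.predict(X_test)))

# Feature importance: how much each of the 30 attributes contributed (top 5 shown).
for name, score in sorted(zip(data.feature_names, forest.feature_importances_),
                          key=lambda pair: pair[1], reverse=True)[:5]:
    print(name, round(score, 3))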
Supervised – Support Vector Machines (SVM)
SVM is a supervised machine learning algorithm that can be used for both regression and classification.
The main concept of SVM is to plot each data item as a point in n-dimensional space, with the value of each feature being the value of a particular coordinate.
Supervised – Support Vector Machines (SVM)
Dataset – Iris Flower
The iris dataset contains 3 classes of 50 instances each, where each class refers to a type of iris plant. Each instance has four features, namely sepal length, sepal width, petal length and petal width. The SVM classifier predicts the class of the iris plant based on these 4 features.
Supervised – Support Vector Machines (SVM)
Dataset – Iris Flower
The three classes in the Iris dataset:
- Iris-setosa (n=50)
- Iris-versicolor (n=50)
- Iris-virginica (n=50)
The four features of the Iris dataset:
- sepal length in cm
- sepal width in cm
- petal length in cm
- petal width in cm
Supervised – SVM
Kernel: a technique used by SVM. Kernels are basically functions which take a low-dimensional input space and transform it into a higher-dimensional space.
Kernel functions include linear, polynomial, Gaussian (RBF) and sigmoid. In this example, we will use the linear kernel.
SVM and Kernel SVM with Python's Scikit-Learn
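A minimal sketch of a linear-kernel SVM on the iris dataset, along the lines described above (the train/test split ratio is an assumption):

# Linear-kernel SVM on the iris dataset (3 classes, 4 features).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import classification_report

iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=0)

clf = SVC(kernel='linear')       # other kernels: 'poly', 'rbf', 'sigmoid'
clf.fit(X_train, y_train)

print(classification_report(y_test, clf.predict(X_test),
                            target_names=iris.target_names))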
Unsupervised (Clustering) – K-Means algorithm
Clustering is the task of dividing a set of observations into subsets, called clusters, in such a way that observations in the same cluster are similar in one sense and dissimilar to the observations in other clusters. In simple words, the main goal of clustering is to group the data on the basis of similarity and dissimilarity.
The K-means algorithm is one of the well-known algorithms for clustering data. It assumes that the number of clusters is already known. The steps of this algorithm:
Step 1 − Specify K, the desired number of subgroups (clusters).
Step 2 − Fix the number of clusters and randomly assign each data point to a cluster; in other words, classify the data based on the number of clusters.
Unsupervised (Clustering) – K-Means algorithm
K-Means is a flat, iterative, centroid-based clustering algorithm.
As this is an iterative algorithm, we need to update the locations of the K centroids with every iteration until we find the global optimum, or in other words until the centroids reach their optimal locations.
In centroid-based clustering, clusters are represented by a central vector, or centroid. This centroid need not be a member of the dataset. Centroid-based clustering is an iterative algorithm in which the notion of similarity is derived from how close a data point is to the centroid of the cluster.
https://mubaris.com/posts/kmeans-clustering/
Unsupervised (Clustering) – K-Means algorithm
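A minimal sketch of K-Means using scikit-learn (the synthetic 2-D data and K = 3 are assumptions used only for illustration):

# K-Means with K = 3 on synthetic blob data.
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)           # cluster index for every point

print('Centroid locations:')
print(kmeans.cluster_centers_)
print('First ten cluster assignments:', labels[:10])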
Unsupervised (Clustering) – Mean Shift algorithm
Mean shift is another popular and powerful clustering algorithm used in unsupervised learning.
It does not make any assumptions about the number or shape of the clusters, hence it is a non-parametric algorithm. It is also called mean shift cluster analysis.
Basic steps of this algorithm (a code sketch follows the list):
- First of all, start with the data points each assigned to a cluster of their own.
- Next, compute the centroids and update the locations of the new centroids.
- By repeating this process, we move closer to the peak of the cluster, i.e. towards the region of higher density.
- The algorithm stops when the centroids do not move anymore.
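A minimal sketch of Mean Shift using scikit-learn; note that the number of clusters is discovered rather than given (the synthetic data and the bandwidth estimate are assumptions):

# Mean Shift: the number of clusters is not specified in advance.
from sklearn.datasets import make_blobs
from sklearn.cluster import MeanShift, estimate_bandwidth

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

# Bandwidth controls the size of the window shifted towards denser regions.
bandwidth = estimate_bandwidth(X, quantile=0.2)
ms = MeanShift(bandwidth=bandwidth, bin_seeding=True)
ms.fit(X)

print('Clusters found:', len(ms.cluster_centers_))
print('Cluster centers:')
print(ms.cluster_centers_)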