Decision Tree Algorithm

notes for Decision Tree Algorithm

Uploaded by

Aatish


Decision Tree Algorithm With Hands-On Example

The decision tree is one of the most widely used machine learning algorithms. It can be applied to both classification and regression problems. In this article, we will go through the classification part.

What is a decision tree?

A decision tree is a classification and prediction tool with a tree-like structure, where each internal node denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node (terminal node) holds a class label.

Above we have a small decision tree. An important advantage of the decision tree is that it is highly interpretable. Here, if height > 180 cm, or if height ≤ 180 cm and weight > 80 kg, the person is classified as male; otherwise, the person is classified as female. Did you ever think about how we came up with this decision tree? I will try to explain it using the weather dataset.
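To make the interpretability point concrete, the small height/weight tree described above can be read as nested conditionals. The following is only a sketch: the function name `classify_person` is hypothetical, and the thresholds are the ones quoted in the text.

```python
def classify_person(height_cm: float, weight_kg: float) -> str:
    """Walk the small example tree from the root test down to a leaf."""
    if height_cm > 180:      # root node: test on height
        return "male"
    if weight_kg > 80:       # internal node reached only when height <= 180
        return "male"
    return "female"          # leaf for height <= 180 and weight <= 80


print(classify_person(185, 70))
print(classify_person(170, 90))
print(classify_person(165, 55))
```

Each `if` corresponds to an internal node, and each `return` corresponds to a leaf holding a class label.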

Before going further, I will explain some important terms related to decision trees.

Entropy

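In machine learning, entropy is a measure of the randomness (impurity) in the information being processed. The higher the entropy, the harder it is to draw any conclusions from that information. For a set whose classes occur with proportions p_i, the entropy is E = -Σ p_i log2(p_i).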
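As a rough illustration of this definition, the sketch below computes Shannon entropy in bits from a list of class proportions. The function name `entropy` and the sample proportions are illustrative choices, not part of the original notes.

```python
import math


def entropy(probabilities):
    """Shannon entropy in bits; zero-probability classes contribute nothing."""
    return sum(-p * math.log2(p) for p in probabilities if p > 0)


pure = entropy([1.0])          # a set with a single class
mixed = entropy([0.5, 0.5])    # two equally likely classes
skewed = entropy([0.9, 0.1])   # mostly one class

print(pure, mixed, skewed)
```

A pure set has entropy 0, an evenly mixed two-class set has entropy 1 bit, and a skewed set falls in between.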
Information Gain

Information gain can be defined as the amount of information gained about a random variable or signal from observing another random variable. For a decision tree, it is the difference between the entropy of the parent node and the weighted average entropy of the child nodes.
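The definition can be sketched in a few lines of Python. The helper names and the toy class counts below are assumptions made for illustration; each node is represented simply by its per-class example counts.

```python
import math


def entropy(counts):
    """Entropy in bits of a node described by its per-class counts."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


def information_gain(parent_counts, children_counts):
    """Parent entropy minus the size-weighted average entropy of the children."""
    total = sum(parent_counts)
    weighted = sum(sum(child) / total * entropy(child) for child in children_counts)
    return entropy(parent_counts) - weighted


# Hypothetical parent node with 4 examples of each class.
perfect_split = information_gain([4, 4], [[4, 0], [0, 4]])
useless_split = information_gain([4, 4], [[2, 2], [2, 2]])

print(perfect_split, useless_split)
```

A split that separates the classes completely recovers all of the parent's entropy, while a split that leaves every child as mixed as the parent gains nothing.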

Gini Impurity

Gini impurity is a measure of how often a randomly chosen element from the set would be incorrectly labeled if it were randomly labeled according to the distribution of labels in the subset. For class proportions p_i, it equals 1 - Σ p_i².

Gini impurity is lower bounded by 0, with 0 occurring if the data set contains only one class.
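A minimal sketch of the Gini impurity computation follows, assuming a node is described by its per-class counts. The function name `gini_impurity` is illustrative.

```python
def gini_impurity(counts):
    """Gini impurity: one minus the sum of squared class proportions."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


print(gini_impurity([4, 0]))   # a single class
print(gini_impurity([5, 5]))   # two balanced classes
```

The single-class node gives the lower bound of 0; for two classes the impurity peaks at 0.5 when the classes are balanced.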
There are many algorithms for building a decision tree. Two common ones are:

1. CART (Classification and Regression Trees): uses Gini impurity as the metric.

2. ID3 (Iterative Dichotomiser 3): uses entropy and information gain as the metric.

In this article, I will go through ID3. Once you understand it, it is easy to implement the same using CART.

Classification using the ID3 algorithm

Consider a weather dataset, based on which we will determine whether to play football or not.

Here there are four independent variables that determine the dependent variable. The independent variables are Outlook, Temperature, Humidity, and Wind. The dependent variable is whether to play football or not.

As the first step, we have to find the root node of our decision tree. To do that, follow these steps:

Find the entropy of the class variable.

E(S) = -[(9/14)log(9/14) + (5/14)log(5/14)] = 0.94

Note: The logarithm is taken to base 2. There are 14 examples in total, of which 9 are yes and 5 are no. The probabilities above are calculated from these counts.

From the above data, we can easily arrive at the following table for Outlook.

Now we have to calculate the average weighted entropy, i.e., the sum of the entropies of each value of the feature, weighted by the proportion of examples that take that value. Here E(a,b) denotes the entropy of a subset with a examples of one class and b of the other.

E(S, outlook) = (5/14)*E(3,2) + (4/14)*E(4,0) + (5/14)*E(2,3)
= (5/14)(-(3/5)log(3/5) - (2/5)log(2/5)) + (4/14)(0) + (5/14)(-(2/5)log(2/5) - (3/5)log(3/5))
= 0.693

The next step is to find the information gain. It is the difference between the parent entropy and the average weighted entropy we found above.

IG(S, outlook) = 0.94 - 0.693 = 0.247
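The arithmetic for Outlook can be checked with a short script. This is a sketch that uses only the counts quoted above (9 yes and 5 no overall; subsets of sizes (3,2), (4,0), and (2,3)); the function and variable names are illustrative.

```python
import math


def entropy(counts):
    """Entropy in bits of a subset given its (yes, no) counts."""
    total = sum(counts)
    return sum(-(c / total) * math.log2(c / total) for c in counts if c > 0)


class_counts = (9, 5)                       # yes / no over all 14 examples
outlook_counts = [(3, 2), (4, 0), (2, 3)]   # one pair per Outlook value

parent = entropy(class_counts)
weighted = sum(sum(c) / sum(class_counts) * entropy(c) for c in outlook_counts)
gain = parent - weighted

print(round(parent, 3), round(weighted, 3), round(gain, 3))
```

With full floating-point precision the weighted entropy is about 0.6935, which is 0.693 when truncated to three decimals, and the gain rounds to 0.247 as in the text.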

Similarly, find the information gain for Temperature, Humidity, and Windy.

IG(S, Temperature) = 0.940 - 0.911 = 0.029

IG(S, Humidity) = 0.940 - 0.788 = 0.152

IG(S, Windy) = 0.940 - 0.892 = 0.048
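The same calculation can be run for every feature at once. The dataset table is not reproduced in these notes, so the sketch below assumes the widely used 14-row play/weather dataset, whose information gains match the values listed above. The rows, `FEATURES`, and the helper names are assumptions for illustration.

```python
import math
from collections import Counter

FEATURES = ["Outlook", "Temperature", "Humidity", "Windy"]

# Assumed rows: (outlook, temperature, humidity, windy, play).
ROWS = [
    ("sunny", "hot", "high", "false", "no"),
    ("sunny", "hot", "high", "true", "no"),
    ("overcast", "hot", "high", "false", "yes"),
    ("rainy", "mild", "high", "false", "yes"),
    ("rainy", "cool", "normal", "false", "yes"),
    ("rainy", "cool", "normal", "true", "no"),
    ("overcast", "cool", "normal", "true", "yes"),
    ("sunny", "mild", "high", "false", "no"),
    ("sunny", "cool", "normal", "false", "yes"),
    ("rainy", "mild", "normal", "false", "yes"),
    ("sunny", "mild", "normal", "true", "yes"),
    ("overcast", "mild", "high", "true", "yes"),
    ("overcast", "hot", "normal", "false", "yes"),
    ("rainy", "mild", "high", "true", "no"),
]


def entropy(labels):
    """Entropy in bits of a list of class labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, index):
    """Gain from splitting rows on the feature stored at position index."""
    groups = {}
    for row in rows:
        groups.setdefault(row[index], []).append(row[-1])
    weighted = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[-1] for row in rows]) - weighted


gains = {name: information_gain(ROWS, i) for i, name in enumerate(FEATURES)}
best = max(gains, key=gains.get)

for name, value in gains.items():
    print(name, round(value, 3))
print("best split:", best)
```

Under these assumed rows, Outlook has the largest gain, which agrees with the choice of root node made in the text.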

Now select the feature with the largest information gain. Here it is Outlook, so it forms the first node (root node) of our decision tree.

Now our data look as follows.

Since overcast contains only examples of class ‘Yes’, we can set it as yes. That means that if the outlook is overcast, football will be played. Now our decision tree looks as follows.

The next step is to find the next node in our decision tree. We will now find the one under sunny. We have to determine which of Temperature, Humidity, or Wind has the highest information gain within the sunny subset.

Calculate the parent entropy E(sunny):

E(sunny) = -(3/5)log(3/5) - (2/5)log(2/5) = 0.971

Now calculate the information gain of Temperature, IG(sunny, Temperature). First find the average weighted entropy:

E(sunny, Temperature) = (2/5)*E(0,2) + (2/5)*E(1,1) + (1/5)*E(1,0) = (2/5)(0) + (2/5)(1) + (1/5)(0) = 2/5 = 0.4

Now calculate the information gain.

IG(sunny, Temperature) = 0.971 - 0.4 = 0.571

Similarly we get

IG(sunny, Humidity) = 0.971

IG(sunny, Windy) = 0.020

Here IG(sunny, Humidity) is the largest value, so Humidity is the node that comes under sunny.

For humidity, from the above table, we can say that play will occur if humidity is normal and will not occur if it is high. Similarly, find the nodes under rainy.
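The sunny-branch numbers can be reproduced the same way. The five sunny rows below are an assumption taken from the commonly used version of the weather dataset; they are consistent with the counts E(0,2), E(1,1), and E(1,0) used above for Temperature. Names are illustrative.

```python
import math
from collections import Counter

SUNNY_FEATURES = ["Temperature", "Humidity", "Windy"]

# Assumed sunny rows: (temperature, humidity, windy, play).
SUNNY_ROWS = [
    ("hot", "high", "false", "no"),
    ("hot", "high", "true", "no"),
    ("mild", "high", "false", "no"),
    ("cool", "normal", "false", "yes"),
    ("mild", "normal", "true", "yes"),
]


def entropy(labels):
    """Entropy in bits of a list of class labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, index):
    """Gain from splitting rows on the feature stored at position index."""
    groups = {}
    for row in rows:
        groups.setdefault(row[index], []).append(row[-1])
    weighted = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[-1] for row in rows]) - weighted


sunny_gains = {
    name: information_gain(SUNNY_ROWS, i) for i, name in enumerate(SUNNY_FEATURES)
}
for name, value in sunny_gains.items():
    print(name, round(value, 3))
```

Humidity splits the sunny rows into two pure groups, so its gain equals the full parent entropy of 0.971.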

Note: A branch with entropy greater than 0 needs further splitting.

Finally, our decision tree will look as below:
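The whole procedure (pick the feature with the largest information gain, split, and repeat on every impure branch) can be sketched as a short recursive function. This is an illustrative ID3-style sketch rather than a reference implementation: the 14 rows are the assumed classic weather dataset, the tree is returned as nested dictionaries, and all names are hypothetical.

```python
import math
from collections import Counter

FEATURES = ["Outlook", "Temperature", "Humidity", "Windy"]

# Assumed rows: (outlook, temperature, humidity, windy, play).
ROWS = [
    ("sunny", "hot", "high", "false", "no"),
    ("sunny", "hot", "high", "true", "no"),
    ("overcast", "hot", "high", "false", "yes"),
    ("rainy", "mild", "high", "false", "yes"),
    ("rainy", "cool", "normal", "false", "yes"),
    ("rainy", "cool", "normal", "true", "no"),
    ("overcast", "cool", "normal", "true", "yes"),
    ("sunny", "mild", "high", "false", "no"),
    ("sunny", "cool", "normal", "false", "yes"),
    ("rainy", "mild", "normal", "false", "yes"),
    ("sunny", "mild", "normal", "true", "yes"),
    ("overcast", "mild", "high", "true", "yes"),
    ("overcast", "hot", "normal", "false", "yes"),
    ("rainy", "mild", "high", "true", "no"),
]


def entropy(labels):
    """Entropy in bits of a list of class labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, index):
    """Gain from splitting rows on the feature stored at position index."""
    groups = {}
    for row in rows:
        groups.setdefault(row[index], []).append(row[-1])
    weighted = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[-1] for row in rows]) - weighted


def id3(rows, feature_indices):
    """Return a class label for a leaf, or {feature: {value: subtree}}."""
    labels = [row[-1] for row in rows]
    if len(set(labels)) == 1:        # entropy 0: no further splitting needed
        return labels[0]
    if not feature_indices:          # no features left: fall back to majority class
        return Counter(labels).most_common(1)[0][0]
    best = max(feature_indices, key=lambda i: information_gain(rows, i))
    remaining = [i for i in feature_indices if i != best]
    branches = {}
    for value in sorted({row[best] for row in rows}):
        subset = [row for row in rows if row[best] == value]
        branches[value] = id3(subset, remaining)
    return {FEATURES[best]: branches}


tree = id3(ROWS, list(range(len(FEATURES))))
print(tree)
```

Under the assumed rows, the root is Outlook, the overcast branch is a yes leaf, and the sunny branch splits on Humidity, as derived above. The rainy branch in this sketch splits on Windy.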

Classification using CART algorithm

Classification using CART is similar to ID3, but instead of entropy, we use Gini impurity.

So, as the first step, we will find the root node of our decision tree. For that, calculate the Gini index of the class variable.

Gini(S) = 1 - [(9/14)² + (5/14)²] = 0.4591

As the next step, we will calculate the Gini gain. For that, we will first find the average weighted Gini impurity of Outlook, Temperature, Humidity, and Windy.

First, consider the case of Outlook.

Gini(S, outlook) = (5/14)*gini(3,2) + (4/14)*gini(4,0) + (5/14)*gini(2,3)
= (5/14)(1 - (3/5)² - (2/5)²) + (4/14)(0) + (5/14)(1 - (2/5)² - (3/5)²)
= 0.171 + 0 + 0.171 = 0.342

Gini gain(S, outlook) = 0.459 - 0.342 = 0.117

Gini gain(S, Temperature) = 0.459 - 0.4405 = 0.0185

Gini gain(S, Humidity) = 0.459 - 0.3674 = 0.0916

Gini gain(S, Windy) = 0.459 - 0.4286 = 0.0304

Choose the feature with the highest Gini gain. The Gini gain is highest for Outlook, so we choose it as our root node.

Now you have an idea of how to proceed further. Repeat the same steps we used in the ID3 algorithm.
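The Gini gains can be checked in the same way as the information gains. In the sketch below the Outlook counts are the ones used above, while the counts for Temperature, Humidity, and Windy are assumptions taken from the commonly used version of the weather dataset; they reproduce the weighted impurities 0.4405, 0.3674, and 0.4286 up to rounding. Names are illustrative.

```python
def gini(counts):
    """Gini impurity of a subset given its (yes, no) counts."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


CLASS_COUNTS = (9, 5)  # yes / no over all 14 examples

# (yes, no) counts for each value of each feature.
SPLITS = {
    "Outlook": [(3, 2), (4, 0), (2, 3)],
    "Temperature": [(2, 2), (4, 2), (3, 1)],   # assumed: hot, mild, cool
    "Humidity": [(3, 4), (6, 1)],              # assumed: high, normal
    "Windy": [(6, 2), (3, 3)],                 # assumed: false, true
}

total = sum(CLASS_COUNTS)
weighted_gini = {
    name: sum(sum(c) / total * gini(c) for c in groups)
    for name, groups in SPLITS.items()
}
gini_gain = {name: gini(CLASS_COUNTS) - w for name, w in weighted_gini.items()}
best = max(gini_gain, key=gini_gain.get)

for name in SPLITS:
    print(name, round(weighted_gini[name], 4), round(gini_gain[name], 4))
print("best split:", best)
```

The printed gains differ from the figures in the text only in the last digit, because the text subtracts already-rounded values. Outlook still comes out as the best split.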

Advantages and disadvantages of decision trees

Advantages:

1. Decision trees are highly interpretable

2. Require little data preprocessing

3. Suitable for low latency applications

Disadvantages:

1. More likely to overfit noisy data. The probability of overfitting on noise increases as a tree gets deeper. One solution is pruning. You can read more about pruning in my Kaggle notebook. Another way to avoid overfitting is to use bagging techniques such as Random Forest. You can read more about Random Forest in an article from neptune.ai.

References:

1. https://www.saedsayad.com/decision_tree.htm

2. Applied-ai course
