MODULE 3
DECISION TREE LEARNING
Decision tree learning is a method for approximating
discrete-valued target functions, in which the learned
function is represented by a decision tree.
DECISION TREE REPRESENTATION
FIGURE: A decision tree for the concept PlayTennis. An example is classified by sorting it through the tree to the appropriate leaf node, then returning the classification associated with this leaf.
• Decision trees classify instances by sorting them down the tree from the root to
some leaf node, which provides the classification of the instance.
• Each node in the tree specifies a test of some attribute of the instance, and each
branch descending from that node corresponds to one of the possible values for
this attribute.
• An instance is classified by starting at the root node of the tree, testing the
attribute specified by this node, then moving down the tree branch corresponding
to the value of the attribute in the given example. This process is then repeated
for the subtree rooted at the new node.
• Decision trees represent a disjunction of conjunctions of constraints on the
attribute values of instances.
• Each path from the tree root to a leaf corresponds to a conjunction of attribute tests,
and the tree itself to a disjunction of these conjunctions.
For example, the decision tree shown in the figure above corresponds to the expression
(Outlook = Sunny ∧ Humidity = Normal) ∨ (Outlook = Overcast) ∨ (Outlook = Rain ∧ Wind = Weak)
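To make this concrete, the sketch below (a minimal Python illustration, not part of the original material) encodes the PlayTennis tree from the figure as nested dictionaries and classifies an instance by sorting it from the root down to a leaf; the dictionary layout and the sample instance are assumptions chosen purely for illustration.

# Illustrative sketch: the PlayTennis tree from the figure, encoded as nested
# dictionaries. Each inner node maps an attribute to one branch per attribute
# value; a plain string is a leaf label.
play_tennis_tree = {
    "Outlook": {
        "Sunny":    {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rain":     {"Wind": {"Strong": "No", "Weak": "Yes"}},
    }
}

def classify(tree, instance):
    """Sort an instance down the tree until a leaf (a plain label) is reached."""
    while isinstance(tree, dict):
        attribute = next(iter(tree))      # attribute tested at this node
        value = instance[attribute]       # the instance's value for that attribute
        tree = tree[attribute][value]     # follow the matching branch
    return tree

# Example: an instance with Outlook = Sunny and Humidity = High is classified as "No".
print(classify(play_tennis_tree,
               {"Outlook": "Sunny", "Temperature": "Hot",
                "Humidity": "High", "Wind": "Strong"}))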
APPROPRIATE PROBLEMS FOR DECISION TREE LEARNING
Decision tree learning is generally best suited to problems with the following
characteristics:
1. Instances are represented by attribute-value pairs – Instances are described by a
fixed set of attributes and their values
2. The target function has discrete output values – The decision tree assigns a
Boolean classification (e.g., yes or no) to each example. Decision tree methods
easily extend to learning functions with more than two possible output values.
3. Disjunctive descriptions may be required – Decision trees naturally represent disjunctive expressions
4. The training data may contain errors – Decision tree learning methods are
robust to errors, both errors in classifications of the training examples and errors in the
attribute values that describe these examples.
5. The training data may contain missing attribute values – Decision tree
methods can be used even when some training examples have unknown values.
Decision tree learning has been applied to problems such as learning to classify medical
patients by their disease, equipment malfunctions by their cause, and loan applicants
by their likelihood of defaulting on payments. Such problems, in which the task is to
classify examples into one of a discrete set of possible categories, are often referred to as
classification problems.
THE BASIC DECISION TREE LEARNING ALGORITHM
Most algorithms that have been developed for learning decision trees are variations
on a core algorithm that employs a top-down, greedy search through the space of
possible decision trees. This approach is exemplified by the ID3 algorithm
and its successor, C4.5.
What is the ID3 algorithm?
• ID3 stands for Iterative Dichotomiser 3
• ID3 is a precursor to the C4.5 Algorithm.
• The ID3 algorithm was invented by Ross Quinlan in 1975
• Used to generate a decision tree from a given data set by employing a top-down,
greedy search that tests each attribute at every node of the tree.
• The resulting tree is used to classify future samples.
ID3 Algorithm
ID3(Examples, Target_Attribute, Attributes)
Examples are the training examples. Target_Attribute is the attribute whose value is to
be predicted by the tree. Attributes is a list of other attributes that may be tested by
the learned decision tree. Returns a decision tree that correctly classifies the given
Examples.
ID3 Algorithm:
  Create a Root node for the tree
  If all Examples are positive, Return the single-node tree Root, with label = +
  If all Examples are negative, Return the single-node tree Root, with label = −
  If Attributes is empty, Return the single-node tree Root, with label = most common value of Target_Attribute in Examples
  Otherwise Begin
    A ← the attribute from Attributes that best* classifies Examples
    The decision attribute for Root ← A
    For each possible value, vi, of A,
      Add a new tree branch below Root, corresponding to the test A = vi
      Let Examples_vi be the subset of Examples that have value vi for A
      If Examples_vi is empty Then
        Below this new branch add a leaf node with label = most common value of Target_Attribute in Examples
      Else
        Below this new branch add the subtree ID3(Examples_vi, Target_Attribute, Attributes − {A})
  End
  Return Root
* The best attribute is the one with highest information gain
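The pseudocode translates almost line for line into Python. The following is a minimal sketch of such a translation (an illustration, not Quinlan's original implementation): training examples are assumed to be dictionaries mapping attribute names to values, and the entropy and information_gain helpers anticipate the measures defined formally in the sections that follow.

import math
from collections import Counter

def entropy(examples, target):
    """Entropy of the target-attribute labels in a list of examples (defined below)."""
    counts = Counter(ex[target] for ex in examples)
    n = len(examples)
    return sum(-(c / n) * math.log2(c / n) for c in counts.values())

def information_gain(examples, attribute, target):
    """Expected reduction in entropy from partitioning on an attribute (defined below)."""
    n = len(examples)
    remainder = 0.0
    for v in set(ex[attribute] for ex in examples):
        subset = [ex for ex in examples if ex[attribute] == v]
        remainder += (len(subset) / n) * entropy(subset, target)
    return entropy(examples, target) - remainder

def most_common_value(examples, target):
    return Counter(ex[target] for ex in examples).most_common(1)[0][0]

def id3(examples, target, attributes):
    labels = set(ex[target] for ex in examples)
    if len(labels) == 1:                 # all Examples positive (or all negative)
        return labels.pop()
    if not attributes:                   # Attributes is empty
        return most_common_value(examples, target)
    # A <- the attribute from Attributes that best classifies Examples
    best = max(attributes, key=lambda a: information_gain(examples, a, target))
    tree = {best: {}}
    # Iterate only over values of A observed in Examples, so the empty-subset
    # case of the pseudocode cannot arise in this sketch.
    for v in set(ex[best] for ex in examples):
        subset = [ex for ex in examples if ex[best] == v]
        tree[best][v] = id3(subset, target, [a for a in attributes if a != best])
    return tree

Applied to the PlayTennis training examples listed later in this section, this sketch should reproduce the tree shown in the earlier figure.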
Which Attribute Is the Best Classifier?
• The central choice in the ID3 algorithm is selecting which attribute to test at each
node in the tree.
• A statistical property called information gain measures how well a given
attribute separates the training examples according to their target classification.
• ID3 uses this information gain measure to select among the candidate attributes at
each step while growing the tree.
ENTROPY MEASURES HOMOGENEITY OF EXAMPLES
• To define information gain, we begin by defining a measure called entropy.
Entropy measures the impurity of a collection of examples.
• Given a collection S, containing positive and negative examples of some target
concept, the entropy of S relative to this Boolean classification is

    Entropy(S) = −p+ log2 p+ − p− log2 p−

Where,
p+ is the proportion of positive examples in S
p− is the proportion of negative examples in S.
Example: Entropy
• Suppose S is a collection of 14 examples of some boolean concept, including 9
positive and 5 negative examples (written [9+, 5−]). Then the entropy of S relative to this
boolean classification is

    Entropy([9+, 5−]) = −(9/14) log2(9/14) − (5/14) log2(5/14) = 0.940
• The entropy is 0 if all members of S belong to the same class
• The entropy is 1 when the collection contains an equal number of positive and
negative examples
• If the collection contains unequal numbers of positive and negative examples, the
entropy is between 0 and 1
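These three cases can be checked with a few lines of Python (an illustrative sketch; the helper simply evaluates the entropy formula for a boolean collection):

import math

def entropy(p_pos, p_neg):
    """Entropy of a boolean collection given the class proportions (0·log 0 taken as 0)."""
    return sum(-p * math.log2(p) for p in (p_pos, p_neg) if p > 0)

print(round(entropy(9/14, 5/14), 3))   # 0.94 -> unequal mix of classes
print(round(entropy(7/14, 7/14), 3))   # 1.0  -> equal numbers of + and -
print(round(entropy(14/14, 0/14), 3))  # 0.0  -> all members in one class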
INFORMATION GAIN MEASURES THE EXPECTED
REDUCTION IN ENTROPY
• Information gain is the expected reduction in entropy caused by partitioning the
examples according to an attribute.
• The information gain, Gain(S, A), of an attribute A relative to a collection of
examples S, is defined as

    Gain(S, A) = Entropy(S) − Σ_{v ∈ Values(A)} (|Sv| / |S|) Entropy(Sv)

where Values(A) is the set of all possible values for attribute A, and Sv is the subset of S
for which attribute A has value v.
Example: Information gain
Let Values(Wind) = {Weak, Strong}
S = [9+, 5−]
SWeak = [6+, 2−]
SStrong = [3+, 3−]
Information gain of attribute Wind:
Gain(S, Wind) = Entropy(S) − (8/14) Entropy(SWeak) − (6/14) Entropy(SStrong)
              = 0.94 − (8/14) × 0.811 − (6/14) × 1.00
              = 0.048
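The arithmetic above can be verified with a short, illustrative Python snippet:

import math

def entropy(pos, neg):
    """Entropy of a collection with pos positive and neg negative examples."""
    total = pos + neg
    return sum(-(c / total) * math.log2(c / total) for c in (pos, neg) if c > 0)

# S = [9+, 5-], SWeak = [6+, 2-], SStrong = [3+, 3-]
gain = entropy(9, 5) - (8/14) * entropy(6, 2) - (6/14) * entropy(3, 3)
print(round(gain, 3))   # 0.048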
An Illustrative Example
• To illustrate the operation of ID3, consider the learning task represented by the
training examples in the table below.
• Here the target attribute is PlayTennis, which can have the values Yes or No for
different days.
• Consider the first step through the algorithm, in which the topmost node of the
decision tree is created.
Training Examples:
Day Outlook Temperature Humidity Wind PlayTennis
D1 Sunny Hot High Weak No
D2 Sunny Hot High Strong No
D3 Overcast Hot High Weak Yes
D4 Rain Mild High Weak Yes
D5 Rain Cool Normal Weak Yes
D6 Rain Cool Normal Strong No
D7 Overcast Cool Normal Strong Yes
D8 Sunny Mild High Weak No
D9 Sunny Cool Normal Weak Yes
D10 Rain Mild Normal Weak Yes
D11 Sunny Mild Normal Strong Yes
D12 Overcast Mild High Strong Yes
D13 Overcast Hot Normal Weak Yes
D14 Rain Mild High Strong No
ID3 determines the information gain for each candidate attribute (i.e., Outlook,
Temperature, Humidity, and Wind), then selects the one with the highest information
gain.
The information gain values for all four attributes are
• Gain(S, Outlook) = 0.246
• Gain(S, Humidity) = 0.151
• Gain(S, Wind) = 0.048
• Gain(S, Temperature) = 0.029
• According to the information gain measure, the Outlook attribute provides the
best prediction of the target attribute, PlayTennis, over the training examples.
Therefore, Outlook is selected as the decision attribute for the root node, and
branches are created below the root for each of its possible values i.e., Sunny,
Overcast, and Rain.
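As a cross-check, these gains can be recomputed directly from the training table with a short, illustrative Python sketch (the entropy and information_gain helpers mirror the ones given with the ID3 sketch earlier):

import math
from collections import Counter

def entropy(examples, target):
    counts = Counter(ex[target] for ex in examples)
    n = len(examples)
    return sum(-(c / n) * math.log2(c / n) for c in counts.values())

def information_gain(examples, attribute, target):
    n = len(examples)
    gain = entropy(examples, target)
    for v in set(ex[attribute] for ex in examples):
        subset = [ex for ex in examples if ex[attribute] == v]
        gain -= (len(subset) / n) * entropy(subset, target)
    return gain

columns = ("Outlook", "Temperature", "Humidity", "Wind", "PlayTennis")
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"),          ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),      ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),       ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"), ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),      ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),    ("Rain", "Mild", "High", "Strong", "No"),
]
examples = [dict(zip(columns, row)) for row in rows]

for attribute in ("Outlook", "Humidity", "Wind", "Temperature"):
    print(attribute, round(information_gain(examples, attribute, "PlayTennis"), 3))
# Prints approximately: Outlook 0.247, Humidity 0.152, Wind 0.048, Temperature 0.029.
# The tiny differences from the 0.246 and 0.151 quoted above come from rounding
# Entropy(S) to 0.94 in the hand calculation; the ranking, and hence the choice
# of Outlook as the root, is the same.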