Decision Tree-Using Entropy

Decision Tree-based Learning

Example

Decision tree representation (PlayTennis)

Example instance: Outlook=Sunny, Temp=Hot, Humidity=High, Wind=Strong → No
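A minimal sketch of how such a tree can be represented and walked to classify this instance, assuming a nested-dict representation in Python (the dict layout and the classify helper are illustrative choices, not part of the slides); the tree below encodes the PlayTennis tree spelled out on the expressivity slide that follows:

# PlayTennis tree: internal nodes test one attribute, leaves hold the class label.
tree = {"Outlook": {
    "Sunny":    {"Humidity": {"High": "No", "Normal": "Yes"}},
    "Overcast": "Yes",
    "Rain":     {"Wind": {"Strong": "No", "Weak": "Yes"}},
}}

def classify(node, instance):
    # Follow the branch matching the instance's value for the tested attribute.
    while isinstance(node, dict):
        attribute, branches = next(iter(node.items()))
        node = branches[instance[attribute]]
    return node

x = {"Outlook": "Sunny", "Temp": "Hot", "Humidity": "High", "Wind": "Strong"}
print(classify(tree, x))   # -> No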

Decision trees expressivity
• Decision trees represent a disjunction of conjunctions of constraints on the values of attributes:
(Outlook = Sunny ∧ Humidity = Normal)
∨ (Outlook = Overcast)
∨ (Outlook = Rain ∧ Wind = Weak)

Top-down induction of Decision Trees
• ID3 (Quinlan, 1986) is the basic algorithm used to build decision trees.
• Given a training set of examples, the algorithm searches the space of possible decision trees.
• The tree is built top-down, and the algorithm is greedy.
• The fundamental question is: "Which attribute should be tested next? Which attribute gives us more information?"
• Select the best attribute.
• A descendant node is then created for each possible value of this attribute, and the data set is partitioned according to that value.
• The process is repeated for each successor node until all the examples are classified correctly or there are no attributes left (a sketch of this loop is given below).
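A minimal sketch of this top-down, greedy loop, assuming training examples are Python dicts and that an info_gain(examples, target, attribute) helper is available (hypothetical here; a count-based version is sketched later in the information-gain section). Names and the dict-of-dicts tree layout are illustrative:

from collections import Counter

def id3(examples, target, attributes):
    labels = [ex[target] for ex in examples]
    # Stop when the node is pure or no attributes remain; return a (majority-class) leaf.
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Greedy choice: the attribute with the highest information gain.
    best = max(attributes, key=lambda a: info_gain(examples, target, a))
    tree = {best: {}}
    for value in {ex[best] for ex in examples}:
        subset = [ex for ex in examples if ex[best] == value]
        # Recurse on each partition with the chosen attribute removed.
        tree[best][value] = id3(subset, target, [a for a in attributes if a != best])
    return tree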

Which attribute is the best classifier?
• A statistical property called information gain measures how well a given attribute separates the training examples.
• Information gain uses the notion of entropy, commonly used in information theory.
• Information gain = expected reduction in entropy.

Entropy in binary classification
• Entropy measures the impurity of a collection of examples. It depends on the distribution of the random variable p.
• S is a collection of training examples
• p+ is the proportion of positive examples in S
• p– is the proportion of negative examples in S
Entropy(S) ≡ – p+ log2 p+ – p– log2 p–        [with the convention 0 log2 0 = 0]
Entropy([14+, 0–]) = – 14/14 log2(14/14) – 0 log2(0) = 0
Entropy([9+, 5–]) = – 9/14 log2(9/14) – 5/14 log2(5/14) = 0.94
Entropy([7+, 7–]) = – 7/14 log2(7/14) – 7/14 log2(7/14) = 1/2 + 1/2 = 1        [log2(1/2) = –1]
Note: the log of a number < 1 is negative; since 0 ≤ p ≤ 1, we have 0 ≤ entropy ≤ 1.
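A minimal sketch of the two-class entropy (the function name entropy_binary is an illustrative choice), reproducing the three values above:

from math import log2

def entropy_binary(p_pos, p_neg):
    # Entropy of a two-class collection, using the convention 0 * log2(0) = 0.
    return -sum(p * log2(p) for p in (p_pos, p_neg) if p > 0)

print(round(entropy_binary(14/14, 0/14), 3))   # 0.0  -> pure collection
print(round(entropy_binary(9/14, 5/14), 3))    # 0.94 -> the [9+, 5-] training set
print(round(entropy_binary(7/14, 7/14), 3))    # 1.0  -> maximally impure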
Entropy in general
• Entropy measures the amount of information in a random variable
H(X) = – p+ log2 p+ – p– log2 p–        X = {+, –}
for binary classification [two-valued random variable]
H(X) = – Σ_{i=1..c} p_i log2 p_i = Σ_{i=1..c} p_i log2(1/p_i)        X = {1, …, c}
for classification into c classes
Example: rolling a die with 8 equally probable sides
H(X) = – Σ_{i=1..8} (1/8) log2(1/8) = – log2(1/8) = log2 8 = 3
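A sketch of the general c-class formula (the helper name entropy is illustrative), checked against the 8-sided-die example:

from math import log2

def entropy(probs):
    # H(X) = -sum(p_i * log2(p_i)) over a probability distribution, skipping zero terms.
    return -sum(p * log2(p) for p in probs if p > 0)

print(entropy([1/8] * 8))   # 3.0 bits for a fair 8-sided die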
Information gain as entropy reduction
• Information gain is the expected reduction in entropy caused by partitioning the examples on an attribute.
• The higher the information gain, the more effective the attribute is in classifying the training data.
• Expected reduction in entropy from knowing A:
Gain(S, A) = Entropy(S) – Σ_{v ∈ Values(A)} (|Sv| / |S|) Entropy(Sv)
where Values(A) is the set of possible values for A, and Sv is the subset of S for which A has value v.
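A minimal sketch of Gain(S, A) working on class-count vectors, one vector [n+, n–, …] per value of A (the helper names entropy and info_gain are illustrative):

from math import log2

def entropy(counts):
    # Entropy of a collection described by its per-class counts.
    total = sum(counts)
    return -sum(c / total * log2(c / total) for c in counts if c)

def info_gain(parent_counts, value_splits):
    # Gain(S, A) = Entropy(S) - sum over v of |Sv|/|S| * Entropy(Sv).
    n = sum(parent_counts)
    return entropy(parent_counts) - sum(sum(sv) / n * entropy(sv) for sv in value_splits)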
Example: expected information gain
• Let
  • Values(Wind) = {Weak, Strong}
  • S = [9+, 5–]
  • SWeak = [6+, 2–]
  • SStrong = [3+, 3–]
• Information gain due to knowing Wind:
Gain(S, Wind) = Entropy(S) – 8/14 Entropy(SWeak) – 6/14 Entropy(SStrong)
             = 0.94 – 8/14 × 0.811 – 6/14 × 1.00
             = 0.048
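The same arithmetic as a self-contained check (equivalently, the info_gain sketch above could be called as info_gain([9, 5], [[6, 2], [3, 3]])):

from math import log2

H_S      = -(9/14) * log2(9/14) - (5/14) * log2(5/14)    # Entropy(S)       ≈ 0.94
H_weak   = -(6/8)  * log2(6/8)  - (2/8)  * log2(2/8)     # Entropy(SWeak)   ≈ 0.811
H_strong = -(3/6)  * log2(3/6)  - (3/6)  * log2(3/6)     # Entropy(SStrong) = 1.0
print(round(H_S - 8/14 * H_weak - 6/14 * H_strong, 3))   # Gain(S, Wind)    = 0.048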
Which attribute is the best classifier?
First step: which attribute to test at the root?

• Which attribute should be tested at the root?
  • Gain(S, Outlook) = 0.246
  • Gain(S, Humidity) = 0.151
  • Gain(S, Wind) = 0.048
  • Gain(S, Temperature) = 0.029
• Outlook provides the best prediction for the target.
• Let's grow the tree:
  • add to the tree a successor for each possible value of Outlook
  • partition the training samples according to the value of Outlook (the four gains above are reproduced in the sketch below)
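These root gains can be reproduced with the info_gain sketch from above, repeated here so the snippet runs on its own. The per-value class counts are an assumption taken from the standard 14-example PlayTennis table (Mitchell, 1997), which these slides appear to follow:

from math import log2

def entropy(counts):
    total = sum(counts)
    return -sum(c / total * log2(c / total) for c in counts if c)

def info_gain(parent_counts, value_splits):
    n = sum(parent_counts)
    return entropy(parent_counts) - sum(sum(sv) / n * entropy(sv) for sv in value_splits)

S = [9, 5]   # [positive, negative] counts for the full training set
splits = {   # per-value [pos, neg] counts, assuming the standard PlayTennis table
    "Outlook":     [[2, 3], [4, 0], [3, 2]],   # Sunny, Overcast, Rain
    "Humidity":    [[3, 4], [6, 1]],           # High, Normal
    "Wind":        [[6, 2], [3, 3]],           # Weak, Strong
    "Temperature": [[2, 2], [4, 2], [3, 1]],   # Hot, Mild, Cool
}
for attribute, sv in splits.items():
    print(attribute, round(info_gain(S, sv), 3))
# Prints 0.247, 0.152, 0.048, 0.029; the slides' 0.246 and 0.151 come from
# intermediate rounding of the same computation.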
After first step
Second step
• Working on the Outlook = Sunny node:
Gain(SSunny, Humidity) = 0.970 – 3/5 × 0.0 – 2/5 × 0.0 = 0.970
Gain(SSunny, Wind) = 0.970 – 2/5 × 1.0 – 3/5 × 0.918 = 0.019
Gain(SSunny, Temp.) = 0.970 – 2/5 × 0.0 – 2/5 × 1.0 – 1/5 × 0.0 = 0.570
• Humidity provides the best prediction for the target.
• Let's grow the tree:
  • add to the tree a successor for each possible value of Humidity
  • partition the training samples according to the value of Humidity (these three gains are checked in the sketch below)
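A quick self-contained check of the three gains above, using the slide's own weights and subset entropies (variable names are illustrative; the third decimals differ slightly from the slides because the slides round the intermediate entropies):

from math import log2

# SSunny = [2+, 3-]: the five training examples with Outlook = Sunny
H_sunny = -(2/5) * log2(2/5) - (3/5) * log2(3/5)              # ≈ 0.971
print(round(H_sunny - 3/5 * 0.0 - 2/5 * 0.0, 3))              # Humidity:    0.971
print(round(H_sunny - 2/5 * 1.0 - 3/5 * 0.918, 3))            # Wind:        0.02
print(round(H_sunny - 2/5 * 0.0 - 2/5 * 1.0 - 1/5 * 0.0, 3))  # Temperature: 0.571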
Second and third steps

[Figure: the finished tree. Leaf partitions: {D1, D2, D8} → No, {D9, D11} → Yes, {D4, D5, D10} → Yes, {D6, D14} → No]
Thanks
