Decision Tree based Learning
Example
Decision tree representation (PlayTennis)
(Outlook = Sunny, Temp = Hot, Humidity = High, Wind = Strong) → No
Decision trees expressivity
Decision trees represent a disjunction of conjunctions of constraints on the attribute values:
(Outlook = Sunny ∧ Humidity = Normal)
∨ (Outlook = Overcast)
∨ (Outlook = Rain ∧ Wind = Weak)
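As a sketch of how such a tree reads as code, the PlayTennis tree above can be written as nested conditionals (the function name and string values are just illustrative, assuming the attribute names used in these slides):

def play_tennis(outlook, humidity, wind):
    """PlayTennis decision tree as nested conditionals.

    Predicts Yes exactly on the disjunction above:
    (Sunny and Normal humidity) or Overcast or (Rain and Weak wind).
    """
    if outlook == "Sunny":
        return "Yes" if humidity == "Normal" else "No"
    if outlook == "Overcast":
        return "Yes"
    if outlook == "Rain":
        return "Yes" if wind == "Weak" else "No"
    return None  # unseen attribute value

# The slide's example: Outlook=Sunny, Humidity=High, Wind=Strong -> No
print(play_tennis("Sunny", "High", "Strong"))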
Top-down induction of Decision Trees
ID3 (Quinlan, 1986) is a basic algorithm used to create decision trees.
Given a training set of examples, the algorithm for building the tree performs a search in the space of decision trees.
The construction of the tree is top-down, and the algorithm is greedy.
The fundamental question is: "which attribute should be tested next? Which attribute gives us the most information?"
Select the best attribute.
A descendant node is then created for each possible value of this attribute, and the data set is partitioned according to that value.
The process is repeated for each successor node until all the examples are classified correctly or there are no attributes left.
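A minimal sketch of this top-down, greedy procedure (the dictionary-based tree representation and helper names are illustrative, not ID3's original implementation):

from collections import Counter
from math import log2

def entropy(labels):
    """Entropy of a list of class labels."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in counts.values())

def information_gain(examples, attribute, target):
    """Expected reduction in entropy from splitting on `attribute`."""
    labels = [ex[target] for ex in examples]
    remainder = 0.0
    for v in {ex[attribute] for ex in examples}:
        subset = [ex[target] for ex in examples if ex[attribute] == v]
        remainder += len(subset) / len(examples) * entropy(subset)
    return entropy(labels) - remainder

def id3(examples, attributes, target):
    """Greedy top-down induction of a decision tree (nested dicts)."""
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:        # all examples have the same class
        return labels[0]
    if not attributes:               # no attributes left: majority class
        return Counter(labels).most_common(1)[0][0]
    # choose the attribute with the highest information gain
    best = max(attributes, key=lambda a: information_gain(examples, a, target))
    tree = {best: {}}
    for v in {ex[best] for ex in examples}:   # one successor per value
        subset = [ex for ex in examples if ex[best] == v]
        rest = [a for a in attributes if a != best]
        tree[best][v] = id3(subset, rest, target)
    return tree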
Which attribute is the best classifier?
A statistical property called information gain measures how
well a given attribute separates the training examples
Information gain uses the notion of entropy, commonly used in
information theory
Information gain = expected reduction of entropy
Entropy in binary classification
Entropy measures the impurity of a collection of examples. It
depends on the distribution of the random variable p.
S is a collection of training examples
p+ the proportion of positive examples in S
p– the proportion of negative examples in S
Entropy(S) = – p+ log2 p+ – p– log2 p–    [with the convention 0 log2 0 = 0]
Entropy ([14+, 0–]) = – 14/14 log2 (14/14) – 0 log2 (0) = 0
Entropy ([9+, 5–]) = – 9/14 log2 (9/14) – 5/14 log2 (5/14) = 0.94
Entropy([7+, 7–]) = – 7/14 log2(7/14) – 7/14 log2(7/14) = 1/2 + 1/2 = 1    [log2(1/2) = –1]
Note: the log of a number < 1 is negative; for 0 ≤ p ≤ 1, 0 ≤ entropy ≤ 1.
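A quick check of these values (a small helper, just to reproduce the numbers above):

from math import log2

def binary_entropy(pos, neg):
    """Entropy of a collection with `pos` positive and `neg` negative examples."""
    total = pos + neg
    result = 0.0
    for n in (pos, neg):
        p = n / total
        if p > 0:                      # convention: 0 * log2(0) = 0
            result -= p * log2(p)
    return result

print(binary_entropy(14, 0))            # 0.0
print(round(binary_entropy(9, 5), 2))   # 0.94
print(binary_entropy(7, 7))             # 1.0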
Entropy in general
Entropy measures the amount of information in a random
variable
H(X) = – p+ log2 p+ – p– log2 p– X = {+, –}
for binary classification [two-valued random variable]
H(X) = – Σi=1..c pi log2 pi = Σi=1..c pi log2(1/pi),    X = {1, …, c}
for classification into c classes
Example: rolling a die with 8 equally probable sides
H(X) = – Σi=1..8 (1/8) log2(1/8) = – log2(1/8) = log2 8 = 3
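The same formula for c classes, checked on the 8-sided die example (the helper name is illustrative):

from math import log2

def entropy_general(probabilities):
    """H(X) = -sum_i p_i log2 p_i for a discrete distribution."""
    return -sum(p * log2(p) for p in probabilities if p > 0)

# Fair 8-sided die: eight outcomes with probability 1/8 each -> 3 bits
print(entropy_general([1 / 8] * 8))   # 3.0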
Information gain as entropy reduction
Information gain is the expected reduction in entropy caused by
partitioning the examples on an attribute.
The higher the information gain, the more effective the attribute is
in classifying the training data.
Expected reduction in entropy knowing A
Gain(S, A) = Entropy(S) − Σv∈Values(A) (|Sv| / |S|) Entropy(Sv)
Values(A): the set of possible values for A
Sv: the subset of S for which A has value v
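The formula expressed directly in terms of the subset sizes |Sv| (a sketch; the count-based interface is an assumption made here for illustration):

from math import log2

def entropy_counts(pos, neg):
    """Entropy of a collection given its positive/negative counts."""
    total = pos + neg
    return -sum((n / total) * log2(n / total) for n in (pos, neg) if n > 0)

def gain(s_counts, subsets):
    """Gain(S, A) = Entropy(S) - sum_v (|Sv| / |S|) * Entropy(Sv).

    `s_counts` is (pos, neg) for S; `subsets` is a list of (pos, neg)
    pairs, one per value v of the attribute A.
    """
    s_size = sum(s_counts)
    remainder = sum((pos + neg) / s_size * entropy_counts(pos, neg)
                    for pos, neg in subsets)
    return entropy_counts(*s_counts) - remainder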
Example: expected information gain
Let
Values(Wind) = {Weak, Strong}
S = [9+, 5−]
SWeak = [6+, 2−]
SStrong = [3+, 3−]
Information gain due to knowing Wind:
Gain(S, Wind) = Entropy(S) − (8/14) Entropy(SWeak) − (6/14) Entropy(SStrong)
= 0.94 − (8/14) × 0.811 − (6/14) × 1.00
= 0.048
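Plugging in the counts above gives the same number (a direct arithmetic check, self-contained so it can be run on its own):

from math import log2

def h2(p, n):  # binary entropy from counts, as in the earlier sketch
    t = p + n
    return -sum((k / t) * log2(k / t) for k in (p, n) if k > 0)

# S = [9+, 5-], SWeak = [6+, 2-], SStrong = [3+, 3-]
print(round(h2(9, 5) - 8/14 * h2(6, 2) - 6/14 * h2(3, 3), 3))   # 0.048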
Which attribute is the best classifier?
First step: which attribute should be tested at the root?
Gain(S, Outlook) = 0.246
Gain(S, Humidity) = 0.151
Gain(S, Wind) = 0.048
Gain(S, Temperature) = 0.029
Outlook provides the best prediction for the target
Let's grow the tree:
add to the tree a successor for each possible value of Outlook
partition the training samples according to the value of Outlook
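To see where the four gains above come from, here is a sketch that recomputes them. The full 14-example training table is not reproduced in these slides, so the standard PlayTennis data set (Quinlan / Mitchell) is assumed; it matches the example IDs D1–D14 and, up to rounding, the gains quoted above.

from math import log2

DATA = [
    # (Day, Outlook, Temperature, Humidity, Wind, PlayTennis) -- assumed standard table
    ("D1", "Sunny", "Hot", "High", "Weak", "No"),
    ("D2", "Sunny", "Hot", "High", "Strong", "No"),
    ("D3", "Overcast", "Hot", "High", "Weak", "Yes"),
    ("D4", "Rain", "Mild", "High", "Weak", "Yes"),
    ("D5", "Rain", "Cool", "Normal", "Weak", "Yes"),
    ("D6", "Rain", "Cool", "Normal", "Strong", "No"),
    ("D7", "Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("D8", "Sunny", "Mild", "High", "Weak", "No"),
    ("D9", "Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("D10", "Rain", "Mild", "Normal", "Weak", "Yes"),
    ("D11", "Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("D12", "Overcast", "Mild", "High", "Strong", "Yes"),
    ("D13", "Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("D14", "Rain", "Mild", "High", "Strong", "No"),
]
ATTRS = {"Outlook": 1, "Temperature": 2, "Humidity": 3, "Wind": 4}

def entropy(rows):
    labels = [r[5] for r in rows]
    return -sum((labels.count(c) / len(labels)) * log2(labels.count(c) / len(labels))
                for c in set(labels))

def gain(rows, attr):
    col = ATTRS[attr]
    rem = sum(len(sub) / len(rows) * entropy(sub)
              for v in {r[col] for r in rows}
              for sub in [[r for r in rows if r[col] == v]])
    return entropy(rows) - rem

# Outlook 0.247, Temperature 0.029, Humidity 0.152, Wind 0.048
# (the slides quote 0.246 and 0.151, i.e. the same values truncated)
for a in ATTRS:
    print(a, round(gain(DATA, a), 3))
print(max(ATTRS, key=lambda a: gain(DATA, a)))   # -> Outlook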
After the first step
Second step
Working on Outlook=Sunny node:
Gain(SSunny, Humidity) = 0.970 − (3/5) × 0.0 − (2/5) × 0.0 = 0.970
Gain(SSunny, Wind) = 0.970 − (2/5) × 1.0 − (3/5) × 0.918 = 0.019
Gain(SSunny, Temp.) = 0.970 − (2/5) × 0.0 − (2/5) × 1.0 − (1/5) × 0.0 = 0.570
Humidity provides the best prediction for the target
Let's grow the tree:
add to the tree a successor for each possible value of Humidity
partition the training samples according to the value of Humidity
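A quick arithmetic check of the Sunny-branch values (the per-branch positive/negative counts are assumptions taken from the standard PlayTennis table, since the slide only quotes the fractions):

from math import log2

def h(p, n):  # binary entropy from counts
    t = p + n
    return -sum((k / t) * log2(k / t) for k in (p, n) if k > 0)

# SSunny = [2+, 3-]; assumed splits: Humidity High=[0+,3-], Normal=[2+,0-];
# Wind Weak=[1+,2-], Strong=[1+,1-]
print(round(h(2, 3), 3))                                     # 0.971
print(round(h(2, 3) - 3/5 * h(0, 3) - 2/5 * h(2, 0), 3))     # 0.971 (Humidity)
print(round(h(2, 3) - 3/5 * h(1, 2) - 2/5 * h(1, 1), 3))     # 0.02 (Wind; the slide's 0.019 uses rounded intermediates)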
Second and third steps
Leaves of the final tree:
Humidity = High: {D1, D2, D8} → No
Humidity = Normal: {D9, D11} → Yes
Wind = Weak: {D4, D5, D10} → Yes
Wind = Strong: {D6, D14} → No
Thanks