
DECISION TREE

Classification And Regression Tree (CART)

J. Sai Vardhan | Batch – 130 | EMP ID: 4795


CONTENTS

1 Decision Tree
2 Algorithms involved in Decision Tree
3 CART & Its History
4 Applications
5 Steps involved in building CART
6 Hyperparameters
7 Advantages & Disadvantages


1 Decision Tree

• A decision tree is a non-parametric supervised learning algorithm.
• It is used for both classification and regression tasks.
• It has a hierarchical, tree-like structure.
• The structure consists of a root node, branches, internal (decision) nodes and leaf nodes.

[Diagram: a tree showing the root node, decision nodes, leaf nodes, and a sub-tree produced by splitting.]
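The slide's diagram is easy to recreate in code. Below is a minimal sketch, assuming scikit-learn and its bundled Iris dataset, that fits a shallow tree and prints its structure: the first test in the printout is the root node, the nested tests are decision nodes, and the "class: ..." entries are leaf nodes.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

# Fit a shallow tree so the printed structure stays small.
X, y = load_iris(return_X_y=True)
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)

# The first test is the root node, nested tests are internal
# (decision) nodes, and the "class: ..." entries are leaf nodes.
print(export_text(tree, feature_names=load_iris().feature_names))
```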


2 Algorithms involved in Decision Tree

• ID3 (Iterative Dichotomiser 3)
• C4.5 (successor of ID3)
• CART (Classification And Regression Trees)
• CHAID (Chi-square Automatic Interaction Detection)
• MARS (Multivariate Adaptive Regression Splines), which extends decision trees to handle numerical data better


3 CART & Its History

CART – CLASSIFICATION AND REGRESSION TREES

CLASSIFICATION:
INPUT: a kid with fruits
OUTPUT: will he eat them or not (a class label)

REGRESSION:
INPUT: a kid with fruits
OUTPUT: how many fruits he eats (a numeric value)


3 CART & Its History

WHY CART?

CART was introduced in 1984 by Leo Breiman, Jerome Friedman, Richard Olshen and Charles Stone.

[Photos of the four authors.]


3 CART & History

Handles only Handles Numerical and


categorical variables categorical variable

Multiway Splits Binary Splits

ID3, C4.5 CART


Not suitable for Efficiently works
Large Datasets for Large Datasets

Difficulty in handling Efficiently handles


missing data missing data

J.S A I VARDHAN | B ATCH – 130 | EMP ID : 4795


4 Applications

Where CART is used:

Medical:  Disease – Yes / No (classification); Disease severity (regression)
Banking:  Loan – Yes / No (classification); Loan amount (regression)
Telecom:  Customer churn – Yes / No (classification); Pricing strategies / Tariffs (regression)


5 Steps involved in building CART

For every candidate column in the dataset, compute the Gini impurity of the split it would produce:

Dataset → Column 1 → Gini impurity → high purity
          Column 2 → Gini impurity → mid purity
          Column 3 → Gini impurity → low purity

The column with the highest purity (lowest Gini impurity) becomes the decision node, and the process repeats on each resulting subset; a code sketch follows the Gini formula on the next slide.


5 Steps involved in building CART

GINI IMPURITY

$\mathrm{Gini} = 1 - \sum_{i=1}^{C} p_i^2$

where $C$ is the number of classes and $p_i$ is the proportion of samples belonging to the $i$-th class label.
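As promised on the previous slide, here is a minimal sketch of scoring candidate columns by Gini impurity. The dataset, the column names (colour, size) and the labels (eat/skip) are invented for illustration; both features are two-valued, so every split here is binary, as CART requires.

```python
import numpy as np

def gini_impurity(labels):
    # Gini = 1 - sum over the C classes of p_i^2
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def split_impurity(feature, labels):
    # Weighted Gini impurity of the partitions created by splitting
    # on each distinct value of a (here two-valued) feature.
    n = len(labels)
    return sum(
        (feature == v).sum() / n * gini_impurity(labels[feature == v])
        for v in np.unique(feature)
    )

# Toy data: hypothetical columns and labels, invented for illustration.
X = {
    "colour": np.array(["red", "red", "green", "green", "red"]),
    "size":   np.array(["big", "small", "big", "small", "small"]),
}
y = np.array(["eat", "eat", "skip", "skip", "eat"])

scores = {col: split_impurity(vals, y) for col, vals in X.items()}
print(scores)                                    # impurity per candidate column
print("split on:", min(scores, key=scores.get))  # lowest impurity wins
```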


6 Hyperparameters

Regressor (scikit-learn DecisionTreeRegressor)
criterion – ("squared_error", "friedman_mse", "absolute_error", "poisson")
splitter – ("best", "random")
max_depth – (maximum depth of the tree)
min_samples_split – (minimum number of samples required to split an internal node)
min_samples_leaf – (minimum number of samples required to be at a leaf node)
max_leaf_nodes – (grow a tree with at most max_leaf_nodes, in best-first fashion)
max_features – (number of features to consider when looking for the best split)
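A minimal sketch of how these hyperparameters are passed to scikit-learn's DecisionTreeRegressor; the data is synthetic and the parameter values are illustrative, not tuned recommendations.

```python
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Synthetic regression data, just to have something to fit.
X, y = make_regression(n_samples=500, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

reg = DecisionTreeRegressor(
    criterion="squared_error",  # split-quality measure
    splitter="best",            # evaluate all candidate splits
    max_depth=5,                # cap depth to limit variance
    min_samples_split=10,       # need >= 10 samples to split a node
    min_samples_leaf=5,         # every leaf keeps >= 5 samples
    max_leaf_nodes=20,          # best-first growth, at most 20 leaves
    max_features=None,          # consider all features at each split
    random_state=0,
)
reg.fit(X_train, y_train)
print("R^2 on test data:", reg.score(X_test, y_test))
```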


6 Hyperparameters

Classifier (scikit-learn DecisionTreeClassifier)
criterion – ("gini", "entropy", "log_loss")
splitter – ("best", "random")
max_depth – (maximum depth of the tree)
min_samples_split – (minimum number of samples required to split an internal node)
min_samples_leaf – (minimum number of samples required to be at a leaf node)
max_leaf_nodes – (grow a tree with at most max_leaf_nodes, in best-first fashion)
max_features – ("sqrt", "log2", a number, or None; "auto" is deprecated in recent scikit-learn versions)
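The classifier takes the same knobs; another minimal sketch, this time on the bundled Iris dataset, again with illustrative values.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = DecisionTreeClassifier(
    criterion="gini",        # the impurity measure from the earlier slides
    splitter="best",
    max_depth=4,
    min_samples_split=10,
    min_samples_leaf=5,
    max_leaf_nodes=15,
    max_features="sqrt",     # consider sqrt(n_features) candidates per split
    random_state=0,
)
clf.fit(X_train, y_train)
print("accuracy on test data:", clf.score(X_test, y_test))
```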


7 Advantages and Disadvantages

Advantages
• More robust in the presence of outliers.
• Not affected by monotonic transformations of variables (illustrated in the sketch after this list).
• Automatic handling of the following:
  – Variable selection
  – Variable interaction modelling
  – Local effect modelling
  – Nonlinear relationship modelling
  – Missing values
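A small sketch illustrating the monotonic-transformation point: a tree only compares a feature against thresholds, so fitting on a strictly monotonic transformation of that feature (here, its logarithm) induces the same partition of the training samples, and predictions on the training data should match. The data is synthetic.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(1.0, 100.0, size=(200, 1))           # strictly positive feature
y = np.sin(X[:, 0] / 10.0) + rng.normal(0, 0.1, 200)

# log is strictly monotonic, so it preserves the ordering of the
# samples -- and a tree's candidate splits depend only on that ordering.
tree_raw = DecisionTreeRegressor(max_depth=4, random_state=0).fit(X, y)
tree_log = DecisionTreeRegressor(max_depth=4, random_state=0).fit(np.log(X), y)

# Training-set predictions match because both trees induce the
# same partition of the samples.
print(np.allclose(tree_raw.predict(X), tree_log.predict(np.log(X))))  # True
```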


7 Advantages and Disadvantages

Disadvantages
• High variance: the tree is unstable, and small changes in the training data can produce a very different tree.
• Doesn't work well if the data contains many uncorrelated variables.
• The calculations involved can become complex compared to other algorithms, and training the model takes longer.


THANK YOU
