Decision Tree Induction Algorithm


Decision tree induction algorithm:

A decision tree is a machine learning algorithm that creates a tree-like model of decisions and their
possible consequences. In classification, a decision tree is used to classify input data into one of
several possible classes. Here are the steps in how a decision tree works in classification in data
mining:

For example, in a tree for classifying vertebrates, each leaf node is assigned a class label, while the
root node uses the attribute body temperature to separate warm-blooded from cold-blooded
vertebrates. Starting from the root node, we apply the test condition to a record and follow the
branch corresponding to the outcome of the test until a leaf node is reached.
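As a minimal sketch of this traversal in Python (the nested-tuple tree representation, the attribute names, and the second-level test are assumptions added for illustration, not part of the original text):

# Classify a record by walking a hand-built tree from the root to a leaf.
def classify(record, node):
    # A leaf is represented as a plain class label (string).
    if isinstance(node, str):
        return node
    # An internal node holds a test attribute and one child per outcome.
    attribute, children = node
    return classify(record, children[record[attribute]])

# Root node: body temperature separates warm-blooded from cold-blooded vertebrates;
# the second-level test (gives_birth) is assumed here for illustration only.
tree = ("body_temperature", {
    "cold-blooded": "non-mammal",
    "warm-blooded": ("gives_birth", {"yes": "mammal", "no": "non-mammal"}),
})

print(classify({"body_temperature": "warm-blooded", "gives_birth": "yes"}, tree))  # mammal
print(classify({"body_temperature": "cold-blooded", "gives_birth": "no"}, tree))   # non-mammal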

Data Preparation: The first step is to collect and prepare the data. The data must be cleaned,
pre-processed, and formatted in a way that can be used by the decision tree algorithm.

Tree Construction: The decision tree algorithm starts by selecting the best feature to split the data.
The feature with the highest information gain or the lowest Gini index is selected as the root node.
The data is then split into subsets based on the values of this feature.
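These two steps can be sketched with scikit-learn (an assumed library choice; the tiny data frame and its column names are invented for illustration):

# Data preparation: encode categorical attributes so the algorithm can use them.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

data = pd.DataFrame({
    "body_temperature": ["warm", "warm", "cold", "cold"],
    "gives_birth":      ["yes",  "no",   "no",   "no"],
    "class":            ["mammal", "non-mammal", "non-mammal", "non-mammal"],
})
X = pd.get_dummies(data[["body_temperature", "gives_birth"]])  # one-hot encode features
y = data["class"]

# Tree construction: the splitting criterion can be information gain ("entropy")
# or the Gini index ("gini"); the best feature ends up at the root of the tree.
model = DecisionTreeClassifier(criterion="entropy")
model.fit(X, y)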

Recursive Partitioning: The algorithm then recursively repeats this process for each subset,
selecting the best feature to split the data and creating new nodes for each feature. This process is
repeated until all the data has been classified into a set of leaf nodes.

Splitting: It is a process of dividing a node into two or more sub-nodes.

Pruning: When we remove sub-nodes of a decision node, this process is called pruning. The decision
tree can be pruned to prevent overfitting, which is when the model performs well on the training
data but poorly on the testing data.
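One concrete way to prune is scikit-learn's cost-complexity pruning via the ccp_alpha parameter; this is an assumed mechanism used only to illustrate the idea, not necessarily the pruning method the text has in mind:

# Compare an unpruned tree with a cost-complexity-pruned tree on held-out data.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

full_tree   = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
pruned_tree = DecisionTreeClassifier(ccp_alpha=0.02, random_state=0).fit(X_train, y_train)

# The pruned tree has fewer nodes and often generalizes better to new data.
print(full_tree.tree_.node_count,   full_tree.score(X_test, y_test))
print(pruned_tree.tree_.node_count, pruned_tree.score(X_test, y_test))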

Prediction: Once the decision tree is constructed, it can be used to predict the target variable for
new data by traversing the tree from the root to the appropriate leaf node. At each node, the feature
value of the new data is compared to the value of the node, and the algorithm follows the
appropriate branch of the tree.

Evaluation: The final step is to evaluate the performance of the decision tree on a testing dataset.
This step is crucial to ensure that the model can generalize well to new data and is not overfitting
to the training data.
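A short sketch of prediction and evaluation with scikit-learn (the dataset and hyperparameters are arbitrary choices made for illustration):

# Train on one part of the data, predict on the held-out part, and compare accuracies.
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

tree = DecisionTreeClassifier(max_depth=3, random_state=1).fit(X_train, y_train)

# Prediction: each test record is routed from the root to a leaf, whose label is returned.
y_pred = tree.predict(X_test)

# Evaluation: a large gap between training and test accuracy suggests overfitting.
print("train accuracy:", accuracy_score(y_train, tree.predict(X_train)))
print("test accuracy: ", accuracy_score(y_test, y_pred))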
Decision trees are also capable of handling both categorical and continuous data and can handle
missing data. However, decision trees can overfit the training data, leading to poor performance on
new data, and they may not be suitable for complex data with many features.
Advantages of decision trees:
 Decision trees are able to generate understandable rules.
 Decision trees perform classification without requiring much computation.
Entropy: Entropy is a common way to measure impurity in a decision tree; it measures the impurity
of a data set.
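For reference, in terms of the class fractions p(i|t) at a node t (the standard definition, added here since the text does not spell it out): Entropy(t) = -Σ_i p(i|t) log2 p(i|t), which is 0 for a pure node and largest when the classes are evenly mixed.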

Information gain: It refers to the decline in entropy after the dataset is split on an attribute. It is
also called entropy reduction.
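The sketch below computes entropy, the Gini index, and the information gain of a split from scratch; the helper names and the toy labels are assumptions made for illustration:

# Impurity measures and information gain computed directly from class labels.
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def information_gain(parent_labels, subsets):
    # Information gain = entropy of the parent minus the weighted
    # entropy of the subsets produced by the split (entropy reduction).
    n = len(parent_labels)
    weighted = sum(len(s) / n * entropy(s) for s in subsets)
    return entropy(parent_labels) - weighted

parent = ["mammal"] * 5 + ["non-mammal"] * 5
split  = [["mammal"] * 5, ["non-mammal"] * 5]    # a perfect split on some attribute
print(entropy(parent))                  # 1.0  (maximum impurity for two classes)
print(gini(parent))                     # 0.5
print(information_gain(parent, split))  # 1.0  (entropy drops to 0 after the split)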
The skeleton decision tree induction algorithm, also known as TreeGrowth, is shown in Algorithm 3.1,
which presents pseudocode for decision tree induction. The input to this algorithm consists of the
training records E and the attribute set F. The algorithm works by recursively selecting the best
attribute to split the data (Step 7) and expanding the nodes of the tree (Steps 11 and 12) until the
stopping criterion is met (Step 1). The details of this algorithm are explained below.
1. The createNode() function extends the decision tree by creating a new node. A node in the
decision tree either has a test condition, denoted as node.test_cond, or a class label, denoted as
node.label.

2. The find_best_split() function determines which attribute should be selected as the test condition
for splitting the training records. The choice of test condition depends on which impurity measure is
used to determine the goodness of a split. Popular measures include entropy and the Gini index.

3. The Classify() function determines the class label to be assigned to a leaf node. For each leaf
node t, let p(i|t) denote the fraction of training records from class i associated with the node t. The
leaf node is assigned the class that has the majority of training records:
leaf.label = argmax_i p(i|t), where the argmax operator returns the class i that maximizes p(i|t).
Algorithm 3.1 A skeleton decision tree induction algorithm.

TreeGrowth(E, F)                              # E = training records, F = attribute set
1:  if stopping_cond(E, F) = true then        # terminate the recursion if all records have the same class label or the same attribute values
2:    leaf = createNode()                     # extend the decision tree by creating a new node holding a test condition or a class label
3:    leaf.label = Classify(E)                # determine the class label assigned to the leaf node
4:    return leaf
5:  else
6:    root = createNode()
7:    root.test_cond = find_best_split(E, F)  # select the best attribute to split the data
8:    let V = {v | v is a possible outcome of root.test_cond}
9:    for each v ∈ V do
10:     Ev = {e | root.test_cond(e) = v and e ∈ E}
11:     child = TreeGrowth(Ev, F)             # Steps 11 and 12 expand the nodes of the tree until Step 1 is met
12:     add child as descendent of root and label the edge (root → child) as v
13:   end for
14: end if
15: return root

4. The stopping_cond() function is used to terminate the tree-growing process by testing whether all
the records have the same class label or the same attribute values. After building the decision tree,
a tree-pruning step can be performed to reduce the size of the decision tree.
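For concreteness, the sketch below is one possible Python rendering of the TreeGrowth skeleton, using the Gini index inside find_best_split and a majority vote inside Classify; the record format (dicts with a "class" key) and the removal of a used attribute from F are simplifying assumptions, not part of Algorithm 3.1:

# A possible Python rendering of the TreeGrowth skeleton (Algorithm 3.1).
from collections import Counter

class Node:
    def __init__(self):
        self.test_cond = None   # attribute name used as the test condition
        self.label = None       # class label (leaf nodes only)
        self.children = {}      # outcome value -> child node

def stopping_cond(E, F):
    # Stop when all records share one class label or no attributes remain to split on
    # (a simplification of the "same attribute values" test in the original algorithm).
    return len({e["class"] for e in E}) == 1 or not F

def classify(E):
    # Assign the majority class label among the records reaching this leaf.
    return Counter(e["class"] for e in E).most_common(1)[0][0]

def gini(E):
    n = len(E)
    return 1.0 - sum((c / n) ** 2 for c in Counter(e["class"] for e in E).values())

def find_best_split(E, F):
    # Choose the attribute whose split yields the lowest weighted Gini index.
    def weighted_gini(attr):
        groups = {}
        for e in E:
            groups.setdefault(e[attr], []).append(e)
        return sum(len(g) / len(E) * gini(g) for g in groups.values())
    return min(F, key=weighted_gini)

def tree_growth(E, F):
    if stopping_cond(E, F):
        leaf = Node()
        leaf.label = classify(E)
        return leaf
    root = Node()
    root.test_cond = find_best_split(E, F)
    for v in {e[root.test_cond] for e in E}:          # each possible outcome v of the test
        Ev = [e for e in E if e[root.test_cond] == v]
        child = tree_growth(Ev, [f for f in F if f != root.test_cond])
        root.children[v] = child                      # edge (root -> child) labelled v
    return root

# Tiny illustration with invented vertebrate records.
records = [
    {"body_temperature": "warm", "gives_birth": "yes", "class": "mammal"},
    {"body_temperature": "warm", "gives_birth": "no",  "class": "non-mammal"},
    {"body_temperature": "cold", "gives_birth": "no",  "class": "non-mammal"},
]
tree = tree_growth(records, ["body_temperature", "gives_birth"])
print(tree.test_cond)   # attribute chosen as the root test condition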

Example for decision tree induction algorithm:

The training set, test set, and classifier (classification model) are given below.
