Cap6 - Data Reduction
Cap6 - Data Reduction
Cap6 - Data Reduction
Data Reduction
1. Overview
2. The Curse of Dimensionality
3. Data Sampling
4. Binning and Reduction of Cardinality
Data Reduction
1. Overview
2. The Curse of Dimensionality
3. Data Sampling
4. Binning and Reduction of Cardinality
Overview
• Data reduction techniques can be applied to
achieve a reduced representation of the data
set.
• The goal is to provide the mining process with
a mechanism to produce the same (or almost
the same) outcome when it is applied over
reduced data instead of the original data.
Overview
• Data Reduction techniques are usually
categorized into three main families: