Lecture 6 Compress
Lecture 6 Compress
Lecture 6 Compress
Alternative names
◦Knowledge discovery (mining) in databases (KDD),
archeology, knowledge extraction, data/pattern analysis,
data dredging, information harvesting, business intelligence,
Simply put etc.
4
Potential Applications
Other Applications
◦ Text mining (news group, email, documents) and Web mining
◦ Stream data mining
◦ Bioinformatics and bio-data analysis
Ex.: Market Analysis and Management
Where does the data come from?—Credit card transactions, loyalty cards,
discount coupons, customer complaint calls, surveys …
Target marketing
◦ Find clusters of “model” customers who share the same characteristics: interest,
income level, spending habits, etc.,
◦ E.g. Most customers with income level 60k – 80k with food expenses $600 - $800 a month live in that
area
◦ Determine customer purchasing patterns over time
◦ E.g. Customers who are between 20 and 29 years old, with income of 20k – 29k usually buy this type of
CD player
6
Ex.: Market Analysis and Management (2)
Fraud detection
◦ Find outliers of unusual transactions
Financial planning
◦ Summarize and compare the resources and spending
7
KDD Process: Several Key Steps
8
A typical DM System Architecture
Database
Technology Statistics
Machine
Information Learning
Science Data Mining
Visualization Other
Disciplines
• Not all “Data Mining System” performs true data mining
machine learning system, statistical analysis (small amount of data)
Database system (information retrieval, deductive querying…)
12