0% found this document useful (0 votes)

10 views35 pages

Lecture 1 - Introduction To Machine Learning

The document outlines the fundamentals of machine learning, including definitions, motivations, and types of learning such as supervised, unsupervised, and reinforcement learning. It emphasizes the importance of data in training models and the role of machine learning in various applications like spam detection and credit card fraud detection. Additionally, it discusses the commercial and scientific motivations driving the growth of machine learning technologies.

Uploaded by

mdimranulhaque.hstu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views35 pages

Lecture 1 - Introduction To Machine Learning

Uploaded by

mdimranulhaque.hstu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

Outline

• Definitions
• Motivation
• Examples
• Learning Problem
• Components of Learning
• Types of Learning
• Puzzle

3
A Few Quotes
“A breakthrough in machine learning would be worth ten Microsofts”
– (Bill Gates, Chairman, Microsoft)

“Machine learning is the next Internet”

– (Tony Tether, Director, DARPA)

“Machine learning is the hot new thing”

– (John Hennessy, President, Stanford)

“Web rankings today are mostly a matter of machine learning”

– (Prabhakar Raghavan, Dir. Research, Yahoo)

“Machine learning is going to result in a real revolution”

– (Greg Papadopoulos, CTO, Sun)

“Machine learning is today’s discontinuity”

– (Jerry Yang, CEO, Yahoo)
Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 4
Machine Learning Popularity
Google Trends: Machine Learning

5
So What Is Machine Learning (Informally)?

• Automating automation
• Getting computers to program themselves
• Writing software is the bottleneck
• Let the data do the work instead!

Traditional Programming Machine Learning

6
What Is Machine Learning (Formally)?
"A computer program is said to learn from experience E with respect to some class of tasks T and
performance measure P, if its performance at tasks in T, as measured by P, improves with experience E."
--Tom M. Mitchell

"Machine learning is the training of a model from data that generalizes a decision against a performance
measure."
--Jason Brownlee

"A branch of artificial intelligence in which a computer generates rules underlying or based on raw data that
has been fed into it."
--Dictionary.com

"Machine learning is a scientific discipline that is concerned with the design and development of algorithms
that allow computers to evolve behaviors based on empirical data, such as from sensor data or databases."
--Wikipedia
7
Why “Learn”?

• Machine learning is programming computers to optimize a

performance criterion using example data or past experience.
• There is no need to “learn” to calculate payroll
• Learning is used when:
– Human expertise does not exist (navigating on Mars),
– Humans are unable to explain their expertise (speech recognition)
– Solution changes in time (routing on a computer network)

8
What We Talk About When We Talk
About“Learning”
• Learning general models from a data of particular examples

• Data is cheap and abundant (data warehouses, data marts); knowledge is

expensive and scarce.

• Build a model that is a good and useful approximation to the data.

9
ML at a Glance
• Machine Learning
– Study of algorithms that
– improve their performance
– at some task
– with experience
• Optimize a performance criterion using example data or past experience.
• Role of Statistics: Inference from a sample
• Role of Computer science: Efficient algorithms to
– Solve the optimization problem
– Representing and evaluating the model for inference

10
Commercial Motivation for Machine Learning
• Lots of data is being collected
and warehoused
– Web data, e-commerce
– purchases at department/
grocery stores
– Bank/Credit Card
transactions

• Computers have become cheaper and more powerful

• Competitive Pressure is Strong
– Provide better, customized services for an edge (e.g. in Customer Relationship
Management)
Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 11
Scientific Motivation for Machine Learning
• Data collected and stored at
enormous speeds (GB/hour)
– remote sensors on a satellite
– telescopes scanning the skies
– microarrays generating gene
expression data
– scientific simulations
generating terabytes of data
• Traditional techniques infeasible for raw data
• Machine Learning may help scientists
– in classifying and segmenting data
– in Hypothesis Formation 12
Practical Machine Learning Examples

Spam Detection Credit Card Fraud Detection Speech Recognition

Face Detection Product Recommendation Sentiment Analysis

Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 13
Example: Predicting how a viewer will rate a movie

• 10% improvement = 1 million dollar prize

• The essence of machine learning
– A pattern exists
– We cannot pin it down mathematically
– We have data on it

Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 14
Movie rating – a solution

http://work.caltech.edu/slides/slides01.pdf
15
The Learning Approach

http://work.caltech.edu/slides/slides01.pdf
16
Components of Learning
http://work.caltech.edu/slides/slides01.pdf

Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 17
Components of Learning
Formalization

http://work.caltech.edu/slides/slides01.pdf
18
http://work.caltech.edu/slides/slides01.pdf
19
Solution Components

http://work.caltech.edu/slides/slides01.pdf
20
A simple hypothesis set

http://work.caltech.edu/slides/slides01.pdf
21
Types of Learning
• Supervised (inductive) learning
– Training data includes desired outputs
• Unsupervised learning
– Training data does not include desired outputs
• Reinforcement learning
– Rewards from sequence of actions

Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 22
Supervised & Unsupervised Learning
Supervised Learning Unsupervised Learning
• Labeled dataset • Unlabeled dataset
• Establish relationship between input and • Decipher structure of the data
output • Output attributes are not defined
• Generate output for new data points • Clustering: Kmeans, DBScan, Hierarchical
• Reliable models but expensive and limited algorithms, Self Organizing Maps, etc
• Classification: Associative classifiers, • Associations: Apriori, FP-Growth, …
Decision Trees, Instance Learning,
Bayesian Learning, Kernel machines,
Neural Networks, Genetic Algorithms, etc
• Regression: Linear Regression, …
Reinforcement Learning
• Maximizing the rewards from the results
• Also called credit assessment learning
• Additional decision about rewards
• Explore the tradeoff between exploring and
exploiting the data
23
Supervised Learning: Classification
• a way to identify a grouping technique for a given dataset
• depending on a value of the target or output attribute, the entire dataset can be qualified to belong to a class
• this technique helps in identifying the data behavior patterns
Determine good or bad customers?
Total Money Spent

Total Items Purchased

All the customers who spend more than 800 dollars in a single
purchase are categorized as good customers. 24
Classification: Definition
• Given a collection of records (training set )
– Each record contains a set of attributes, one of the attributes is the class.
• Find a model for class attribute as a function of the values of
other attributes.
• Goal: previously unseen records should be assigned a class
as accurately as possible.
– A test set is used to determine the accuracy of the model.
– Usually, the given data set is divided into training and test sets
– with training set used to build the model and test set used to validate it.

25
Classification Example

Tid Refund Marital Taxable Refund Marital Taxable

Status Income Cheat Status Income Cheat

1 Yes Single 125K No No Single 75K ?

2 No Married 100K No Yes Married 50K ?
3 No Single 70K No No Married 150K ?
4 Yes Married 120K No Yes Divorced 90K ?
5 No Divorced 95K Yes No Single 40K ?
6 No Married 60K No
10
No Married 80K ? Test
7 Yes Divorced 220K No Set
8 No Single 85K Yes
9 No Married 75K No Learn
Training
10 No Single 90K Yes Model
10

Set Classifier
Supervised: Regression
• Predict a value of a given continuous valued variable based on the values of other
variables, assuming a linear or nonlinear model of dependency.
• Greatly studied in statistics, neural network fields.
• Examples:
– Predicting sales amounts of new product based on advetising expenditure.
– Predicting wind velocities as a function of temperature, humidity, air pressure,
etc.
– Time series prediction of stock market indices.

27
Unsupervised: Clustering

• Given a set of data points, each having a set of attributes,

and a similarity measure among them, find clusters such
that
– Data points in one cluster are more similar to one another.
– Data points in separate clusters are less similar to one another.
• Similarity Measures:
– Euclidean Distance if attributes are continuous.
– Other Problem-specific Measures.

28
Illustrating Clustering
Euclidean Distance Based Clustering in 3-D space.

Intracluster distances Intercluster distances

are minimized are maximized
Clustering: Application

• Document Clustering:
– Goal: To find groups of documents that are similar to each other
based on the important terms appearing in them.
– Approach: To identify frequently occurring terms in each
document. Form a similarity measure based on the frequencies of
different terms. Use it to cluster.
– Gain: Information Retrieval can utilize the clusters to relate a
new document or search term to clustered documents.
Example: Google Scholar

30
Unsupervised Learning: Association Rule Discovery:
Application

• Supermarket shelf management.

– Goal: To identify items that are bought together by sufficiently
many customers.
– Approach: Process the point-of-sale data collected with barcode
scanners to find dependencies among items.
– A classic rule --
• If a customer buys diaper and milk, then he is very likely to buy beer.
• So, don’t be surprised if you find six-packs stacked next to diapers!

31
Reinforcement Learning

• Topics:
– Policies: what actions should an agent take in a particular situation
– Utility estimation: how good is a state (used by policy)
• No supervised output but delayed reward
• Credit assignment problem (what was responsible for the outcome)
• Applications:
– Game playing
– Robot in a maze
– Multiple agents, partial observability, ...

32
A Learning Puzzle

Definitions Motivation Examples Learning Problem Components of Learning Types of Learning Puzzle 33
Takeaway

https://learning.acm.org/webinar_pdfs/PedroDomingos_FTFML_WebinarSlides.pdf
34
Most of the knowledge in the world in the future is going
to be extracted by machines and will reside in machines.
Yann LeCun, Director of AI Research, Facebook

35
The Five Tribes of Machine Learning

Pedro Domingos, University of Washington

https://learning.acm.org/webinar_pdfs/PedroDomingos_FTFML_WebinarSlides.pdf
36
Thank You.
Next Lecture - Exploratory Data Analysis

Notes Unit 1
No ratings yet
Notes Unit 1
13 pages
Resume For Accenture
No ratings yet
Resume For Accenture
1 page
Machine Learnning
No ratings yet
Machine Learnning
17 pages
Unit1 2
No ratings yet
Unit1 2
101 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Ch3-Machine Learning
No ratings yet
Ch3-Machine Learning
124 pages
Machine Learning KTU Module 1
No ratings yet
Machine Learning KTU Module 1
77 pages
ML Introduction
No ratings yet
ML Introduction
54 pages
ML 01
No ratings yet
ML 01
44 pages
DS - NLP
No ratings yet
DS - NLP
39 pages
Machine Learning
No ratings yet
Machine Learning
26 pages
Lecture 1.2 Introduction To Machine Learning
No ratings yet
Lecture 1.2 Introduction To Machine Learning
31 pages
1 Leaning Introduction
No ratings yet
1 Leaning Introduction
29 pages
Chapter 1 Introduction To Machine Learning
No ratings yet
Chapter 1 Introduction To Machine Learning
29 pages
Training Report On Machine Learning
No ratings yet
Training Report On Machine Learning
27 pages
Unit - 2 Machine Learning
No ratings yet
Unit - 2 Machine Learning
45 pages
Machine Learning - UNIT I
No ratings yet
Machine Learning - UNIT I
70 pages
UNIT I-Machine Learning
No ratings yet
UNIT I-Machine Learning
68 pages
Unit 1
No ratings yet
Unit 1
92 pages
Unit - 5.1 - Introduction To Machine Learning
No ratings yet
Unit - 5.1 - Introduction To Machine Learning
38 pages
UNIT I-Part 1
No ratings yet
UNIT I-Part 1
52 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
46 pages
Introduction To ML
100% (1)
Introduction To ML
39 pages
Ai Chapter 5
No ratings yet
Ai Chapter 5
45 pages
Topic 1
No ratings yet
Topic 1
39 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
Lecture 1
No ratings yet
Lecture 1
30 pages
5th Sem Report
No ratings yet
5th Sem Report
29 pages
I2ml3e Chap1
No ratings yet
I2ml3e Chap1
20 pages
ML Chap1
No ratings yet
ML Chap1
26 pages
Lect1 Introduction
No ratings yet
Lect1 Introduction
38 pages
ML Notes
No ratings yet
ML Notes
18 pages
Chapter 7 - Artificial Intelligence Application
No ratings yet
Chapter 7 - Artificial Intelligence Application
29 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
20 pages
AI Chapter 3 Part 1
No ratings yet
AI Chapter 3 Part 1
33 pages
ML Introduction-06!08!21
No ratings yet
ML Introduction-06!08!21
25 pages
Module 1
No ratings yet
Module 1
175 pages
Unit 3
No ratings yet
Unit 3
62 pages
Machine Learning
No ratings yet
Machine Learning
46 pages
13,14 Lecture
No ratings yet
13,14 Lecture
41 pages
Unit Iii
No ratings yet
Unit Iii
39 pages
Intro To Machine Learning
No ratings yet
Intro To Machine Learning
25 pages
Learning Paradigms
No ratings yet
Learning Paradigms
41 pages
Artificial Intelligence: Chapter 5 - Machine Learning
No ratings yet
Artificial Intelligence: Chapter 5 - Machine Learning
30 pages
Introducation To Machine and Learning Deternunistic Models
No ratings yet
Introducation To Machine and Learning Deternunistic Models
24 pages
Machine-Learning NOTE2025 2
No ratings yet
Machine-Learning NOTE2025 2
331 pages
Unit 01
No ratings yet
Unit 01
32 pages
Learning and Planning
No ratings yet
Learning and Planning
107 pages
Lecture 1 - Introduction
No ratings yet
Lecture 1 - Introduction
49 pages
01 LecIntro
No ratings yet
01 LecIntro
23 pages
Introduction To Machine Learning For Beginners
No ratings yet
Introduction To Machine Learning For Beginners
5 pages
Internship Report
No ratings yet
Internship Report
31 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
42 pages
Machine Learning
No ratings yet
Machine Learning
74 pages
What Is Machine Learning?
No ratings yet
What Is Machine Learning?
6 pages
Msa 02
No ratings yet
Msa 02
9 pages
Lecture 5 Bayesian
No ratings yet
Lecture 5 Bayesian
37 pages
Lecture 4 - Decision Tree
No ratings yet
Lecture 4 - Decision Tree
48 pages
Lecture 6 - Association Analysis
No ratings yet
Lecture 6 - Association Analysis
62 pages
Contoh Surat Lamaran Kerja
No ratings yet
Contoh Surat Lamaran Kerja
3 pages
Difference Between High Level Language and Low Level Language
No ratings yet
Difference Between High Level Language and Low Level Language
9 pages
Statistical Tests For Comparing Machine Learning Algorithms
No ratings yet
Statistical Tests For Comparing Machine Learning Algorithms
8 pages
Lis 115 Course Content Summary
No ratings yet
Lis 115 Course Content Summary
2 pages
DW-2 Marks
No ratings yet
DW-2 Marks
11 pages
TEXT BOOK:"Client/Server Survival Guide" Wiley INDIA Publication, 3 Edition, 2011. Prepared By: B.Loganathan
No ratings yet
TEXT BOOK:"Client/Server Survival Guide" Wiley INDIA Publication, 3 Edition, 2011. Prepared By: B.Loganathan
41 pages
ch11. Improving Decision Making and Managing Artificial Intelligence
No ratings yet
ch11. Improving Decision Making and Managing Artificial Intelligence
21 pages
Fundamentals of AI Answers
No ratings yet
Fundamentals of AI Answers
3 pages
Internshipreport 2
No ratings yet
Internshipreport 2
20 pages
Info Technology
No ratings yet
Info Technology
9 pages
Department of Computer Science & Engineering: Assignment-WEEK-2
No ratings yet
Department of Computer Science & Engineering: Assignment-WEEK-2
2 pages
Course File Crypto Sem I 2023 and 24
No ratings yet
Course File Crypto Sem I 2023 and 24
2 pages
Hacking Spatial Causalities in The Algor
No ratings yet
Hacking Spatial Causalities in The Algor
4 pages
Data Science Unit 1 Notes
No ratings yet
Data Science Unit 1 Notes
22 pages
Natural Language Processing
No ratings yet
Natural Language Processing
57 pages
SQL - Basics
No ratings yet
SQL - Basics
25 pages
Program
No ratings yet
Program
3 pages
DATA MANAGEMENT OFFICER II TRA Qs&AS
No ratings yet
DATA MANAGEMENT OFFICER II TRA Qs&AS
10 pages
OBIEE - Quick Guide
No ratings yet
OBIEE - Quick Guide
78 pages
Basic Database System Termonologies
No ratings yet
Basic Database System Termonologies
2 pages
DWDM Unit 2
No ratings yet
DWDM Unit 2
16 pages
Dice Resume CV Jason Azim
No ratings yet
Dice Resume CV Jason Azim
12 pages
Document From Harsh Agarwal
No ratings yet
Document From Harsh Agarwal
2 pages
Resume Mini
No ratings yet
Resume Mini
10 pages
Fundamentals Fo Database Course Outline 2023 For Stdeunts
No ratings yet
Fundamentals Fo Database Course Outline 2023 For Stdeunts
4 pages
Cartography Thesis
100% (2)
Cartography Thesis
7 pages
Software Awarded List
No ratings yet
Software Awarded List
1 page
AI Impact On Stock Market Trading
No ratings yet
AI Impact On Stock Market Trading
2 pages
Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions
No ratings yet
Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions
34 pages

Lecture 1 - Introduction To Machine Learning

Uploaded by

Lecture 1 - Introduction To Machine Learning

Uploaded by

Outline

“Machine learning is the next Internet”

“Machine learning is the hot new thing”

“Web rankings today are mostly a matter of machine learning”

“Machine learning is going to result in a real revolution”

“Machine learning is today’s discontinuity”

Traditional Programming Machine Learning

• Machine learning is programming computers to optimize a

• Data is cheap and abundant (data warehouses, data marts); knowledge is

• Build a model that is a good and useful approximation to the data.

• Computers have become cheaper and more powerful

Spam Detection Credit Card Fraud Detection Speech Recognition

Face Detection Product Recommendation Sentiment Analysis

• 10% improvement = 1 million dollar prize

Total Items Purchased

Tid Refund Marital Taxable Refund Marital Taxable

1 Yes Single 125K No No Single 75K ?

• Given a set of data points, each having a set of attributes,

Intracluster distances Intercluster distances

• Supermarket shelf management.

Pedro Domingos, University of Washington

You might also like