0% found this document useful (0 votes)

6 views

1 - Machine Learning Overview

Uploaded by

Dương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

1 - Machine Learning Overview

Uploaded by

Dương

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 56

Introduction to

Machine Learning
by Tran Thi Oanh

2024 VNU-IS
1
Outline
➢Machine Learning: Concepts and examples
➢Types of ML models
o Supervised
o Unsupervised
o Semi-supervised
o Reinforcement Learning
➢Steps to build a ML model
➢How to choose a good ML models
➢Q&A

2
Machine Learning: Concepts and
examples

3
4
➢What do you think is the difference between programming a machine
to follow rules and training a machine to learn from data?

➢Why do you think machines need to "learn" instead of being

explicitly programmed for every task?

5
Traditional Programming

Data
Computer Output
Program

Machine Learning
Data
Computer Program
Output

Machine learning enables computers to learn from data, recognize patterns,

and make predictions or decisions without explicit programming.
6
A Few Quotes
➢“A breakthrough in machine learning would be worth ten Microsofts”
(Bill Gates, Chairman, Microsoft)
➢“Machine learning is the next Internet”
(Tony Tether, Director, DARPA)
➢Machine learning is the hot new thing”
(John Hennessy, President, Stanford)
➢“Web rankings today are mostly a matter of machine learning”
(Prabhakar Raghavan, Dir. Research, Yahoo)
➢“Machine learning is going to result in a real revolution”
(Greg Papadopoulos, CTO, Sun)
➢“Machine learning is today’s discontinuity”
(Jerry Yang, CEO, Yahoo)

7
What is ML?
➢Machine learning (ML) is a field of study in artificial
intelligence concerned with the development and study
of statistical algorithms that can learn
from data and generalize to unseen data and thus
perform tasks without explicit instructions.

Wikipedia

8
Development of ML

9
➢Why do you think the concept of "machine learning" emerged in the
1950s, long before we had today's powerful computers?

➢How do you think the rise of big data has influenced the development
of ML?

➢What recent technological advancements have made ML more

prominent?

10
Applications in Business and Economics

11
12
Types of Machine Learning

13
➢If you were training a computer to identify objects in pictures, what
information would you need to give it?

➢What’s the difference between a problem where you know the

correct answer (supervised) versus one where you don’t
(unsupervised)?

➢How do you think a robot could learn to clean a room without being
told the rules (reinforcement learning)?

14
Types of Machine Learning
➢Supervised (inductive) learning
o Training data includes desired outputs
➢Unsupervised learning
o Training data does not include desired outputs
➢Semi-supervised learning
o Training data includes a few desired outputs
➢Reinforcement learning
o Rewards from sequence of actions

15
Supervised ML
➢Q: If you were teaching someone to tell the difference between apples
and oranges, what kinds of features would you tell them to look for?
➢A: Color, shape, texture, maybe even taste.

➢Q: What do you think the machine is looking for when distinguishing
cats from dogs?
➢A: It’s looking at patterns in the image like the shape of the ears, the
length of the fur, or even colors. The machine uses these features to
make its decision.

16
Supervised ML (2)

17
Supervised ML algorithms
➢Given many labeled
examples, the machine
gets supervision through
correct labels to detect
patterns.
➢ML algorithms: ANN, DT,
Naïve Bayes, k-NN, etc.

Source: https://www.javatpoint.com/supervised-machine-learning

18
Un-supervised ML
➢Q: Have you ever been in a situation where you didn’t know
everyone’s name but still noticed groups forming? What clues did you
use to figure out who might belong together?
➢A: I might notice how they dress or what they’re talking about.

➢Q: How do you think a machine groups data without being told what
the groups are?
➢A: The machine looks for patterns, like similarities in features (just
like you might notice how people dress or what they’re talking
about), and groups similar things together.

19
20
21
Supervised vs. un-supervised

22
Un-supervised ML algorithms
➢k-means
➢HAC
➢Etc.

23
Reinforcement Learning
➢Q:Think of a video game where you earn points for doing something right
and lose points for making mistakes. How do you improve as you keep
playing?
➢A:By learning which actions get me more points and avoiding mistakes.

➢Q: How do you think a machine learns what to do in a game-like situation?

➢A: It tries different actions and keeps track of what works. If an action
earns it points (or rewards), it does that more often, and if it loses points, it
avoids that action.

24
Reinforcement learning
➢Task
Learn how to behave successfully to achieve a goal while interacting with an
external environment
o Learn via experiences (trial and error) !
➢Examples
o Game playing: player knows whether it win or lose, but not know how to
move at each step
o Control: a traffic system can measure the delay of cars, but not know how to
decrease it.

25
RL is learning from interaction

26
Steps to build a ML model

27
Define the Problem
➢What is the task you want to solve?
o Is it a classification task (e.g., spam detection), a regression task (e.g.,
predicting house prices), or another type (e.g., clustering)?
➢What is the expected outcome?
o Decide if you want the model to make predictions (supervised learning),
discover patterns (unsupervised learning), or make decisions based on
rewards (reinforcement learning).

28
Collect and Understand the Data
➢Gather relevant data:
o Collect data from sources like databases, surveys, or APIs. The data should
relate to the problem you're solving.
➢Understand the data:
o Explore the data’s structure (rows and columns) and types of variables
(numerical, categorical, text, etc.).
➢Formulate features and labels:
o In supervised learning, decide which column will be the label (what you want
to predict) and which will be features (what helps make the prediction).

29
Clean and Prepare the Data
➢Handle missing data:
o Remove or fill in missing data (e.g., using mean, median, or more advanced
techniques).
➢Remove outliers:
o Outliers can skew results, so you may want to remove or handle them.
➢Normalize/Scale the data:
o Features like age or income may have very different ranges. Normalize them so that
all features are on a similar scale, especially for algorithms sensitive to scale (e.g.,
neural networks).
➢Encode categorical data:
o Convert categorical features (e.g., "Yes/No", "Red/Blue") into numerical format,
using techniques like one-hot encoding or label encoding.

30
Split the Data (Train/Test Split)

31
Choosing the right models

32
Train models
➢Fit the model to the training
data:
o During this phase, the model
learns patterns from the training
data to make predictions. The
model adjusts its internal
parameters based on the
features and labels in the
training set.
➢Monitor training performance:
o Keep an eye on loss metrics to
ensure the model is learning well
and not overfitting.
33
Train Models - Loss functions

34
Evaluate models

35
Tuning models
➢Principle about iterative improvement:
o Machine learning is an iterative process involving continuous refinement.
➢Principle about training and optimization:
o Learning process involves optimizing a loss function to minimize prediction
errors.

36
Source: https://www.linkedin.com/pulse/guide-hyperparameter-tuning-tushar-aggarwal
37
Key issues in choosing right ML
models

38
Model

39
Choosing the right models
➢Define the task and goal: What type of prediction or pattern
recognition is required?
o Is the problem a prediction (supervised) or pattern-finding (unsupervised)
task?
o Are the labels in the dataset available for supervised learning?
➢Analyze the data: Check for data size, features, and quality.
o How much data do you have? Is it enough to support complex models?
o What is the nature of the data—numerical, categorical, text, or image?

40
Choosing the right models
➢Choose models based on performance, complexity and scalability.
o Is the problem simple or complex? Do you need a simple model (logistic
regression, decision tree) or a more complex one (SVM, neural network)?
o How much computational power and time are available for training the
model?
o Will the model need to handle a large dataset or be deployed in real-time
systems?
➢Model interpretation
o Does the application require transparency? For example, do you need to
explain predictions to stakeholders?
o Are there regulatory requirements for model interpretability?

41
Choosing the right models
➢Optimize the model:
o Fine-tune hyperparameters and check for overfitting or underfitting.

42
Overfitting and underfitting

➢Blue point: training data

➢Red points: validation data
A well fitted model generalises well.
43
Strategies for reduce underfitting
→increase the model size or complexity
o Increase #hidden layers and #nodes in each layer
o Use many epoch to train model
o Add activation function
o Etc.

44
Strategies for reduce overfitting
Strategies normally involves adding some kind of regularisation either to the
network or the dataset.
➢Reducing batch size - adds more noise every step.
➢Reducing model size - so that model can't just 'remember' the dataset.
➢Adding regularisation to the network e.g.:
o L2 and L1 regularisation
o Dropout
o Batch normalisation
➢Early stopping
➢Collect more data or Data augmentation
45
Model Evaluation

46
Interpretation
➢Many modern ML models, especially deep learning models,
function as "black boxes," meaning they produce predictions
without providing clear explanations for how the results are
derived. This is particularly problematic in areas like
healthcare or law, where decisions can have significant
impacts on individuals.

Challenge: How can we trust and understand models when we

cannot easily interpret their decisions?
47
A recap

48
Q&A

49
1. In your own words, how would you explain what a machine learning
model is to someone who has never heard of it before? Can you give a
real-world example of how it might be used?
2. When training a machine learning model, what do you think happens if
the training data contains errors or mislabeled examples? How do you
think this would affect the model's ability to make good predictions?
3. Think of an everyday activity (e.g., recommending movies, filtering spam
emails, or detecting fake news). How do you think a machine learning
model could be used in that situation? What kind of data would the
model need to learn from, and what predictions would it make?

50
4. Imagine you trained a machine learning model to recognize dogs, but it
only ever saw pictures of one type of dog (e.g., golden retrievers). What
might happen if you asked the model to recognize a different breed, like a
poodle? Why is it important for a model to generalize rather than memorize
specific examples?
5. Given a task like recognizing handwritten numbers, what factors would
you consider when choosing the right machine learning model to use? How
might different types of models (e.g., decision trees, neural networks, etc.)
approach the task differently?
6. What are some ethical considerations we should think about when
building machine learning models? How could a model that makes
predictions affect people in ways we may not expect?

51
52
53
Questions

4
54
55
56

Machine Learning?
100% (2)
Machine Learning?
114 pages
Blood Relationship As A Basis of Inheritance Under Islamic Law A Case Study of The Inner and Outer Circles of Family
No ratings yet
Blood Relationship As A Basis of Inheritance Under Islamic Law A Case Study of The Inner and Outer Circles of Family
206 pages
Jim Holland - The Complete Book of Drum Fills
100% (12)
Jim Holland - The Complete Book of Drum Fills
66 pages
LANI NEW Applicant Form PDF
No ratings yet
LANI NEW Applicant Form PDF
2 pages
1 - Machine Learning Overview
No ratings yet
1 - Machine Learning Overview
53 pages
Introduction To ML
No ratings yet
Introduction To ML
48 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Firoz Topic 0 Ppt
No ratings yet
Firoz Topic 0 Ppt
24 pages
Notes Unit 1
No ratings yet
Notes Unit 1
13 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Tutorial Sheet1 (M.L.)
No ratings yet
Tutorial Sheet1 (M.L.)
49 pages
Chapter 5 AI
No ratings yet
Chapter 5 AI
40 pages
Lecture 1
No ratings yet
Lecture 1
65 pages
Class1-%20Introduction%20and%20foundation-1717413257735
No ratings yet
Class1-%20Introduction%20and%20foundation-1717413257735
23 pages
Study On Machine Learning Research Paper
No ratings yet
Study On Machine Learning Research Paper
17 pages
Lecture 01 Introducing ML 13102022 031101pm
No ratings yet
Lecture 01 Introducing ML 13102022 031101pm
36 pages
ML-cahp-1
No ratings yet
ML-cahp-1
35 pages
Data Management and Data Transformation, Introduction To Machine Learning
No ratings yet
Data Management and Data Transformation, Introduction To Machine Learning
54 pages
Machine Learning.
No ratings yet
Machine Learning.
50 pages
asset-v1_MKAU+SEng9032+DEV_01+type@asset+block@ChapOne
No ratings yet
asset-v1_MKAU+SEng9032+DEV_01+type@asset+block@ChapOne
29 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
DIR Notes 1
No ratings yet
DIR Notes 1
39 pages
Mlfa Autumn 22 Lec 01
No ratings yet
Mlfa Autumn 22 Lec 01
43 pages
CHP 1
No ratings yet
CHP 1
47 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-10-33
No ratings yet
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-10-33
24 pages
Module 1
No ratings yet
Module 1
175 pages
ML Lec 02 Introduction II
No ratings yet
ML Lec 02 Introduction II
22 pages
Machine Learning Batch 8 2021
100% (1)
Machine Learning Batch 8 2021
73 pages
Lecture01 Introduction To Machine Learning (Chapter1)
No ratings yet
Lecture01 Introduction To Machine Learning (Chapter1)
64 pages
Unit 1 - Machine Learning - NOTES1 - ML
No ratings yet
Unit 1 - Machine Learning - NOTES1 - ML
52 pages
lec001
No ratings yet
lec001
17 pages
University Institute of Engineering Department of Computer Science and Engg
No ratings yet
University Institute of Engineering Department of Computer Science and Engg
27 pages
Basic of Machine Learning
No ratings yet
Basic of Machine Learning
7 pages
ML Revision
No ratings yet
ML Revision
207 pages
Machine Learning in Unit-1
No ratings yet
Machine Learning in Unit-1
10 pages
Presentation1.Pptx Tanushka - Copy
No ratings yet
Presentation1.Pptx Tanushka - Copy
13 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
19 pages
Lect1 Introduction
No ratings yet
Lect1 Introduction
38 pages
Rohit Unit 1 ML Notes
No ratings yet
Rohit Unit 1 ML Notes
27 pages
ML Unit1
No ratings yet
ML Unit1
25 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
48 pages
Unit 1 - ML
No ratings yet
Unit 1 - ML
61 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
4 pages
Unit Iv Parametric Machine Learning
No ratings yet
Unit Iv Parametric Machine Learning
4 pages
Machine Learning Unit - 1
No ratings yet
Machine Learning Unit - 1
154 pages
Deep Learnng IA
No ratings yet
Deep Learnng IA
69 pages
Lecture 1
No ratings yet
Lecture 1
24 pages
Lecture 2 Introduction To ML
No ratings yet
Lecture 2 Introduction To ML
35 pages
MLP Unit-I
No ratings yet
MLP Unit-I
62 pages
Supervised and Deep Learning
No ratings yet
Supervised and Deep Learning
83 pages
Day 4-2 Compressed
No ratings yet
Day 4-2 Compressed
16 pages
Machine learning_question bank
No ratings yet
Machine learning_question bank
45 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
UNIT 1
No ratings yet
UNIT 1
38 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
12 pages
Machine: Learning ATO Z - I
No ratings yet
Machine: Learning ATO Z - I
131 pages
AI unit 1
No ratings yet
AI unit 1
36 pages
Unit-I
No ratings yet
Unit-I
23 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
15 pages
Mastering Machine Learning: A Comprehensive Guide to Success
From Everand
Mastering Machine Learning: A Comprehensive Guide to Success
Rick Spair
No ratings yet
Oral Communication - Worksheet No.3
No ratings yet
Oral Communication - Worksheet No.3
2 pages
Dhrriti Jain Mid-Term Gender&Sexuality JSJC'19
No ratings yet
Dhrriti Jain Mid-Term Gender&Sexuality JSJC'19
7 pages
Zameer Ahmad
No ratings yet
Zameer Ahmad
1 page
2013 March Medical Technologist Licensure Examination
No ratings yet
2013 March Medical Technologist Licensure Examination
4 pages
psychology syllabus cbcs, hons.and gen
No ratings yet
psychology syllabus cbcs, hons.and gen
27 pages
IBMN8D510
No ratings yet
IBMN8D510
512 pages
Ronen Et Al. - 2022 - DeepDPM Deep Clustering With an Unknown Number of Clusters
No ratings yet
Ronen Et Al. - 2022 - DeepDPM Deep Clustering With an Unknown Number of Clusters
24 pages
Cambridge Assessment International Education: Mathematics 9709/62 May/June 2019
No ratings yet
Cambridge Assessment International Education: Mathematics 9709/62 May/June 2019
15 pages
2024-2025 Courses
No ratings yet
2024-2025 Courses
23 pages
Bi Year 4 Module 9 (LP 129-144)
No ratings yet
Bi Year 4 Module 9 (LP 129-144)
17 pages
RecruitCRM - Trainee Software Engineer 2023
No ratings yet
RecruitCRM - Trainee Software Engineer 2023
3 pages
Mind Education
100% (2)
Mind Education
4 pages
Nicole Wolcott - Resume
No ratings yet
Nicole Wolcott - Resume
3 pages
11-16 Math Lesson Plan Perimeter
No ratings yet
11-16 Math Lesson Plan Perimeter
6 pages
Leadership Theory Reflection Paper - Emily Cohen
No ratings yet
Leadership Theory Reflection Paper - Emily Cohen
3 pages
For Proposal Zonio 1
No ratings yet
For Proposal Zonio 1
39 pages
Mock 2
No ratings yet
Mock 2
5 pages
Mat3004 Applied-linear-Algebra TH 1.1 47 Mat3004
No ratings yet
Mat3004 Applied-linear-Algebra TH 1.1 47 Mat3004
2 pages
Inglés María María
No ratings yet
Inglés María María
11 pages
Overview of Human Behavior in Organization
No ratings yet
Overview of Human Behavior in Organization
29 pages
Nature of Inquiry and Research
79% (14)
Nature of Inquiry and Research
18 pages
EWS Gender Neutral CSAB 2
No ratings yet
EWS Gender Neutral CSAB 2
19 pages
Nippu Jha CV
No ratings yet
Nippu Jha CV
2 pages
Math1 Q1 Week4 Day2
No ratings yet
Math1 Q1 Week4 Day2
7 pages
Mastery Level
No ratings yet
Mastery Level
1 page
Linear Mixed Models A Practical Guide Using Statistical Software 1st Edition Brady West - Download the full ebook set with all chapters in PDF format
100% (3)
Linear Mixed Models A Practical Guide Using Statistical Software 1st Edition Brady West - Download the full ebook set with all chapters in PDF format
47 pages