
EXAMPLE ML in real life

The document outlines three examples of machine learning applications: house price prediction, book genre exploration, and spill detection from video. Each example details the steps involved, including problem definition, dataset building, model training, evaluation, and inference, emphasizing the importance of data preprocessing and model selection. Key concepts such as linear models, k-means clustering, and convolutional neural networks are introduced to illustrate the methodologies used in these tasks.


EXAMPLE 1 House price prediction

Step One: Defining the Problem

Step Two: Building a Dataset

• Data collection: You collect numerous examples of homes sold in your neighbourhood within
the past year, and pay a real estate appraiser to appraise the homes whose selling price is not
known.
• Data exploration: You confirm that all of your data is numerical because most machine
learning models operate on sequences of numbers. If there is textual data, you need to
transform it into numbers. You'll see this in the next example.
• Data cleaning: Look for things such as missing information or outliers, such as the 10-room
mansion. Several techniques can be used to handle outliers, but you can also just remove
those from your dataset.
• Data visualization: You can plot home values against each of your input variables to
look for trends in your data. In the following chart, you see that as lot size increases,
the house value also increases (a short pandas sketch of these data preparation steps follows this list).
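These data preparation steps can be sketched in a few lines of pandas and matplotlib. This is a minimal illustration, not the lesson's actual code; the file name homes.csv and the column names lot_size, num_rooms, and sale_price are assumptions.

import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical dataset of homes sold in the neighbourhood (file and column names assumed).
homes = pd.read_csv("homes.csv")

# Data exploration: confirm every column is numerical and look for missing information.
print(homes.dtypes)
print(homes.isna().sum())

# Data cleaning: drop rows with missing values and remove outliers such as the 10-room mansion.
homes = homes.dropna()
homes = homes[homes["num_rooms"] < 10]

# Data visualization: plot home value against lot size to look for trends.
homes.plot.scatter(x="lot_size", y="sale_price")
plt.show()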

Step Three: Model Training


Prior to actually training your model, you need to split your data. The standard practice is to
put 80% of your dataset into a training dataset and 20% into a test dataset.
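A minimal sketch of that 80/20 split with scikit-learn, continuing with the hypothetical homes DataFrame from the previous sketch:

from sklearn.model_selection import train_test_split

# Input variables and the value we want to predict (column names assumed).
X = homes[["lot_size", "num_rooms"]]
y = homes["sale_price"]

# 80% of the data goes into the training dataset, 20% into the test dataset.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)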

Linear model selection


As you see in the preceding chart, when lot size increases, home values increase too. This
relationship is simple enough that a linear model can be used to represent this relationship.

A linear model across a single input variable can be represented as a line. It becomes a plane
for two variables, and a hyperplane for more than two variables. The intuition of a straight line
with a constant slope carries over to these higher-dimensional cases.
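As a sketch, fitting such a linear model with scikit-learn might look like the following, continuing with the split from the previous sketch:

from sklearn.linear_model import LinearRegression

# With one input variable the fitted model is a line, with two a plane,
# and with more than two a hyperplane.
model = LinearRegression()
model.fit(X_train, y_train)

print(model.coef_)       # one slope per input variable
print(model.intercept_)  # the model's baseline value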

Step Four: Evaluation


One of the most common evaluation metrics in a regression scenario is called root mean
square error, or RMSE. The math is beyond the scope of this lesson, but RMSE can be thought of roughly as
the "average error" across your test dataset, so you want this value to be low.

The math behind RMSE


In the following chart, you can see where the data points are in relation to the blue line. You want
the data points to be as close to the "average" line as possible, which would mean less net error.
You compute the root mean square error between your model's predictions for the data points in your
test dataset and the true values from your data. The actual calculation is beyond the scope of this
lesson, but it's good to understand the process at a high level.
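Although the underlying math is out of scope for the lesson, the calculation itself is short. A minimal sketch, continuing with the fitted model and test dataset from the previous sketches:

import numpy as np

# RMSE = square root of the mean of the squared differences
# between predictions and true values.
predictions = model.predict(X_test)
rmse = np.sqrt(np.mean((predictions - y_test) ** 2))
print(f"RMSE: {rmse:,.0f}")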

Interpreting Results
In general, as your model improves, you see a lower RMSE. You may still not be confident
about whether the specific value you've computed is good or bad.

Many machine learning engineers manually count how many predictions were off by a threshold
(for example, $50,000 in this house pricing problem) to help determine and verify the model's
accuracy.
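A sketch of that manual check, counting how many test predictions were off by more than a $50,000 threshold (continuing with the predictions from the previous sketch):

import numpy as np

threshold = 50_000
errors = np.abs(predictions - y_test)

num_off = int((errors > threshold).sum())
print(f"{num_off} of {len(y_test)} predictions were off by more than ${threshold:,}")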

Step Five: Inference: Try out your model


Now you are ready to put your model into action. As you can see in the following image, this
means seeing how well it predicts with new data not seen during model training.
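A minimal inference sketch, assuming a hypothetical new home with a 6,000 square-foot lot and 3 rooms; the specific values are illustrative only:

import pandas as pd

# New data not seen during model training, with the same columns used for training.
new_home = pd.DataFrame({"lot_size": [6000], "num_rooms": [3]})
predicted_price = model.predict(new_home)[0]
print(f"Predicted price: ${predicted_price:,.0f}")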

Terminology

• Continuous: Floating-point values with an infinite range of possible values. The opposite
of categorical or discrete values, which take on a limited number of possible values.
• Hyperplane: A mathematical term for a flat surface, like a plane, in a space with more than three dimensions.
• Plane: A mathematical term for a flat surface (like a piece of paper) on which two points
can be joined by a straight line.
• Regression: A common task in supervised machine learning in which the model predicts a continuous numerical value, such as a house price.

EXAMPLE 2 Book Genre Exploration

Step One: Define the Problem

Step Two: Build your Dataset


To test the hypothesis, you gather book description text for 800 romance books published in the
current year.
Before you can train the model, you need to do some data pre-processing, called data vectorization, to
convert text into numbers.
You transform this book description text into what is called a bag of words representation, shown
in the following image, so that it is understandable by machine learning models.
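A minimal sketch of this vectorization step using scikit-learn's CountVectorizer. The two short descriptions below are invented placeholders standing in for the 800 real book descriptions:

from sklearn.feature_extraction.text import CountVectorizer

# Placeholder strings standing in for the 800 gathered book descriptions.
descriptions = [
    "They kept their love alive across two cities and three time zones.",
    "A small-town baker falls for the food critic who panned her shop.",
]

# Bag of words: each row is one book, each column counts one word.
vectorizer = CountVectorizer(stop_words="english")
bag_of_words = vectorizer.fit_transform(descriptions)

print(bag_of_words.shape)
print(vectorizer.get_feature_names_out()[:10])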
Step Three: Train the Model
You pick a common cluster-finding model called k-means. In this model, you can change a
model parameter, k, to be equal to how many clusters the model will try to find in your
dataset.
Your data is unlabelled: you don't know how many microgenres might exist. So you train your
model multiple times, using a different value for k each time (a minimal sketch of this loop follows the figure below).

Clustering results for K=2 and K=3
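A minimal sketch of this training loop with scikit-learn's KMeans, assuming bag_of_words is the matrix built from all 800 descriptions in the previous sketch:

from sklearn.cluster import KMeans

# Train the model multiple times, changing the model parameter k each time.
models = {}
for k in range(2, 21):
    model = KMeans(n_clusters=k, random_state=42, n_init=10)
    model.fit(bag_of_words)
    models[k] = model

# models[2].labels_ holds the cluster assigned to each book when k=2, and so on.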
Step Four: Model Evaluation
In machine learning, numerous statistical metrics or methods are available to evaluate a
model. In this use case, the silhouette coefficient is a good choice. This metric describes how
well your data was clustered by the model. To find the optimal number of clusters, you plot
the silhouette coefficient against k, as shown in the following chart. You find the optimal value is
at k=19.
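A sketch of computing and plotting the silhouette coefficient for each value of k, continuing from the previous sketch:

import matplotlib.pyplot as plt
from sklearn.metrics import silhouette_score

# A higher silhouette coefficient means better-separated clusters.
ks = sorted(models)
scores = [silhouette_score(bag_of_words, models[k].labels_) for k in ks]

plt.plot(ks, scores, marker="o")
plt.xlabel("k (number of clusters)")
plt.ylabel("Silhouette coefficient")
plt.show()
# The optimal k (k=19 in this example) is read off the peak of this curve.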

Step Five: Inference (Use the Model)


As you inspect the different clusters found when k=19, you find a surprisingly large cluster of
books. Here's an example from fictionalized cluster #7.

Clustered data

As you inspect the preceding table, you can see that most of these text snippets are indicating
that the characters are in some kind of long-distance relationship. You see a few other self-
consistent clusters and feel you now have enough useful data to begin writing an article on
unexpected modern romance microgenres.
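A sketch of how the descriptions assigned to a single cluster could be pulled out for this kind of manual inspection, continuing from the earlier sketches (cluster 7 mirrors the fictionalized example above):

# One cluster label per book, from the k=19 model.
labels = models[19].labels_

cluster_id = 7
cluster_books = [text for text, label in zip(descriptions, labels) if label == cluster_id]

print(f"{len(cluster_books)} books in cluster {cluster_id}")
for snippet in cluster_books[:5]:
    print("-", snippet[:80])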

***********************************************************************************

EXAMPLE 3 Spill Detection from Video


Step One: Defining the Problem

Detecting spills with machine learning

This is a supervised classification task, as shown in the following image. Your goal will be to
predict whether each image belongs to one of the following classes:

• Contains spill
• Does not contain spill

Step Two: Building a Dataset


• Collecting: Using historical data, as well as safely staged spills, you quickly build a collection of
images that contain both spills and non-spills in multiple lighting conditions and
environments.
• Exploring and cleaning: You go through all the photos to ensure the spill is clearly in the shot. There are
Python tools and other techniques available to improve image quality, which you
can use later if you determine a need to iterate.
• Data vectorization (converting to numbers): Many models require numerical data, so all your image data needs to be
transformed into a numerical format. Python tools can help you do this
automatically. In the following image, you can see how each pixel in the image on the left can be
represented in the image on the right by a number between 0 and 1, with 0 being
completely black and 1 being completely white (a minimal sketch of this conversion follows this list).

Chemical spill image and its numeric representation


• Splitting the data: You split your image data into a training dataset and a test dataset.
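A minimal sketch of the pixel-to-number conversion described in the vectorization step, using Pillow and NumPy; the file name spill_001.jpg is an assumption:

import numpy as np
from PIL import Image

# Load one photo, convert it to grayscale, and scale every pixel to the 0-1 range:
# 0 is completely black and 1 is completely white.
image = Image.open("spill_001.jpg").convert("L")
pixels = np.asarray(image, dtype=np.float32) / 255.0

print(pixels.shape)                # height x width
print(pixels.min(), pixels.max())  # values now lie between 0 and 1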

Step Three: Model Training

Traditionally, solving this problem would require hand-engineering features on top of the
underlying pixels (for example, locations of prominent edges and corners in the image), and
then training a model on these features.

Today, deep neural networks are the most common tool used for solving this kind of
problem. Many deep neural network models are structured to learn the features on top of
the underlying pixels, so you don't have to hand-engineer them. You'll have a chance to take a deeper
look at this in the next lesson, so we'll keep things high-level for now.

CNN (convolutional neural network)


Neural networks are beyond the scope of this lesson, but you can think of them as a
collection of very simple models connected together. These simple models are called neurons,
and the connections between these models are trainable model parameters called weights.
Convolutional neural networks are a special type of neural network particularly good at
processing images.
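The lesson keeps neural networks high-level, but purely as an illustration, a small convolutional neural network for the two spill classes could be sketched with Keras as follows. The layer sizes and the 128x128 grayscale input shape are assumptions, not the lesson's actual model.

from tensorflow import keras
from tensorflow.keras import layers

# Convolution layers learn features from the raw pixels; the final layer
# outputs the probability that an image contains a spill.
model = keras.Sequential([
    keras.Input(shape=(128, 128, 1)),        # 128x128 grayscale images (assumed)
    layers.Conv2D(16, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(1, activation="sigmoid"),   # 1 = contains spill, 0 = does not
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])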

Step Four: Model Evaluation


There are many different statistical metrics you can use to evaluate your model. As you gain
more experience in machine learning, you will learn how to research which metrics can help
you evaluate your model most effectively. Here's a list of common metrics:

• Accuracy
• Confusion matrix
• F1 Score
• Log Loss
• Precision
• Recall
• False positive rate
• False negative rate
• Negative predictive value
• Specificity
• ROC curve

For this particular use case, precision and recall are effective metrics. You can think
of precision as answering the question, "Of all predictions of a spill, how many were right?"
and recall as answering the question, "Of all actual spills, how many did we detect?"
Manual evaluation also plays an important role. You are unsure whether your staged spills are
sufficiently realistic compared to actual spills. To get a better sense of how well your model
performs with actual spills, you find additional examples from historical records. This allows
you to confirm that your model is performing satisfactorily.
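A sketch of computing these two metrics with scikit-learn, using a tiny set of made-up labels purely to show the calls (1 = contains spill, 0 = does not contain spill):

from sklearn.metrics import confusion_matrix, precision_score, recall_score

# Made-up labels, only to illustrate the metric functions.
y_true = [0, 0, 1, 1, 0, 1, 0, 0, 1, 0]   # actual spills
y_pred = [0, 1, 1, 0, 0, 1, 0, 0, 1, 0]   # model predictions

# Precision: of all predictions of a spill, how many were right?
print("precision:", precision_score(y_true, y_pred))
# Recall: of all actual spills, how many did we detect?
print("recall:", recall_score(y_true, y_pred))
print(confusion_matrix(y_true, y_pred))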

Step Five: Model Inference


The model can be deployed on a system that enables you to run machine learning workloads
such as AWS Panorama.
Thankfully, most of the time, the results will be from the class 'Does not contain spill.'

No spill detected

But, when the class 'Contains spill' is detected, a simple paging system could alert the team to
respond.

Spill detected
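As an illustration only (this is not the AWS Panorama API), a hypothetical piece of glue code around the deployed model might page the team like this; predict_spill_probability and notify_team are invented placeholder functions.

CLASSES = ["Does not contain spill", "Contains spill"]

def handle_frame(frame, predict_spill_probability, notify_team):
    """Classify one video frame and alert the team only when a spill is detected."""
    probability = predict_spill_probability(frame)   # assumed to return P(contains spill)
    predicted_class = CLASSES[int(probability >= 0.5)]
    if predicted_class == "Contains spill":
        notify_team(f"Spill detected (confidence {probability:.0%})")
    return predicted_class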
