UNIT-I
Because of its learning and decision-making abilities, machine learning is often referred to as AI, though,
in reality, it is a subdivision of AI.
Until the late 1970s, it was a part of AI’s evolution. Then, it branched off to evolve on its own.
Machine learning has become a very important response tool for cloud computing and e-commerce, and
is being used in a variety of cutting-edge technologies.
Below is a brief history of machine learning and its role in data management.
Machine learning is a necessary aspect of modern business and research for many organizations today.
It uses algorithms and neural network models to assist computer systems in progressively improving
their performance.
Machine learning algorithms automatically build a mathematical model using sample data – also known as “training data” – to make decisions.
Machine learning (ML) has evolved from simple algorithms and rule-based systems to complex models
and large datasets.
1943
Walter Pitts and Warren McCulloch published the first mathematical model of a neural network.
1949
Donald Hebb published The Organization of Behavior, which presented theories on how behavior relates
to neural networks and brain activity.
1950
Alan Turing created the Turing Test to determine if a computer has real intelligence.
1956
Arthur Samuel created a checkers-playing program that learned from experience and went on to play at championship level; he coined the term "machine learning" in 1959.
1967
Cover and Hart published an article on the Nearest Neighbor Algorithm, which automatically identifies
patterns within large datasets.
2002
The release of Torch, an open-source software library, made it easier for researchers and developers to
build machine learning models.
Rise of GPUs
Graphics Processing Units (GPUs) excel at parallel processing, making them ideal for training complex
neural networks.
A Machine Learning system learns from historical data, builds prediction models, and, whenever it receives new data, predicts the output for it. The accuracy of the predicted output depends on the amount of data: a larger amount of data helps to build a better model that predicts the output more accurately.
Machine learning (ML) is a subset of artificial intelligence (AI) that allows computers to learn and improve
from data without being explicitly programmed. ML systems use algorithms to analyze large data sets,
identify patterns, and make predictions and decisions.
Machine learning accesses vast amounts of data (both structured and unstructured) and learns from it to
predict the future. It learns from the data by using multiple algorithms and techniques. Below is a
diagram that shows how a machine learns from data.
Fig. How Machine Learning works with past data
Common real-world applications of this approach include:
Fraud detection
Product recommendations
Dynamic pricing
1. Supervised Learning
2. Unsupervised Learning
3. Reinforcement Learning
These paradigms differ in the tasks they can solve and in how the data is presented to the computer.
1. Supervised Learning:
Supervised learning is a type of machine learning method in which we provide sample labeled data to the
machine learning system in order to train it, and on that basis, it predicts the output.
The system creates a model using labeled data to understand the datasets and learn about each of them; once training and processing are done, we test the model by providing sample data to check whether it predicts the correct output or not.
The goal of supervised learning is to map input data to output data. Supervised learning is based on supervision, just as a student learns under the supervision of a teacher.
In the real-world, supervised learning can be used for Risk Assessment, Image classification, Fraud
Detection, spam filtering, etc.
I. Classification -- Classification algorithms are used when the output variable is categorical, i.e., it falls into classes such as Yes-No, Male-Female, True-False, etc. Popular classification algorithms include Logistic Regression, Decision Trees, Random Forest, and Support Vector Machines.
II. Regression -- Regression algorithms are used when there is a relationship between the input variable and the output variable and the output is a continuous value, as in weather forecasting, market-trend prediction, etc. Popular regression algorithms include Linear Regression, Polynomial Regression, and Regression Trees.
-> With the help of supervised learning, the model can predict the output on the basis of prior experience.
-> In supervised learning, we can have an exact idea about the classes of objects.
-> The supervised learning model helps us to solve various real-world problems such as fraud detection and spam filtering.
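As a minimal sketch of this supervised workflow (assuming scikit-learn is installed; the tiny labeled dataset below is made up for illustration), training and prediction look like this:

# Supervised-learning sketch: train on labeled data, then predict on new data.
# Assumes scikit-learn; the small dataset below is made up for illustration.
from sklearn.tree import DecisionTreeClassifier

# Labeled training data: features (hours studied, classes attended) -> label (Pass=1 / Fail=0)
X_train = [[2, 3], [1, 1], [8, 9], [7, 8], [3, 2], [9, 10]]
y_train = [0, 0, 1, 1, 0, 1]

model = DecisionTreeClassifier()
model.fit(X_train, y_train)          # learn the mapping from inputs to labels

# Test the trained model on a new, unseen sample
print(model.predict([[6, 7]]))       # expected output: [1] (Pass)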
2. Unsupervised Learning:
“Unsupervised learning is a type of machine learning in which models are trained using an unlabelled dataset and are allowed to act on that data without any supervision.”
Unsupervised learning is a learning method in which a machine learns without any supervision.
(or)
The training is provided to the machine with the set of data that has not been labeled, classified, or
categorized, and the algorithm needs to act on that data without any supervision.
The goal of unsupervised learning is to restructure the input data into new features or a group of objects
with similar patterns.
In unsupervised learning, we don't have a predetermined result. The machine tries to find useful insights
from the huge amount of data.
Clustering -- Clustering is a method of grouping objects into clusters such that objects with the most similarities remain in one group and have few or no similarities with the objects of another group. Cluster analysis finds the commonalities between the data objects and categorizes them as per the presence and absence of those commonalities.
Association -- An association rule is an unsupervised learning method which is used for finding relationships between variables in a large database.
It determines the set of items that occur together in the dataset. Association rules make marketing strategy more effective; for example, people who buy item X (say, bread) also tend to purchase item Y (butter or jam). A typical example of an association rule is Market Basket Analysis.
K-means clustering
Hierarchical clustering
Anomaly detection
Neural Networks
Apriori algorithm
-> Unsupervised learning is used for more complex tasks as compared to supervised learning because, in
unsupervised learning, we don't have labeled input data.
-> Unsupervised learning is preferable as it is easy to get unlabeled data in comparison to labeled data.
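As a minimal sketch of clustering on unlabeled data (assuming scikit-learn; the 2-D points are made up), K-means can be used like this:

# Unsupervised-learning sketch: K-means groups unlabeled points into clusters.
# Assumes scikit-learn; the 2-D points below are illustrative only.
from sklearn.cluster import KMeans

X = [[1, 2], [1, 4], [1, 0],         # no labels are provided
     [10, 2], [10, 4], [10, 0]]

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)       # the algorithm discovers the two groups itself

print(labels)                        # e.g. [1 1 1 0 0 0] (cluster ids, not predefined classes)
print(kmeans.cluster_centers_)       # the discovered group centres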
3. Reinforcement Learning:
The goal of reinforcement learning is to train an agent to complete a task within an uncertain
environment. The agent receives observations and a reward from the environment and sends actions to
the environment. The reward measures how successful an action is with respect to completing the task goal. Typical applications include:
Automated robots
Image processing
Industrial automation
Resource management
Healthcare
Autonomous vehicles
Gaming
Recommendation systems
Rote Learning:
Rote learning is a memorization technique based on repetition. The method rests on the premise that
the recall of repeated material becomes faster the more one repeats it. Some of the alternatives to rote
learning include meaningful learning, associative learning, spaced repetition and active learning.
Rote learning is the process of memorizing specific new items as they are encountered.
The meaning of rote in ‘rote learning’ itself means learning by repetition. The process of repeating
something over and over engages the short-term memory and allows us to quickly remember basic things
like facts, dates, names, multiplication tables, etc. It differs from other forms of learning in that it doesn’t
require the learner to carefully think about something, and is rather dependent on the act of repetition
itself.
Rote learning is widely used in the mastery of foundational knowledge. Examples of school topics where
rote learning is frequently used include phonics in reading, the periodic table in chemistry, multiplication
tables in mathematics, anatomy in medicine, cases or statutes in law, basic formulae in any science, etc.
Rote learning is also used to describe a simple learning pattern used in machine learning, although it does
not involve repetition, unlike the usual meaning of rote learning. The machine is programmed to keep a
history of calculations and compare new input against its history of inputs and outputs, retrieving the
stored output if present. This pattern requires that the machine can be modeled as a pure function — always producing the same output for the same input — and can be sketched as follows:
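A minimal sketch of this store-and-retrieve pattern in plain Python (the compute function below is a made-up stand-in for any deterministic calculation):

# Rote-learning sketch: store every (input -> output) pair and reuse it when the
# same input appears again. Works only if compute() is a pure function.
history = {}

def compute(x):
    # stands in for any expensive, deterministic calculation (illustrative only)
    return x * x

def rote_learner(x):
    if x in history:                 # seen before: retrieve the stored output
        return history[x]
    result = compute(x)              # new input: calculate once ...
    history[x] = result              # ... and memorize the result
    return result

print(rote_learner(4))               # computed: 16
print(rote_learner(4))               # retrieved from history: 16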
Rote learning is considered useful for a variety of reasons. Here are a few
With rote, one can remember just about anything over time and repetition.
Rote learning allows one to recall information wholly, and even to retain it for life.
Rote learning makes it easier to score for people who find it difficult to understand or master the material they read.
On the other hand, there are a few drawbacks of rote learning to be aware of as well:
There is no connection between new and old information with rote learning.
Learning by Induction:
Inductive learning is a machine learning technique that uses a labeled dataset to train a model to
generalize and make predictions about new data:
Inductive learning involves a two-phase process: training and testing. During the training phase,
the machine learning model learns from a labeled dataset, where the input data is paired with their
corresponding outputs.
1. Training
The model analyzes a labeled dataset to learn patterns and build a generalized representation of the
data.
2. Testing
The trained model is tested on a separate dataset to evaluate its performance. The model's ability to
generalize and make accurate predictions on new data is assessed.
3. Refining
The model's hypotheses are refined based on feedback from the evaluation step. The model is updated
or revised to improve its performance and generalize better to new instances.
Inductive learning is also known as concept learning. It is a way for AI systems to derive generalized rules from specific observations.
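A minimal sketch of the training and testing phases (assuming scikit-learn and its built-in Iris dataset; the hyperparameter values are illustrative):

# Inductive-learning sketch: learn a general rule from labeled examples (training),
# then check how well it generalizes to unseen examples (testing).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)

# Phase 1: training on labeled data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
model = KNeighborsClassifier(n_neighbors=3)
model.fit(X_train, y_train)

# Phase 2: testing generalization on data the model has never seen
y_pred = model.predict(X_test)
print("test accuracy:", accuracy_score(y_test, y_pred))

# Phase 3: refining, e.g. trying a different hyperparameter and re-evaluating
model = KNeighborsClassifier(n_neighbors=7).fit(X_train, y_train)
print("refined accuracy:", accuracy_score(y_test, model.predict(X_test)))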
Applications:
Disease diagnosis,
Face recognition,
Autonomous driving.
Reinforcement Learning:
Reinforcement learning (RL) is a machine learning (ML) technique that trains software to make decisions that achieve the best possible results.
It mimics the trial-and-error learning process that humans use to achieve their goals.
(or)
Reinforcement learning (RL) is a machine learning technique that teaches software to make
decisions by using a reward-and-punishment system.
RL mimics the way humans learn through trial and error, where actions that lead to a
desired outcome are reinforced, while actions that don't are ignored.
In simple words, we can say that the output depends on the state of the current input, and the next input depends on the output of the previous input.
Typical application areas include:
Healthcare
Robotics
Marketing
Gaming
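The reward-driven trial-and-error loop described above can be sketched with tabular Q-learning (plain Python; the 5-state corridor environment and all parameter values are made up for illustration):

# Reinforcement-learning sketch: tabular Q-learning on a tiny 5-state corridor.
# The agent gets reward +1 only when it reaches the rightmost state; actions that
# lead toward that state get reinforced over the episodes.
import random

n_states, actions = 5, [0, 1]            # action 0 = move left, 1 = move right
Q = [[0.0, 0.0] for _ in range(n_states)]
alpha, gamma, epsilon = 0.5, 0.9, 0.2    # learning rate, discount, exploration

for episode in range(200):
    state = 0
    while state != n_states - 1:         # episode ends at the goal state
        if random.random() < epsilon:    # explore ...
            action = random.choice(actions)
        else:                            # ... or exploit what was learned so far
            action = 0 if Q[state][0] >= Q[state][1] else 1
        next_state = max(0, state - 1) if action == 0 else min(n_states - 1, state + 1)
        reward = 1.0 if next_state == n_states - 1 else 0.0
        # Q-learning update: reinforce actions in proportion to reward + future value
        Q[state][action] += alpha * (reward + gamma * max(Q[next_state]) - Q[state][action])
        state = next_state

print(Q)  # the "move right" values dominate, i.e. the rewarded behaviour was learned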
Types of Data:
Incorrect identification of data types leads to incorrect modeling which in turn leads to an
incorrect solution.
It refers to the set of observations or measurements that can be used to train a machine-
learning model.
The quality and quantity of data available for training and testing play a significant role in
determining the performance of a machine-learning model.
Data can be in various forms such as numerical, categorical, or time-series data, and can
come from various sources such as databases, spreadsheets, or APIs.
Here I will be discussing different types of data types with suitable examples.
1.Quantitative
2.Qualitative
1. Quantitative Data Type:
This data type consists of numerical values, i.e., anything that is measured in numbers. It is further divided into discrete and continuous data.
Discrete data: the values can be counted, and if expressed in decimal format they have no proper meaning.
E.g.: number of cars you have, number of marbles in a container, students in a class, etc.
Continuous data: numerical measures that can take any value within a certain range. If expressed in decimal format the values have true meaning. They cannot be counted but are measured, and the number of possible values is infinite.
2. Qualitative Data Type:
These are the data types that cannot be expressed in numbers. They describe categories or groups and are hence known as the categorical data type.
This type of data is recorded as either numbers or words and is usually expressed in tabular format. It can take numerical values, but mathematical operations cannot be performed on those values.
E.g.: Sunny=1, Cloudy=2, Windy=3, or binary data like 0 or 1, Good or Bad, etc.
Data that does not have a proper format is known as unstructured data. This comprises textual data, sounds, images, videos, etc.
Besides this, there are also other types, referred to as data measures (levels of measurement):
Nominal
Ordinal
Interval
Ratio
Nominal: This is used to express names or labels that have no order and cannot be measured numerically.
Ordinal: This is also a categorical data type like nominal data, but it has some natural ordering associated with it.
Interval: This is numeric data with a proper order in which the difference between values is meaningful, but there is no absolute zero; zero does not mean complete absence of the quantity. This is the local scale.
E.g., temperature measured in degrees Celsius, time of day, SAT score, credit score, pH, etc.
Ratio: This quantitative data type is the same as the interval data type but has an absolute zero. Here zero means complete absence and the scale starts from zero. This is the global scale.
E.g., temperature in Kelvin, height, weight, income, etc.
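A small pandas sketch (the column names and values are made up) showing how these measures can be represented and how an ordinal column keeps its ordering:

# Data-measures sketch: nominal, ordinal, interval and ratio columns in pandas.
# The column names and values are made up purely for illustration.
import pandas as pd

df = pd.DataFrame({
    "city":         ["Delhi", "Pune", "Delhi"],     # nominal: labels, no order
    "satisfaction": ["low", "high", "medium"],      # ordinal: ordered categories
    "temp_celsius": [31.5, 27.0, 29.2],             # interval: no true zero
    "income":       [42000, 58000, 36500],          # ratio: true zero, ratios meaningful
})

# Encode the ordinal column with its natural ordering preserved
df["satisfaction"] = pd.Categorical(df["satisfaction"],
                                    categories=["low", "medium", "high"], ordered=True)
print(df.dtypes)
print(df["satisfaction"].cat.codes.tolist())        # e.g. [0, 2, 1]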
Matching:
Matching is a process that uses machine learning algorithms to compare data sets and
identify matches between records.
The goal of data matching is to identify and compare data to find the data points that refer
to the same entity.
Data matching can help identify duplicate records, detect patterns and irregularities, and
improve the accuracy of searches
Matching algorithms can be used to pair users with products, services, or information, such
as recommending products on e-commerce platforms or matching job seekers with
opportunities
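A minimal record-matching sketch using only the Python standard library (the names and the similarity threshold are made up) that flags likely duplicate records by string similarity:

# Data-matching sketch: compare records by string similarity and flag likely
# duplicates. Uses only the standard library; names and threshold are illustrative.
from difflib import SequenceMatcher

records_a = ["John Smith", "Priya Sharma", "A. Kumar"]
records_b = ["Jon Smith", "Priya Sharma", "Rahul Verma"]

def similarity(x, y):
    return SequenceMatcher(None, x.lower(), y.lower()).ratio()

for a in records_a:
    for b in records_b:
        score = similarity(a, b)
        if score > 0.8:                      # threshold chosen for illustration
            print(f"possible match: {a!r} ~ {b!r} (score={score:.2f})")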
Stages of Machine Learning (ML Lifecycle):
To train a machine with specific data, we have to follow predefined steps, and this whole process is known as the machine learning lifecycle.
The goal of the 7 Stages framework is to break down all necessary tasks in Machine Learning
and organize them in a logical way.
1.Problem Definition
2.Data Collection
3.Data Preparation
4.Data Visualization
5.ML Modeling
6.Feature Engineering
7.Model Deployment
These 7 stages are the key steps in our framework. We have categorized them additionally
into groups to get a better understanding of the larger picture.
i. Business Value
ii. Proof of Concept (POC)
iii. Production
Phase 1 — Business Value
It is absolutely crucial to adopt a business mindset when thinking about a problem that should be solved with Machine Learning — defining customer benefits and creating business impact is the top priority.
Domain expertise and knowledge are also essential, as the true power of data can only be harnessed if the domain is well known and understood.
Phase 2 — Proof of Concept (POC)
From Data Collection to Feature Engineering, five stages of our ML framework are included here.
This phase is the core of any POC: it tests an idea in terms of its feasibility and value to the business.
Questions around performance and evaluation metrics are also answered in this phase.
Phase 3 — Production
In the third phase, one is taking the ML model and scaling it.
The goal is to integrate Machine Learning into a business process solving a problem with a
superior solution compared to, for example, traditional programming.
The process of taking a trained ML model and making its predictions available to users or
other systems is known as model deployment.
1. Problem Definition
The first stage in the DDS Machine Learning Framework is to define and understand the
problem that someone is going to solve.
Start by analyzing the goals and the why behind a particular problem statement.
Understand the power of data and how one can use it to make a change and drive results.
A few possible questions arise here, such as: What is the business problem? Why does the problem need to be solved? Is a traditional solution available to solve the problem? If the problem is probabilistic in nature, does the available data allow us to model it? What is a measurable business goal?
2. Data Collection
Once the goal is clearly defined, one has to start getting the data that is needed from
various available data sources.
A few possible questions arise here, such as: What data do I need for my project? Where is that data available? How can I obtain it? What is the most efficient way to store and access all of it?
There are many different ways to collect data that is used for Machine Learning. For
example, focus groups, interviews, surveys, and internal usage & user data.
Public data can be another source and is usually free; examples include data published by research and trade associations, banks, publicly traded corporations, and others.
If data isn’t publicly available, one could also use web scraping to get it (however, there are
some legal restrictions).
3. Data Preparation
Data Preparation can take up to 70% and sometimes even 90% of the overall project time.
But what is the purpose of this stage?
Well, the type and quality of data that is used in a Machine Learning model affects the
output considerably.
In Data Preparation one explores, pre-processes, conditions, and transforms data prior to
modeling and analysis.
Data Filtering
Data Formatting
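A small pandas sketch of typical preparation steps (the "sales" table below is made up), covering cleaning, filtering, and formatting:

# Data-preparation sketch: cleaning, filtering and formatting with pandas.
# The small "sales" table below is made up for illustration.
import pandas as pd

raw = pd.DataFrame({
    "date":   ["2023-01-05", "2023-01-06", None, "2023-01-08"],
    "amount": ["120.5", "98.0", "45.2", "-10"],
    "region": ["north", "North", "south", "north"],
})

df = raw.dropna(subset=["date"]).copy()           # cleaning: drop records with a missing date
df["date"] = pd.to_datetime(df["date"])           # formatting: string -> datetime
df["amount"] = df["amount"].astype(float)         # formatting: string -> number
df["region"] = df["region"].str.lower()           # make category values consistent
df = df[df["amount"] > 0]                         # filtering: drop invalid rows

print(df)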
4. Data Visualization
Visualization is an incredibly helpful tool to identify patterns and trends in data, which leads
to clearer understanding and reveals important insights.
Data Visualization also enables faster decision making through graphical illustration. Commonly used chart types include:
Area Chart
Bar Chart
Box-and-whisker Plots
Bubble Cloud
Heat Map
Histogram
Network Diagram
Word Cloud
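As a minimal matplotlib sketch (the values are randomly generated or made up), two of the chart types listed above can be produced like this:

# Data-visualization sketch: a histogram and a bar chart with matplotlib.
# The numbers below are made up for illustration.
import random
import matplotlib.pyplot as plt

ages = [random.gauss(35, 10) for _ in range(500)]      # e.g. customer ages
sales = {"North": 120, "South": 95, "East": 140, "West": 80}

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.hist(ages, bins=20)                                # histogram: distribution of one variable
ax1.set_title("Histogram of ages")
ax2.bar(list(sales.keys()), list(sales.values()))      # bar chart: comparison across categories
ax2.set_title("Sales by region")
plt.tight_layout()
plt.show()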
5. ML Modeling
Finally, this is where ‘the magic happens’. Machine Learning is finding patterns in data, and
one can perform either supervised or unsupervised learning.
In this stage of the process one has to apply mathematical, computer science, and business
knowledge to train a Machine Learning algorithm that will make predictions based on the
provided data
6. Feature Engineering
Machine Learning algorithms learn recurring patterns from data. Carefully engineered
features are a robust representation of those patterns.
It is a collection of methods for identifying an optimal set of inputs to the Machine Learning
algorithm. Feature Engineering is extremely important because well-engineered features
make learning possible with simple models.
7. Model Deployment
The last stage is about putting a Machine Learning model into a production environment to
make data-driven decisions in a more automated way.
Robustness, compatibility, and scalability are important factors that should be tested and
evaluated before deploying a model.
There are various deployment options, such as Platform as a Service (PaaS) or Infrastructure as a Service (IaaS).
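One common building block of deployment is persisting the trained model so that a serving process can load it later; a minimal sketch with scikit-learn and joblib (both assumed to be installed, the file name is illustrative):

# Deployment sketch: save a trained model so a production service can load it
# and serve predictions. Assumes scikit-learn and joblib; file name is illustrative.
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=5000).fit(X, y)

joblib.dump(model, "model_v1.joblib")          # training side: export the artifact

loaded = joblib.load("model_v1.joblib")        # serving side: load and predict
print(loaded.predict(X[:3]))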
Data Acquisition:
In Machine learning Data acquisition is the process of gathering and preparing data from
various sources to train a machine learning model.
It's the first step in the machine learning process and is critical for the effectiveness of the
model.
In hardware data-acquisition systems, a key component is the analog-to-digital converter (ADC), which transforms an analog signal into data that the processor can understand.
Data acquisition in Machine Learning refers to collecting, gathering, and preparing data from various sources to build and train a model. Common data sources include:
Databases: Extracting data from structured databases such as SQL or NoSQL databases.
Files: Gathering data from CSV files, Excel spreadsheets, text files, and more.
APIs: Retrieving data from Application Programming Interfaces (APIs) provided by various
online platforms.
Web Scraping: Extracting data from websites by parsing their HTML content.
Sensors and IoT Devices: Collecting data from sensors and Internet of Things (IoT) devices.
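A small sketch of pulling data from two of these sources, a CSV file and a web API, using pandas and requests (the file name "sales.csv" and the URL are placeholders, not real endpoints):

# Data-acquisition sketch: pull training data from a file and from an API.
# "sales.csv" and the URL below are placeholders used only for illustration.
import pandas as pd
import requests

# 1) Files: load a local CSV into a DataFrame
df_file = pd.read_csv("sales.csv")

# 2) APIs: request JSON records over HTTP and turn them into a table
response = requests.get("https://example.com/api/records", timeout=10)
response.raise_for_status()
df_api = pd.DataFrame(response.json())

print(df_file.shape, df_api.shape)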
Feature Engineering:
Feature engineering is a machine learning technique that transforms raw data into features that machine learning models can use for training and prediction. It can also be described as leveraging data to create new variables that are not present in the original training set.
Importance: The accuracy of a machine learning model depends on the quality of the data used for training, so feature engineering is a crucial preprocessing technique.
It involves selecting relevant information from raw data and transforming it into a format
that can be easily understood by a model.
The goal is to improve model accuracy by providing more meaningful and relevant
information.
Feature engineering is the process of transforming raw data into features that are suitable
for machine learning models.
In other words, it is the process of selecting, extracting, and transforming the most relevant
features from the available data to build more accurate and efficient machine learning
models
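A small pandas sketch (the transaction table is made up) in which new, more informative features are derived from raw columns:

# Feature-engineering sketch: derive new features from raw columns.
# The tiny transaction table is made up for illustration.
import pandas as pd

df = pd.DataFrame({
    "timestamp": pd.to_datetime(["2023-03-01 09:15", "2023-03-01 23:40", "2023-03-02 12:05"]),
    "amount":    [250.0, 4999.0, 120.0],
    "customer_avg_amount": [300.0, 310.0, 115.0],
})

# New features that expose patterns the raw columns hide
df["hour"] = df["timestamp"].dt.hour                           # time-of-day feature
df["is_night"] = (df["hour"] >= 22) | (df["hour"] <= 5)        # boolean flag
df["amount_ratio"] = df["amount"] / df["customer_avg_amount"]  # relative spend

print(df[["hour", "is_night", "amount_ratio"]])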
Data Representation:
The word data refers to information about people, things, events, or ideas. It can be a title, an integer, or any other value. After collecting data, the investigator has to condense it in tabular form to study its salient features. Such an arrangement is known as the presentation of data.
The raw data can be placed in different orders: it can be presented in ascending order, descending order, or alphabetical order.
Example: Let the marks obtained by 10 students of class V in a class test, out of
50 according to their roll numbers, be:
The data in the given form is known as raw data. It can then be arranged in serial (ascending) order, as illustrated below.
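A tiny Python sketch of this arrangement (the marks below are hypothetical, since the original example values are not reproduced here):

# Data-representation sketch: arranging raw data in serial (ascending) order.
# The marks are hypothetical, standing in for the original example values.
raw_marks = [39, 25, 47, 12, 25, 41, 33, 19, 47, 30]    # raw data, listed by roll number

serial_order = sorted(raw_marks)                         # ascending order
print("raw data:    ", raw_marks)
print("serial order:", serial_order)
print("descending:  ", sorted(raw_marks, reverse=True))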
Model Selection:
In machine learning, the process of selecting the top model or algorithm from a list of
potential models to address a certain issue is referred to as model selection.
It entails assessing and contrasting various models according to how well they function and
choosing the one that reaches the highest level of accuracy or prediction power.
Because different models have varied levels of complexity, underlying assumptions, and
capabilities, model selection is a crucial stage in the machine-learning pipeline.
Finding a model that fits the training set of data well and generalizes well to new data is the
objective.
While a model that is too complex may overfit the data and be unable to generalize, a
model that is too simple could underfit the data and do poorly in terms of prediction.
Problem formulation: Clearly express the issue at hand, including the kind of predictions or
task that you'd like the model to carry out (for example, classification, regression, or
clustering).
Candidate model selection: Pick a group of models that are appropriate for the issue at
hand. These models can include straightforward methods like decision trees or linear
regression as well as more sophisticated ones like deep neural networks, random forests, or
support vector machines.
Performance evaluation: Establish metrics for measuring how well each model performs. Common metrics include accuracy, precision, recall, F1-score, mean squared error, and the area under the receiver operating characteristic curve (AUC-ROC).
The type of problem and the particular requirements will determine which metrics are used.
Training and evaluation: Each candidate model should be trained using a subset of the
available data (the training set), and its performance should be assessed using a different
subset (the validation set or via cross-validation). The established evaluation measures are
used to gauge the model's effectiveness.
Model comparison: Evaluate the performance of various models and determine which one
performs best on the validation set. Take into account elements like data handling
capabilities, interpretability, computational difficulty, and accuracy.
Final model selection: After the models have been analysed and fine-tuned, pick the model
that performs the best. Then, this model can be used to make predictions based on fresh,
unforeseen data.
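A minimal sketch of the candidate-comparison step (assuming scikit-learn and one of its built-in datasets; the candidate list is illustrative), using cross-validation scores to pick the best model:

# Model-selection sketch: compare candidate models with cross-validation and
# keep the one with the best validation score. Assumes scikit-learn.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

X, y = load_breast_cancer(return_X_y=True)

candidates = {
    "logistic_regression": LogisticRegression(max_iter=5000),
    "decision_tree":       DecisionTreeClassifier(random_state=0),
    "random_forest":       RandomForestClassifier(random_state=0),
}

scores = {name: cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
          for name, model in candidates.items()}

best = max(scores, key=scores.get)
print(scores)
print("selected model:", best)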
Model Evaluation:
The metrics selection for the analysis varies depending on the data, algorithm, and use case.
The importance of ML model evaluation is that it ensures that production models’ performance is:
Reliable: The productionized model(s) behaves as expected. The behavioural profile of the
model is an in-depth review of how the model maps inputs to outputs—overall and with
respect to specific data slices—as defined by feature contribution, counterfactual analysis,
and fairness tests.
So we must also use some techniques to determine the predictive power of the model.
Evaluation is always good in any field, and in the case of machine learning it is best practice.
To evaluate the performance of a classification model, there are metrics as mentioned below:
Classification Accuracy
Logarithmic loss
F1 score
Precision
Recall
Confusion Matrix
Classification Accuracy:
This is calculated as the ratio of correct predictions to the total number of input Samples.
Accuracy = No. of correct predictions / Total number of input samples
For example, suppose we have 90% samples of class A and 10% samples of class B in our training set. Then our model can reach 90% training accuracy simply by predicting every sample as class A. If we test the same model on a test set with 60% class A and 40% class B samples, the accuracy falls to 60%.
Log Loss
Logarithmic loss penalizes confident but wrong predictions. For multi-class classification it is defined as:
Log Loss = -(1/N) * Σ (i=1..N) Σ (j=1..M) y_ij * log(p_ij)
where,
N : no. of samples,
M : no. of classes,
y_ij : 1 if sample i belongs to class j, otherwise 0,
p_ij : the predicted probability that sample i belongs to class j.
The AUC-ROC curve, or Area Under the Receiver Operating Characteristic curve, is a
graphical representation of the performance of a binary classification model at various
classification thresholds. It is commonly used in machine learning to assess the ability of a
model to distinguish between two classes, typically the positive class (e.g., presence of a
disease) and the negative class (e.g., absence of a disease).
AUC stands for the Area Under the Curve, and the AUC curve represents the area under the
ROC curve.
It measures the overall performance of the binary classification model. It represents the
probability with which our model can distinguish between the two classes present in our
target.
The ROC curve plots the True Positive Rate, TPR = TP / (TP + FN), against the False Positive Rate, FPR = FP / (FP + TN), at various thresholds, where TP, FP, TN, and FN are as defined for the confusion matrix below.
F1 Score:
The F1-score is a measure of a model’s performance that combines precision and recall. It is defined as the harmonic mean of precision and recall, where the best value is 1 and the worst value is 0:
F1 = 2 * (Precision * Recall) / (Precision + Recall)
Precision:
Precision is calculated as the number of true positive predictions divided by the total number of true positive and false positive predictions.
Recall:
Recall is calculated as the number of true positive predictions divided by the total number of true positive and false negative predictions.
Confusion Matrix:
It is a means of displaying the number of accurate and inaccurate instances based on the
model’s predictions. It is often used to measure the performance of classification models,
which aim to predict a categorical label for each input instance.
The matrix displays the number of instances produced by the model on the test data.
True Positive (TP): The model correctly predicted a positive outcome (the actual outcome
was positive).
True Negative (TN): The model correctly predicted a negative outcome (the actual outcome
was negative).
False Positive (FP): The model incorrectly predicted a positive outcome (the actual outcome
was negative). Also known as a Type I error.
False Negative (FN): The model incorrectly predicted a negative outcome (the actual
outcome was positive). Also known as a Type II error.
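A minimal sketch computing these metrics with scikit-learn (the true and predicted labels below are made up for illustration):

# Model-evaluation sketch: accuracy, precision, recall, F1 and the confusion
# matrix computed from illustrative true/predicted labels.
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix)

y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]    # actual outcomes (made up)
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]    # model predictions (made up)

print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred))
print("recall   :", recall_score(y_true, y_pred))
print("f1 score :", f1_score(y_true, y_pred))
print(confusion_matrix(y_true, y_pred))     # rows = actual, columns = predicted
# For binary labels 0/1 the layout is:
# [[TN FP]
#  [FN TP]]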
Model Prediction:
Predictive modelling is a process used in data science to create a mathematical model that
predicts an outcome based on input data.
It involves using statistical algorithms and machine learning techniques to analyze historical
data and make predictions about future or unknown events.
In predictive modelling, the goal is to build a model that can accurately predict the target
variable (the outcome we want to predict) based on one or more input variables (features).
The model is trained on a dataset that includes both the input variables and the known
outcome, allowing it to learn the relationships between the input variables and the target
variable.
Once the model is trained, it can be used to make predictions on new data where the target
variable is unknown.
The accuracy of the predictions can be evaluated using various metrics, such as accuracy,
precision, recall, and F1 score, depending on the nature of the problem.
Predictive modelling is used in a wide range of applications, including sales forecasting, risk
assessment, fraud detection, and healthcare.
It can help businesses make informed decisions, optimize processes, and improve outcomes
based on data-driven insights.
Decision Making:
It helps businesses and organizations make informed decisions by providing insights into
future trends and outcomes based on historical data.
Risk Management:
It helps in assessing and managing risks by predicting potential outcomes and allowing
organizations to take proactive measures.
Resource Optimization:
It helps in optimizing resources such as time, money, and manpower by providing forecasts
and insights that can be used to allocate resources more efficiently.
Customer Insights:
Competitive Advantage:
Cost Reduction:
By predicting future outcomes, organizations can reduce costs associated with errors,
inefficiencies, and unnecessary expenditures.
Finance
Risk Assessment:
Predictive modeling helps banks and financial institutions assess the creditworthiness of
individuals and businesses, making lending decisions more informed and reducing the risk of
defaults.
Fraud Detection:
By analysing patterns in transactions and account activity, predictive modeling can detect
fraudulent activities and prevent financial losses.
Healthcare
Disease Prediction:
Predictive modeling can help healthcare professionals predict the likelihood of diseases
such as diabetes, heart disease, and cancer in patients, allowing for early intervention and
personalized treatment plans.
Resource Allocation:
Hospitals and healthcare facilities can use predictive modeling to forecast patient
admissions, optimize staffing levels, and ensure the availability of resources such as beds
and medications.
Customer Segmentation:
Churn Prediction:
By analysing customer data, predictive modeling can predict which customers are likely to
churn (stop using a service or product), enabling companies to take proactive steps to retain
them.
Demand Forecasting:
Predictive modeling helps companies forecast demand for their products, ensuring that they
maintain optimal inventory levels and reduce stockouts or overstock situations.
Logistics Optimization:
By analysing historical data and external factors, predictive modeling can optimize logistics
operations, such as routing, transportation modes, and warehouse locations, to improve
efficiency and reduce costs.
Human Resources
Talent Acquisition:
Predictive modeling can help HR departments identify the best candidates for job openings
by analysing resumes, past performance, and other relevant data.
Employee Retention:
By analysing factors that contribute to employee turnover, predictive modeling can help
companies implement strategies to retain top talent and reduce turnover rates.
Search algorithms help in finding optimal solutions to specific tasks, while machine learning
algorithms enable systems to learn and adapt to new data and situations, making AI
applications more intelligent and effective.
In machine learning, search refers to the process of finding the best algorithm or model to
make the most accurate predictions or decisions based on input data.
The machine searches through many possible solutions, parameters, or models to find the
one that works best.
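The parameter search described above can be sketched with a grid search (assuming scikit-learn; the model choice and parameter grid are illustrative):

# Search-in-ML sketch: try many parameter combinations and keep the best model.
# Assumes scikit-learn; the parameter grid is illustrative.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5)   # searches all 6 combinations
search.fit(X, y)

print("best parameters:", search.best_params_)
print("best CV score  :", round(search.best_score_, 3))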
An effective machine learning search engine goes beyond simple keyword search: it automatically finds relevant content personalized for users.
Machine learning is a branch of artificial intelligence (AI) that allows computers to learn and
improve from data without being explicitly programmed.
The goal of machine learning is to create machines that can learn from data to improve the
accuracy of their output.
In the real world, we are surrounded by humans who can learn everything from their
experiences with their learning capability, and we have computers or machines which work
on our instructions.
But can a machine also learn from experiences or past data like a human does? So here
comes the role of Machine Learning.
Data Sets:
A dataset in machine learning is a collection of data that a computer treats as a single unit.
Datasets are used to train and test algorithms and models. They can be used to teach
machine learning algorithms to find patterns in the data.
Dataset is essentially the backbone for all operations, techniques or models used by
developers to interpret them.
Datasets involve a large amount of data points grouped into one table.
Datasets are used in almost all industries today for various reasons.
A Dataset is a set of data grouped into a collection with which developers can work to
meet their goals. In a dataset, the rows represent the number of data points and the
columns represent the features of the Dataset.
Fig. Dataset (rows are data points, columns are features)
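A tiny pandas sketch of such a dataset (the feature names and values are made up):

# Dataset sketch: rows are data points, columns are features (values made up).
import pandas as pd

dataset = pd.DataFrame({
    "height_cm": [160, 172, 181, 168],     # numerical feature
    "weight_kg": [55, 70, 82, 61],         # numerical feature
    "gender":    ["F", "M", "M", "F"],     # categorical feature
})

print(dataset.shape)    # (4, 3): 4 data points, 3 features
print(dataset.head())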
Types of Datasets
There are various types of datasets available out there. They are:
Numerical Dataset: These include numerical data points that can be analysed with mathematical equations.
Categorical Dataset: These include categories such as colour, gender, occupation, games,
sports and so on.
Web Dataset: These include datasets created by calling APIs using HTTP requests and
populating them with values for data analysis. These are mostly stored in JSON (JavaScript
Object Notation) formats.
Time Series Dataset: These include datasets observed over a period of time, for example, changes in geographical terrain over time.
Image Dataset: It includes a dataset consisting of images. This is mostly used to differentiate
the types of diseases, heart conditions and so on.
Ordered Dataset: These datasets contain data that are ordered in ranks, for example,
customer reviews, movie ratings and so on.
Partitioned Dataset: These datasets have data points segregated into different members or
different partitions.
File-Based Datasets: These datasets are stored in files, such as .csv or .xlsx (Excel) files.
Bivariate Dataset: In this dataset, 2 classes or features are directly correlated to each other.
For example, height and weight in a dataset are directly related to each other.
Multivariate Dataset: In these types of datasets, as the name suggests, more than two classes or features are correlated with each other.
For example, attendance and assignment grades are together correlated with a student’s overall grade.