
Unit – II

Acting Under Uncertainty:


Uncertainty:
• The knowledge representation A → B means that if A is true then B is true; in a situation where we are not sure whether A is true, we cannot express this statement. Such a situation is called uncertainty.
• Agents must act under Uncertainty.

Causes for uncertainty:

• Information obtained from unreliable sources


• Experimental Errors
• Equipment Fault
• Temperature variation
• Climate change

Probabilistic Reasoning:

• It is a way of knowledge representation in which the concept of probability is applied to indicate the uncertainty in knowledge.
Need for Probabilistic Reasoning in AI:
✓ When there are unpredictable outcomes
✓ When an unknown error occurs during an experiment

Ways to solve problems with uncertain knowledge:

✓ Bayes' rule
✓ Bayesian Statistics

Probability:

• It can be defined as the chance that an uncertain event will occur.


• The value of probability always remains between 0 and 1.
• 0 ≤ P(A) ≤ 1, where P(A) is the probability of an event A.
• P(A) = 0, indicates total uncertainty in an event A
• P(A) =1, indicates total certainty in an event A.

Event: Each possible outcome of a variable is called an event.


Sample Space: The collection of all possible events is called sample space.
Random variables: Random variables are used to represent events and objects in the real world.
Prior Probability: It is the probability computed before observing new information.
Posterior Probability: It is the probability calculated after all information has been taken into account.

Conditional Probability:

• It is the probability of an event occurring when another event has already happened.
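• In symbols (assuming P(B) > 0): P(A | B) = P(A ∧ B) / P(B).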

Bayesian Inference:

• Bayesian inference is a probabilistic approach to machine learning that provides estimates of the probability of specific events.
• Bayesian inference is a statistical method for understanding the uncertainty
inherent in prediction problems.
• A Bayesian inference algorithm can be implemented as a Markov Chain Monte Carlo (MCMC) algorithm that uses prior probability distributions together with the likelihood function to approximate the posterior.
• The basis of Bayesian inference is the notion of a priori and a posteriori
probabilities.
• The a priori probability is the probability of an event before any evidence is considered.
• The a posteriori probability is the probability of an event after taking into account all available evidence.

Bayes' Theorem / Bayes' Rule:

• Bayes' theorem determines the probability of an event when knowledge is uncertain.
• It can be derived from the product rule and the conditional probability of event A given a known event B.
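• Bayes' rule: P(A | B) = P(B | A) · P(A) / P(B).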

Here P(A|B) is known as the posterior, P(B|A) is called the likelihood, P(A) is called the prior probability, and P(B) is called the marginal probability.

Applications of Bayes' Theorem:

• It is used to calculate the next step of a robot when the already executed step is given.
• It is helpful in weather forecasting.
• It is used to solve the Monty Hall problem.
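As a small worked illustration of the rule, the sketch below uses hypothetical numbers (a 1% defect rate and an imperfect test, values chosen only for illustration) to compute a posterior from a prior and a likelihood:

```python
# Bayes' rule with hypothetical numbers: P(defect | positive test).
p_defect = 0.01            # prior P(A): 1% of items are defective (assumed)
p_pos_given_defect = 0.95  # likelihood P(B | A): test flags a defective item (assumed)
p_pos_given_ok = 0.10      # P(B | not A): false-positive rate (assumed)

# Marginal P(B) by the law of total probability, then the posterior P(A | B).
p_pos = p_pos_given_defect * p_defect + p_pos_given_ok * (1 - p_defect)
p_defect_given_pos = p_pos_given_defect * p_defect / p_pos
print(p_defect_given_pos)  # roughly 0.088
```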

Naïve Bayes Classifier:

• It is a classification technique based on Bayes' theorem with an independence assumption: the features are assumed to be conditionally independent given the class.
• With this assumption, the full joint distribution can be written as
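  P(Cause, Effect1, ..., Effectn) = P(Cause) × P(Effect1 | Cause) × ... × P(Effectn | Cause).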

Bayesian Networks:

• "A Bayesian network is a probabilistic graphical model which represents a set of


variables and their conditional dependencies using a directed acyclic graph."
• It is also called a Bayes network, belief network, decision network, or Bayesian
model.
• A Bayesian network can be used for building models from data and expert opinions, and it consists of two parts: a directed acyclic graph and a table of conditional probabilities.
• It is used to represent conditional dependencies.
• It can also be used in various tasks including prediction, anomaly detection,
diagnostics, automated insight, reasoning, time series prediction.
• A Bayesian network graph is made up of nodes and Arcs.

• Each node corresponds to a random variable, and a variable can be continuous or discrete.
• Arc or directed arrows represent the causal relationship or conditional
probabilities between random variables.
• These directed links or arrows connect the pair of nodes in the graph.
• These links represent that one node directly influences the other node.
• The Bayesian network graph does not contain any cycles. Hence, it is known as a directed acyclic graph or DAG.
• The Bayesian network has mainly two components: 1. Causal Component 2.
Actual numbers
• Bayesian network is based on Joint probability distribution and conditional
probability.

Joint probability distribution:

• If the variables are x1, x2, x3, ..., xn, then the probabilities of the different combinations of x1, x2, x3, ..., xn are known as the joint probability distribution.
• P(x1, x2, x3, ..., xn) can be written in terms of conditional probabilities as follows:
  P(x1, x2, ..., xn) = P(x1 | x2, x3, ..., xn) · P(x2, x3, ..., xn)
                     = P(x1 | x2, ..., xn) · P(x2 | x3, ..., xn) ··· P(xn-1 | xn) · P(xn).
• Global semantics defines the full joint distribution as the product of the local conditional distributions.
• Local semantics states that each node is conditionally independent of its nondescendants given its parents.
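• In symbols: P(x1, ..., xn) = P(x1 | parents(X1)) × ... × P(xn | parents(Xn)), i.e. the product over all nodes of the node's probability given its parents.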

Example:

The network structure shows that Burglary and Earthquake are the parent nodes of Alarm and directly affect the probability of the alarm going off.

Variables: Burglary, Earthquake, Alarm, JohnCalls, MaryCalls

Conditional Probability table for Alarm A:


Conditional Probability table for JohnCalls:

Conditional Probability table for MaryCalls:
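Typical textbook values for these tables, assumed here so that the worked inference examples later in these notes have concrete numbers, are:

P(Burglary = true) = 0.001, P(Earthquake = true) = 0.002

B E | P(Alarm = true | B, E)
T T | 0.95
T F | 0.94
F T | 0.29
F F | 0.001

P(JohnCalls = true | Alarm = true) = 0.90, P(JohnCalls = true | Alarm = false) = 0.05
P(MaryCalls = true | Alarm = true) = 0.70, P(MaryCalls = true | Alarm = false) = 0.01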

Applications of Bayesian Networks:


• Spam Filtering
• Biomonitoring
• Image processing
• Turbo code
• Document Classification

Exact Inference in BN:

• In exact inference, we analytically compute the conditional probability distribution over the variables of interest.
• The basic task for any probabilistic inference system is to compute the
posterior probability distribution for a set of variables.
• The notation X denotes the query variable, E denotes the set of evidence variables E1, ..., Em, and Y denotes the nonevidence (hidden) variables.
• Conditional probability can be computed by summing terms from the full joint
distribution.
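• In symbols: P(X | e) = α P(X, e) = α Σy P(X, e, y), where α is a normalizing constant and y ranges over the hidden (nonevidence) variables.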

• Now, a Bayesian network gives a complete representation of the full joint distribution.
• More specifically, the equations above show that the terms P(x, e, y) in the joint distribution can be written as products of conditional probabilities from the network.
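• For the burglary example, the query P(Burglary | JohnCalls = true, MaryCalls = true) becomes P(B | j, m) = α Σe Σa P(B) P(e) P(a | B, e) P(j | a) P(m | a), where the sums range over the values of Earthquake and Alarm.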
• To compute this expression, we have to add four terms, each computed by
multiplying five numbers.
• In the worst case, where we have to sum out almost all the variables, the complexity of the algorithm for a network with n Boolean variables is O(n·2^n).

• This expression can be evaluated by looping through the variables in order.
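The sketch below evaluates this sum for the burglary query by brute-force enumeration; the CPT values are the assumed textbook numbers given earlier, and the variable and function names are illustrative:

```python
# Exact inference by enumeration on the burglary network (illustrative sketch;
# CPT values follow the assumed textbook tables given earlier in these notes).
from itertools import product

P_B = {True: 0.001, False: 0.999}                      # P(Burglary)
P_E = {True: 0.002, False: 0.998}                      # P(Earthquake)
P_A = {(True, True): 0.95, (True, False): 0.94,
       (False, True): 0.29, (False, False): 0.001}     # P(Alarm=true | B, E)
P_J = {True: 0.90, False: 0.05}                        # P(JohnCalls=true | Alarm)
P_M = {True: 0.70, False: 0.01}                        # P(MaryCalls=true | Alarm)

def joint(b, e, a, j, m):
    """Full joint probability as a product of the local CPT entries."""
    p = P_B[b] * P_E[e]
    p *= P_A[(b, e)] if a else 1 - P_A[(b, e)]
    p *= P_J[a] if j else 1 - P_J[a]
    p *= P_M[a] if m else 1 - P_M[a]
    return p

# P(Burglary | JohnCalls=true, MaryCalls=true): sum out the hidden variables E and A,
# then normalize.
unnormalized = {b: sum(joint(b, e, a, True, True)
                       for e, a in product((True, False), repeat=2))
                for b in (True, False)}
alpha = 1 / sum(unnormalized.values())
print({b: alpha * p for b, p in unnormalized.items()})  # ~{True: 0.284, False: 0.716}
```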


Variable Elimination Algorithm:

• The enumeration algorithm can be improved substantially by eliminating repeated calculations.
• Do the calculation once and save the results for later use.
• This is a form of dynamic programming.
• It works by evaluating expressions such as the one above in right-to-left order.
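As a hedged sketch, the same burglary query can be answered with the pgmpy library's variable elimination implementation; the class names and CPT column ordering below follow pgmpy's documented API but should be checked against the installed version, and the probabilities are the assumed textbook values:

```python
# Variable elimination on the burglary network using pgmpy (a sketch; verify the
# API against your pgmpy version). State 0 = false, state 1 = true.
from pgmpy.models import BayesianNetwork
from pgmpy.factors.discrete import TabularCPD
from pgmpy.inference import VariableElimination

model = BayesianNetwork([('Burglary', 'Alarm'), ('Earthquake', 'Alarm'),
                         ('Alarm', 'JohnCalls'), ('Alarm', 'MaryCalls')])

cpd_b = TabularCPD('Burglary', 2, [[0.999], [0.001]])
cpd_e = TabularCPD('Earthquake', 2, [[0.998], [0.002]])
cpd_a = TabularCPD('Alarm', 2,
                   [[0.999, 0.71, 0.06, 0.05],    # P(Alarm=false | B, E)
                    [0.001, 0.29, 0.94, 0.95]],   # P(Alarm=true  | B, E)
                   evidence=['Burglary', 'Earthquake'], evidence_card=[2, 2])
cpd_j = TabularCPD('JohnCalls', 2, [[0.95, 0.10], [0.05, 0.90]],
                   evidence=['Alarm'], evidence_card=[2])
cpd_m = TabularCPD('MaryCalls', 2, [[0.99, 0.30], [0.01, 0.70]],
                   evidence=['Alarm'], evidence_card=[2])
model.add_cpds(cpd_b, cpd_e, cpd_a, cpd_j, cpd_m)

infer = VariableElimination(model)
print(infer.query(['Burglary'], evidence={'JohnCalls': 1, 'MaryCalls': 1}))
```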

Approximate Inference in BN:

• Given the intractability of exact inference in large networks, we will consider approximate inference
methods.

• This section describes randomized sampling algorithms, also called Monte Carlo algorithms.

• They work by generating random events based on the probabilities in the Bayes net and counting
up the different answers found in those random events.

Direct Sampling methods:

• The primitive element in any sampling algorithm is the generation of samples from a known probability distribution.
• For example, an unbiased coin can be thought of as a random variable Coin with
values (heads, tails) and a prior distribution P(Coin) = (0.5,0.5).
• Sampling from this distribution is exactly like flipping the coin: with probability
0.5 it will return heads, and with probability 0.5 it will return tails.
• Given a source of random numbers r uniformly distributed in the range [0,1], it
is a simple matter to sample any distribution on a single variable, whether
discrete or continuous.
• The idea is to sample each variable in turn, in topological order.
• The probability distribution from which the value is sampled is conditioned on
the values already assigned to the variable’s parents.
• Applying it to the network with the ordering Cloudy, Sprinkler, Rain.
• Sample from P(Cloudy) = {0.5,0.5}, value is true.
• Sample from P(Sprinkler | Cloudy = true) = {0.1, 0.9}, value is false.
• Sample from P(Rain | Cloudy = true) = {0.8,0.2}, value is true.
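A minimal sketch of prior sampling on this network, assuming the usual textbook CPT values (including a WetGrass node with parents Sprinkler and Rain):

```python
# Prior (direct) sampling from the sprinkler network; CPT values are the usual
# textbook numbers, assumed here for illustration.
import random

def bernoulli(p_true):
    return random.random() < p_true

def prior_sample():
    """Sample each variable in topological order, conditioning on its sampled parents."""
    cloudy = bernoulli(0.5)
    sprinkler = bernoulli(0.1 if cloudy else 0.5)
    rain = bernoulli(0.8 if cloudy else 0.2)
    wet_grass = bernoulli({(True, True): 0.99, (True, False): 0.90,
                           (False, True): 0.90, (False, False): 0.0}[(sprinkler, rain)])
    return cloudy, sprinkler, rain, wet_grass

samples = [prior_sample() for _ in range(10000)]
# Estimate P(Rain = true) by counting; with these numbers it approaches 0.5.
print(sum(s[2] for s in samples) / len(samples))
```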

Rejection Sampling in Bayesian Networks:

• Rejection sampling is a general method for producing samples from a hard-to-sample distribution given an easy-to-sample distribution.
• It can be used to compute conditional probabilities, that is, to determine P(X | e).
• First it generates samples from the prior distribution specified by the network.
• Then it rejects all samples that do not match the evidence.
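Reusing prior_sample() from the sketch above, rejection sampling for P(Rain = true | Sprinkler = true) keeps only the samples that agree with the evidence:

```python
# Rejection sampling: generate prior samples, discard those inconsistent with the
# evidence Sprinkler = true, then count Rain = true among the survivors.
accepted = [s for s in (prior_sample() for _ in range(50000)) if s[1]]
print(sum(s[2] for s in accepted) / len(accepted))   # approaches 0.3 with these CPTs
```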

Markov Chain Monte Carlo (MCMC) Algorithm:

• MCMC generates each event by making a random change to the preceding event.
• It is therefore helpful to think of the network as being in a particular current state
specifying a value for every variable.
• The next state is generated by randomly sampling a value for one of the
nonevidence variables Xi, conditioned on the current values of the variables in
the Markov blanket of Xi.
• MCMC therefore wanders randomly around the state space (the space of possible complete assignments), flipping one variable at a time but keeping the evidence variables fixed.
• Consider the query P(Rain | Sprinkler = true, WetGrass = true) applied to the network.
• The evidence variables Sprinkler and WetGrass are fixed to their observed values
and the hidden variables Cloudy and Rain are initialized randomly.
• Thus, the initial state is [true, true, false, true]. Now the following steps are
executed repeatedly:
• Cloudy is sampled, given the current values of its Markov blanket variables: in this case, we sample from P(Cloudy | Sprinkler = true, Rain = false). Suppose the result is Cloudy = false. Then the new current state is [false, true, false, true].
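A minimal Gibbs-sampling sketch of this procedure, again assuming the usual textbook CPT values; each step resamples one hidden variable conditioned on its Markov blanket while the evidence stays fixed:

```python
# MCMC (Gibbs sampling) estimate of P(Rain | Sprinkler=true, WetGrass=true);
# CPT values are the usual textbook numbers, assumed for illustration.
import random

P_S = {True: 0.1, False: 0.5}                      # P(Sprinkler=true | Cloudy)
P_R = {True: 0.8, False: 0.2}                      # P(Rain=true | Cloudy)
P_W = {(True, True): 0.99, (True, False): 0.90,
       (False, True): 0.90, (False, False): 0.0}   # P(WetGrass=true | Sprinkler, Rain)

def bernoulli(p):
    return random.random() < p

def sample_cloudy(rain, sprinkler):
    # P(Cloudy | Markov blanket) is proportional to P(Cloudy) * P(Sprinkler | Cloudy) * P(Rain | Cloudy)
    w = {c: 0.5 * (P_S[c] if sprinkler else 1 - P_S[c])
              * (P_R[c] if rain else 1 - P_R[c])
         for c in (True, False)}
    return bernoulli(w[True] / (w[True] + w[False]))

def sample_rain(cloudy, sprinkler, wet_grass):
    # P(Rain | Markov blanket) is proportional to P(Rain | Cloudy) * P(WetGrass | Sprinkler, Rain)
    w = {r: (P_R[cloudy] if r else 1 - P_R[cloudy])
            * (P_W[(sprinkler, r)] if wet_grass else 1 - P_W[(sprinkler, r)])
         for r in (True, False)}
    return bernoulli(w[True] / (w[True] + w[False]))

sprinkler, wet_grass = True, True                  # evidence variables stay fixed
cloudy, rain = bernoulli(0.5), bernoulli(0.5)      # hidden variables start at random values
rain_count, steps = 0, 100_000
for _ in range(steps):
    cloudy = sample_cloudy(rain, sprinkler)
    rain = sample_rain(cloudy, sprinkler, wet_grass)
    rain_count += rain
print(rain_count / steps)                          # approaches about 0.32
```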

Causal Networks:

• A causal network is an acyclic digraph arising from an evolution of a substitution system.
• Each substitution event is a vertex in a causal network.
• Two events which are related by causal dependence, meaning one occurs just before the other, have an edge between the corresponding vertices in the causal network.
• The edge is a directed edge leading from the past event to the future event.
• A CBN is a graph formed by nodes representing random variables, connected by
links denoting causal influence.
• Some causal networks are independent of the choice of evolution; these are called causally invariant.

Structural Causal Models (SCMs):

• SCMs consist of two parts: a graph which visualizes the causal connections, and equations which express the details of those connections. A graph is a mathematical construction that consists of vertices (nodes) and edges (links).
• SCMs use a special kind of graph called a Directed Acyclic Graph (DAG), for which all edges are directed and no cycles exist.
• DAGs are common starting place for causal inference.
• For simple structures, a Bayesian network and a causal network can look identical; the difference lies in whether the edges are interpreted as causal influences.

• Consider a network with 2 nodes and 1 edge.
• This network can be read as either a Bayesian network or a causal network.

Implementing Causal Inference:


1) The do-operator:

• The do-operator is a mathematical representation of a physical intervention.


• If the model starts with Z → X → Y, applying do(X = x) removes every arrow pointing into X (here the arrow from Z), fixing X to the value x while leaving X → Y intact.
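• This lets us distinguish intervention from observation: P(Y | do(X = x)) is computed in the intervened model and in general differs from the observational conditional P(Y | X = x).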
2) Confounding:

• In this example, age is a confounder of education and wealth.
• Adjusting for age means that when looking at the age, education and wealth data, one would compare data points within age groups, not between age groups.

3) Estimating Causal Effects:

• Treatment Effect = (Outcome under E) minus (Outcome under C).


• It is the difference between the outcome a child would receive if assigned to treatment E and the outcome that same child would receive if assigned to treatment C.
• These are called Potential Outcomes.
