Pattern Recognition
Pattern Recognition
Unit 1 : Introduction
Pattern is everything around in this digital world. A pattern can either be seen
physically or it can be observed mathematically by applying algorithms.
Example: The colors on the clothes, speech pattern, etc. In computer science, a
pattern is represented using vector feature values.
What is Pattern Recognition?
Feature Vector:
A set of features that are taken together, forms the features vector.
Example: In the above example of a face, if all the features (eyes, ears, nose, etc) are
taken together then the sequence is a feature vector([eyes, ears, nose]). The feature
vector is the sequence of a feature represented as a d-dimensional column vector. In
the case of speech, MFCC (Mel-frequency Cepstral Coefficient) is the spectral feature
of the speech. The sequence of the first 13 features forms a feature vector.
Learning is a phenomenon through which a system gets trained and becomes adaptable
to give results in an accurate manner. Learning is the most important phase as to how
well the system performs on the data provided to the system depends on which
algorithms are used on the data. The entire dataset is divided into two categories, one
which is used in training the model i.e. Training set, and the other that is used in testing
the model after training, i.e. Testing set.
Training Set
The training set is used to build a model. It consists of the set of images that are used
to train the system. Training rules and algorithms are used to give relevant information
on how to associate input data with output decisions. The system is trained by applying
these algorithms to the dataset, all the relevant information is extracted from the data,
and results are obtained. Generally, 80% of the data of the dataset is taken for training
data.
Testing Set:
Testing data is used to test the system. It is the set of data that is used to verify whether
the system is producing the correct output after being trained or not. Generally, 20%
of the data of the dataset is used for testing. Testing data is used to measure the
accuracy of the system. For example, a system that identifies which category a
particular flower belongs to is able to identify seven categories of flowers correctly
out of ten and the rest of others wrong, then the accuracy is 70 %
Real-time Examples Pattern Recognition:
A pattern is a physical object or an abstract notion. While talking about the classes of
animals, a description of an animal would be a pattern. While talking about various
types of balls, then a description of a ball is a pattern. In the case balls considered as
pattern, the classes could be football, cricket ball, table tennis ball, etc. Given a new
pattern, the class of the pattern is to be determined. The choice of attributes and
representation of patterns is a very important step in pattern classification. A good
representation is one that makes use of discriminating attributes and reduces the
computational burden in pattern classification.
Advantages:
Applications:
• Image processing, segmentation, and analysis
• Computer vision
The pattern recognition approach is used for the discovery, imaging, and
interpretation of temporal patterns in seismic array recordings. Statistical
pattern recognition is implemented and used in different types of seismic
analysis models.
The greatest success in speech recognition has been obtained using pattern
recognition paradigms. It is used in various algorithms of speech
recognition which tries to avoid the problems of using a phoneme level of
description and treats larger units such as words as pattern
• Fingerprint identification
1. Sensing: It deals with problem arises in the input such as its bandwidth,
resolution, sensitivity, distortion, signal-to-noise ratio, latency, etc.
There are various sequences of activities that are used for designing the Pattern
Recognition Systems. These activities are as follows:
• Data Collection
• Feature Choice
• Model Choice
• Training
• Evaluation
Pattern Recognition in Data Machine-driven Data Analysis
At the age of 5, most children can recognize digits and letters – small characters, large
characters, handwritten, machine printed or rotated – all easily recognized by the young.
In most instances, the best pattern recognizers are humans, yet we do not understand
how humans recognize patterns. The rapidly growing amount of data generated is
making it impossible for humans to interpret and driving the need for machines to be
able to identify patterns quickly and accurately. The ability to automate the recognition
of patterns and regularities in data has many applications, from facial recognition
software to tumor detection.
Given a pattern, its recognition and classification can consist of one of the following
two tasks: Supervised classification identifies the input pattern as a member of a
predefined class. (Descriptive) Unsupervised classification assigns the input pattern to
a hitherto undefined class. (Explorative) The recognition problem is usually posed as
either classification or categorization task. The classes are either defined by the system
designed (supervised classification) or are learned based on the similarity of patterns
(in unsupervised classification). We recommend checking out our article about
supervised vs. unsupervised learning. Pattern recognition is constantly evolving, driven
by emerging applications that are not only challenging but also more computationally
intensive.
What Is the Goal of Pattern Recognition?
The goal of pattern recognition is based on the idea that the decision-making process
of a human being is somewhat related to the recognition of patterns. For example, the
next move in a chess game is based on the board’s current pattern and buying or selling
stocks is decided by a complex pattern of financial information. Therefore, the goal of
pattern recognition is to clarify these complicated mechanisms of decision-making
processes and to automate these e functions using computers.
Pattern recognition is defined as the study of how machines can observe the
environment, learn to distinguish various patterns of interest from their background, and
make logical decisions about the categories of the patterns. During recognition, the
given objects are assigned to a specific category. Because it is a constantly evolving and
broad field, there exist several definitions of Pattern Recognition, as it is a constantly
evolving and broad field.
Pattern recognition is also an integral part of most machine intelligence systems built
for decision making” (2003, Sergio Theodoridis). In general, pattern recognition can be
described as an information reduction, information mapping, or information labeling
process. In computer science, pattern recognition refers to the process of matching
information already stored in a database with incoming data based on their attributes.
Historically, the two major approaches to pattern recognition are Statistical Pattern
Recognition (or decision-theoretic) and Syntactic Pattern Recognition (or structural).
The third major approach is based on the technology of artificial neural networks
(ANN), named Neural Pattern Recognition. No single technology is always the optimal
solution for a given pattern recognition problem. All three or hybrid methods are often
considered to solve a given pattern recognition problem. Next, we will look into the
most prominent methods used for finding patterns in data.
Syntactic Pattern Recognition, also known as SyntPR, is used for recognition problems
involving complex patterns that can be addressed by adopting a hierarchical
perspective. Accordingly, the syntactic pattern approach relies on primitive subpatterns
(such as letters of the alphabet). The pattern is described depending on the way the
primitives interact with each other. An example of this interaction is how they are
assembled in words and sentences. The given training samples develop how
grammatical rules are developed and how the sentences will later be “read”. In addition
to classification, structural pattern recognition also provides a description of how the
given pattern is constructed from the primitive subpatterns. Hence, the approach has
been used in examples where the patterns have a distinct structure that can be captured
in terms of a rule-set, such as EKG waveforms or textured images.
The syntactic approach may lead to a combinatorial explosion of probabilities to be
examined, requiring large training sets and very large computational efforts. Template-
matching Template matching is one of the simplest and earliest approaches to pattern
recognition. Matching is a generic operation that is used to determine the similarity
between two entities of the same type. Therefore, template matching models try to
discover similarities in a sample based on a reference template. Hence, the template
matching technique is commonly used in digital image processing for detecting small
sections of an image that match a template image. Typical real-world examples are
medical image processing, quality control in manufacturing, robot navigation, or face
recognition.