Adaptive Classifier Construction An Approach To Ha
Adaptive Classifier Construction An Approach To Ha
net/publication/220801444
CITATIONS READS
16 204
1 author:
SEE PROFILE
All content following this page was uploaded by Tuan Trung Nguyen on 15 January 2015.
1 Introduction
J.J. Alpigini et al. (Eds.): RSCTC 2002, LNAI 2475, pp. 578–585, 2002.
c Springer-Verlag Berlin Heidelberg 2002
Adaptive Classifier Construction 579
In this paper, we shall show that all three components can be dynamically
constructed by an adaptive process based extensively on the actual input image
domain.
For the researches in this paper we have chosen the Enhanced Loci coding
scheme, which assigns to every image’s pixel a code reflecting the topology of its
neighborhood. The Enhanced Loci algorithm, though simple, has proved to be
very successful in digit recognition. For a detailed description of the Loci coding
scheme, see [4].
Once the Loci coding is done, the digit image is segmented into regions con-
sisting of pixels with the same code value. These regions then serve as primitives
to build the graph representation of the image.
NW NE
SW SE
GI = {N, P, E}
Definition 2. Given a set of base segments with common node and edge sets
S1 = {N, P1 , E}, S2 = {N, P2 , E}, ..., Sk = {N, Pk , E}, a base skeleton segment
is defined as:
Sbs = {N, Pbs , E}
where Pbs is a combined set of properties:
We shall say that a base skeleton segment Sbs = {N, {(L1 , f1 ), (L2 ,
f2 ), ..., (Lk , fk )}, E} matches a graph GI , or match(Sbs , GI ) if ∃1 ≤ j ≤ k :
match({N, Lk , E}, GI )
Definition 3. A base skeleton graph (BSG) is any set of base skeleton segments.
One can look at BSGs as “soft” or “blurred” prototypical graphs that may
be used to represent class of digits. By fine-tuning various parameters of the
model, e.g. the set of properties or connection labels or by imposing various
cut-off thresholds, we can dynamically control the primitives extraction process.
else
τ (S, G) = 0.
Algorithm
Step 1 – Initial Skeleton Graph Set
for each digit class k, a set of initial BSGs are constructed based on::
End Algorithm
It can be observed that this is an adaptive iterative process with a two-layered
k-NN clustering scheme, aimed at the optimization of three components crucial
to the recognition process:
– Primitives extraction process, implemented by Loci coding scheme, code-
defined and connection-defined similarity measures.
– Similarity measures model, represented by base skeleton graphs.
– Class distance functions (discriminants), synthesized over extended skeleton
graphs.
584 T.T. Nguyen
5 Dissimilarity Measures
6 Results of Experiments
In order to verify the developed methods, extensive testing has been conducted.
We have chosen the U.S. National Institute of Standards and Technology (NIST)
Handwritten Segmented Character Special Database 3, a major reference base
within the handwritten character recognition community, as the main data col-
lection. The base contains 223,125 128 × 128 normalized binary images with
isolated handwritten digits from 2,100 different people.(For details see [3])
As a reference experiment’s data collection, we have chosen a random portion
of the whole base that contained:
– 44,000 digits as a training table, of which 4,000 have been separated for tests
during the learning process.
– 4,000 digits for final test table
The results obtained qualify our system close to the leading recognition pack-
ages tested at NIST, of which the average zero-rejection error rates were 1.70
percent. (See [3])
7 Conclusion
We presented a uniformed framework for the automatic construction of classifiers
based on an adaptive scheme. A model for the synthesis of similarity measures
from the input data primitives through higher level features has been proposed.
The method allows for a flexible learning from the input training data during
the construction phase and proved to be effective. The same framework can be
used to develop dissimilarity measures that are highly useful in the improvement
of the classification quality. Experiments conducted on a large handwritten digit
database showed that the method can be applied to practical problems with
encouraging results. The framework can easily be adapted to the recognition of
other structured objects such as handwritten characters, fingerprints, iris images
or human faces.
References
1. Michael R. Anderberg. Cluster Analysis for Applications. Academic Press, Inc.,
1973.
2. Jan Bazan, Hung Son Nguyen, Tuan Trung Nguyen, Jaroslaw Stepaniuk, and An-
drzej Skowron. Application of modal logics and rough sets for classifying objects. In
Michel De Glas and Zdzislaw Pawlak, editors, Proceedings of the Second World Con-
ference on the Fundamentals of Artificial Intelligence, pages 15–26, Paris, France,
1995. Ankor.
3. J. Geist, R. A. Wilkinson, S. Janet, P. J. Grother, B. Hammond, N. W. Larsen,
R. M. Klear, C. J. C. Burges, R. Creecy, J. J. Hull, T. P. Vogl, and C. L. Wilson.
The second census optical character recognition systems conference. NIST Technical
Report NISTIR 5452, pages 1–261, 1994.
4. K. Komori, T. Kawatani, K. Ishii, and Y. Iida. A feature concentrated method for
character recognition. IFIP Proceedings, pages 29–34, 1977.
5. Z.C. Li, C.Y. Suen, and J. Guo. Hierarchical models for analysis and recognition
of handwritten characters. Annals of Mathematics and Artificial Intelligence, pages
149–174, 1994.
6. L. Polkowski and A. Skowron. Towards adaptive calculus of granules. In L.A.
Zadeh and J. Kacprzyk, editors, Computing with Words in Information/Intelligent
Systems, pages 201–227, Heidelberg, 1999. Physica-Verlag.
7. Robert J. Schalkoff. Pattern Recognition: Statistical, Structural and Neural Ap-
proaches. John Wiley & Sons, Inc., 1992.
8. Kodratoff Y. and Michalski R. Machine Learning: An Artificial Intelligence Ap-
proach, volume 3. Morgan Kaufmann, 1990.