Abstract
Recognizing text in natural scene images is important for developing systems such as assistive devices for visually impaired people. Multilingual scene text recognition is also becoming important for wearable camera devices with a language translation feature. Since computational resources are limited on such mobile devices, a fast and accurate Optical Character Recognition (OCR) algorithm is needed. Nearest Neighbor (NN) search is widely used in feature vector-based OCR systems, and its speed needs to be improved. In this paper, we develop an OCR scheme that uses a tree-based clustering technique with Linear Discriminant Analysis (LDA), aiming at real-time Japanese/Chinese character recognition. Experimental results on the ETL9B dataset show that the proposed method is 94.6% faster than our previous method and also outperforms other techniques, at the cost of only a 0.24% drop in accuracy relative to full linear search.
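To make the trade-off between full linear NN search and tree-based search concrete, the following is a minimal sketch and not the method of the paper. It assumes a binary tree built by plain 2-means splits over prototype feature vectors and omits the LDA step entirely, since the abstract does not give those details. All function names, the leaf size, and the synthetic data are hypothetical choices made for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def linear_search(query, prototypes):
    """Full linear NN search: index of the closest prototype."""
    return min(range(len(prototypes)), key=lambda i: sq_dist(query, prototypes[i]))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def two_means(indices, prototypes, rng, iters=10):
    """Split prototype indices into two groups with a few Lloyd iterations."""
    c0, c1 = (prototypes[i] for i in rng.sample(indices, 2))
    left, right = [], []
    for _ in range(iters):
        left, right = [], []
        for i in indices:
            p = prototypes[i]
            (left if sq_dist(p, c0) <= sq_dist(p, c1) else right).append(i)
        if not left or not right:
            break  # degenerate split; the caller turns this node into a leaf
        c0 = mean([prototypes[i] for i in left])
        c1 = mean([prototypes[i] for i in right])
    return (c0, left), (c1, right)


def build_tree(indices, prototypes, rng, leaf_size=8):
    """Recursively build a binary cluster tree (assumption: plain 2-means, no LDA)."""
    if len(indices) <= leaf_size:
        return {"leaf": indices}
    (c0, left), (c1, right) = two_means(indices, prototypes, rng)
    if not left or not right:
        return {"leaf": indices}
    return {
        "centroids": (c0, c1),
        "children": (
            build_tree(left, prototypes, rng, leaf_size),
            build_tree(right, prototypes, rng, leaf_size),
        ),
    }


def tree_search(query, tree, prototypes):
    """Descend to one leaf, then search linearly inside that leaf only."""
    node = tree
    while "leaf" not in node:
        c0, c1 = node["centroids"]
        go_left = sq_dist(query, c0) <= sq_dist(query, c1)
        node = node["children"][0 if go_left else 1]
    return min(node["leaf"], key=lambda i: sq_dist(query, prototypes[i]))


# Synthetic data standing in for character feature vectors (illustration only).
rng = random.Random(0)
prototypes = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
tree = build_tree(list(range(len(prototypes))), prototypes, rng)

queries = [[x + rng.gauss(0, 0.05) for x in rng.choice(prototypes)] for _ in range(50)]
agree = sum(tree_search(q, tree, prototypes) == linear_search(q, prototypes) for q in queries)
print(f"tree search agreed with linear search on {agree} of {len(queries)} queries")
```

The tree search compares each query against only two centroids per level and a handful of prototypes in the final leaf, which is where the speed-up comes from. Because it commits to a single branch at each level, it can miss the true nearest neighbor, which is the kind of small accuracy loss relative to full linear search that the abstract reports.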
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abe, Y., Sasaki, T., Goto, H. (2013). Fast and Accurate Tree-Based Clustering for Japanese/Chinese Character Recognition. In: Petrosino, A. (eds) Image Analysis and Processing – ICIAP 2013. ICIAP 2013. Lecture Notes in Computer Science, vol 8157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41184-7_47
DOI: https://doi.org/10.1007/978-3-642-41184-7_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41183-0
Online ISBN: 978-3-642-41184-7
eBook Packages: Computer Science, Computer Science (R0)