[Previous [1] [2] [3] [4] [5] [6] [7] [8]

Journal of Inforamtion Science and Engineering, Vol.16, No.1, pp.127-139 (January 2000)

Information Distribution of the Central Projection Method
For Chinese Character Recognition

Yu Tao, Ernest C. M. Lam, Chin S. Huang and Yuan Y. Tang
Department of Computer Science
Hong Kong Baptist University
Kowloon Tong, Hong Kong
E-mail: {taoyu, ecmlam, cshuang, yytang}@comp.hkbu.edu.hk

A new method called central projection transformation is proposed in this paper for feature extraction. From our experiments, the new method is found to be efficient in extracting features from Chinese characters, which contain vast amounts of information. Chinese characters have complex structures, and some of them are composed of several separate components, so several contours are embedded in a character. This may obstruct application of the contour approach to recognizing Chinese characters. Central projection transformation can convert such a multicontour pattern into a solid convex pattern, whose contour is a unique polygon. Most of the information of this new pattern is still located around its peripheries. In this paper, information contents and entropy measurements are studied in both original Chinese characters and transformed new objects from the 3500 most frequently used Chinese characters. The results indicate that both the information contents and entropy measurements of pixels vary according to the positions of the points, and that most of the information is located around the peripheries of the original characters as well as of the new ones. This approach can greatly simplify the processing of Chinese characters and other multicontour patterns. It is also a powerful tool for processing Arabic characters, Japanese characters and other characters.

Keywords: character recognition, feature extraction, distribution of information, entropy, central projection transformation

Full Text () Retrieve PDF document (200001_08.pdf : 3,563,874 bytes)

Received July 14, 1998; revised October 6, 1998; accepted December 22, 1998.
Communicated by Zen Chen..