A Language-Independent, Open-Vocabulary System Based on HMMs for Recognition of Ultra Low Resolution Words - HES SO Valais Publications

BROWSE
EXPORT
- Export all publications
SITE
- Help
- About this site
GUEST USER
anonymous
Topic Subscribe
LOGIN
Name:

Password:

Deutsch, English, Nederlands, Norsk, Português, <more...>

Dear guest, welcome to this publication database. As an anonymous user, you will probably not have edit rights. Also, the collapse status of the topic tree will not be persistent. If you like to have these and other options enabled, you might ask Admin (Ivan Eggel) for a login account.

[BibTeX] [RIS]

Type of publication:	Inproceedings
Citation:	eins08:sac
Booktitle:	23rd Annual ACM Symposium on Applied Computing (ACM SAC 2008), Fortaleza, Ceara, Brasil
Year:	2008
Pages:	429--433
URL:	http://www.hennebert.org/downl...
DOI:	10.1145/1363686.1363791
Abstract:	In this paper, we introduce and evaluate a system capable of recognizing ultra low resolution words extracted from images such as those frequently embedded on web pages. The design of the system has been driven by the following constraints. First, the system has to recognize small font sizes where anti-aliasing and resampling procedures have been applied. Such procedures add noise on the patterns and complicate any a priori segmentation of the characters. Second, the system has to be able to recognize any words in an open vocabulary setting, potentially mixing different languages. Finally, the training procedure must be automatic, i.e. without requesting to extract, segment and label manually a large set of data. These constraints led us to an architecture based on ergodic HMMs where states are associated to the characters. We also introduce several improvements of the performance increasing the order of the emission probability estimators and including minimum and maximum duration constraints on the character models. The proposed system is evaluated on different font sizes and families, showing good robustness for sizes down to 6 points.
Keywords:	image analysis, OCR, Pattern Recognition
Authors	Einsele, F. Ingold, Rolf Hennebert, Jean
Added by:	[]
Total mark:	0
Attachments

Notes

Topics
Institute of Informatics (II) 0/1381

processing time: 0.7904 seconds.