Publications:Lexicon-based Offline Recognition of Amharic Words in Unconstrained Handwritten Text

Title Lexicon-based Offline Recognition of Amharic Words in Unconstrained Handwritten Text
Author Yaregal Assabie and Josef Bigun
Year 2008
PublicationType Conference Paper
HostPublication 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA, 8-11 December 2008
Conference 19th International Conference on Pattern Recognition, ICPR, Tampa, FL, 8-11 December 2008
Abstract This paper describes a lexicon-based offline handwriting recognition system for Amharic words. The system computes direction fields of scanned handwritten documents, from which pseudo-characters are segmented. The pseudo-characters are grouped by proximity and direction to form text lines. Words are then segmented by analyzing the relative gaps between successive pseudo-characters in each text line. For each segmented word image, the structural characteristics of the pseudo-characters are syntactically analyzed to predict a set of plausible characters forming the word. The most likely word is finally selected from the candidates by matching against the lexicon. The system is tested on a database of unconstrained handwritten Amharic documents collected from various sources. The lexicon is prepared from words appearing in the collected database.
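
The final lexicon-matching step can be illustrated with a minimal sketch. The abstract does not say how candidates are scored, so everything here is an assumption made for illustration: the function name `match_lexicon`, the per-position confidence scores, the product-of-scores ranking, and the three-word toy lexicon are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional


def match_lexicon(candidates: List[Dict[str, float]],
                  lexicon: List[str]) -> Optional[str]:
    """Pick the lexicon word best supported by per-position character candidates.

    candidates[i] maps each plausible character at position i to an assumed
    confidence score. A word's score is the product of the scores of its
    characters; a character missing from a position's candidates scores 0.
    """
    best_word: Optional[str] = None
    best_score = 0.0
    for word in lexicon:
        # Only words with one character per segmented position can match.
        if len(word) != len(candidates):
            continue
        score = 1.0
        for ch, options in zip(word, candidates):
            score *= options.get(ch, 0.0)
        if score > best_score:
            best_word, best_score = word, score
    return best_word


# Hypothetical toy lexicon and candidate sets (not data from the paper).
lexicon = ["ሰላም", "ሰላጣ", "ቤት"]
candidates = [
    {"ሰ": 0.7, "ሠ": 0.3},
    {"ላ": 0.6, "ለ": 0.4},
    {"ም": 0.8, "ጣ": 0.2},
]
best = match_lexicon(candidates, lexicon)
print(best)
```

With these made-up scores the function selects ሰላም, since it has the highest product of scores among the lexicon words of matching length. A real system would likely need to tolerate segmentation errors (for example, by allowing insertions and deletions), which this sketch does not attempt.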