Journal of Management Information and Decision Sciences

1532-5806

Abstract

Design and development of an OCR system that can convert both Amharic and English based images and scanned PDF files into editable content

Beshah, T., & Asfawosen, A.

Optical Character Recognition (OCR) is the technology by which a computer recognizes printed or written text characters. It is used in many settings, such as libraries and information centers, to document and preserve images of handwritten and/or computer-processed text. Done effectively, it is expected to facilitate text recognition and processing. Although there have been attempts to build such systems, their accuracy and applicability are limited by language differences. This research therefore investigates and develops an OCR system that can recognize both Amharic (a local language of Ethiopia) and English text images. A design science research process is followed throughout the research, and a convincing result is achieved.
