Evaluation of an Optical Character Recognition Model For Yoruba Text

dc.contributor.authorAkintola, Abimbola G
dc.contributor.authorIbiyemi, Tunji S
dc.contributor.authorBajeh, Amos O
dc.date.accessioned2019-06-11T08:36:31Z
dc.date.available2019-06-11T08:36:31Z
dc.date.issued2019-01
dc.description.abstractThe optical character recognition (OCR) for different languages has been developed and in use with diverse applications over the years. The development of OCR enables the digitization of paper document that would have been neglected over a period of time as well as serving as a form of backup for those documents. The system proposed is for isolated characters of Yoruba language. Yoruba language is a tonal language that carries accent on the vowel alphabets. The process used involves image gray scal, binarization, de-skew, and segmentation. Thus, the OCR enable the system read the images and convert them to text data. The proposed model was evaluated using the information retrieval metrics: Precision and Recall. Results showed a significant performance with a recall of 100% in the sample document used, and precision results that varies between 76%, 97%, and 100% in the sample document.The optical character recognition (OCR) for different languages has been developed and in use with diverse applications over the years. The development of OCR enables the digitization of paper document that would have been neglected over a period of time as well as serving as a form of backup for those documents. The system proposed is for isolated characters of Yoruba language. Yoruba language is a tonal language that carries accent on the vowel alphabets. The process used involves image gray scal, binarization, de-skew, and segmentation. Thus, the OCR enable the system read the images and convert them to text data. The proposed model was evaluated using the information retrieval metrics: Precision and Recall. Results showed a significant performance with a recall of 100% in the sample document used, and precision results that varies between 76%, 97%, and 100% in the sample document.en_US
dc.identifier.urihttp://hdl.handle.net/123456789/2106
dc.language.isoenen_US
dc.publisherTibiscus Universityen_US
dc.subjectRecognitionen_US
dc.subjectBinarizationen_US
dc.subjectAccuracyen_US
dc.subjectImage digitizationen_US
dc.titleEvaluation of an Optical Character Recognition Model For Yoruba Texten_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
17-1-04-Akintola
Size:
2.02 MB
Format:
Adobe Portable Document Format
Description:
Main Article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections