Mallappa, Satishkumar; Dhandra, B. V.; Mukarambi, Gururaj

doi:10.21917/ijsc.2023.0450

Back to matches

Your institution may have access to this item. Find your institution then sign in to continue.

Title: SCRIPT IDENTIFICATION FROM CAMERA CAPTURED INDIAN DOCUMENT IMAGES WITH CNN MODEL.
Authors: Mallappa, Satishkumar; Dhandra, B. V.; Mukarambi, Gururaj
Abstract: Compared to typical scanners, handheld cameras offer convenient, flexible, portable, and noncontact image capture, which enables many new applications and breathes new life into existing ones, but cameracaptured documents may suffer from distortions caused by a nonplanar document shape and perspective projection, which lead to the failure of current optical character recognition (OCR) technologies. This paper presents a new CNN model for script identification from cameracaptured Indian multilingual document images. To evaluate the performance of the proposed model 9 regional languages, one national language and one international Roman languages are considered. Two languages, Hindi national language, and Roman English language are taken as the common languages with regional language for the study. The proposed method is applied on Bi-script, Tri-script, and Multiscript combinations. The average recognition accuracy for three script combinations is 92.92%, for bi-script 91.33%, and for tri-script 87.33%. is achieved. The proposed method is the unified approach used for identifying the script from bi-script, tri-script and multi-script cameracaptured document images and is the novelty of this paper. The proposed model is compared with the Alexnet pretrained CNN model, and it achieved the highest recognition accuracy.
Subjects: OPTICAL character recognition; DOCUMENT imaging systems; UNIVERSAL language; LANGUAGE policy; SCRIPTS; HINDI language; CAMERAS
Publication: ICTACT Journal on Soft Computing, 2023, Vol 14, Issue 2, p3232
ISSN: 0976-6561
Publication type: Article
DOI: 10.21917/ijsc.2023.0450

We found a match

SCRIPT IDENTIFICATION FROM CAMERA CAPTURED INDIAN DOCUMENT IMAGES WITH CNN MODEL.

Mallappa, Satishkumar; Dhandra, B. V.; Mukarambi, Gururaj

OPTICAL character recognition; DOCUMENT imaging systems; UNIVERSAL language; LANGUAGE policy; SCRIPTS; HINDI language; CAMERAS

ICTACT Journal on Soft Computing, 2023, Vol 14, Issue 2, p3232

0976-6561

Article

10.21917/ijsc.2023.0450