We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
ارتقا و اصلاح فرايندهاي رايج در بازشناسي نوري حروف متون فارسي با ب هكارگيري ويژگيهاي خط فارسي و الگوريتم انتقال فضا.
- Authors
آرش زارعيان; طيبه موسوي ميانگ; بلقيس روشن; سيد مصطفي فخر احم
- Abstract
Since the technology of optical recognition of characters is essentially based on Latin script, almost all the algorithms and processes involved in Persian OCR systems are constructed upon the structure and scriptological features of Latin alphabet. This utilization of the means and features of Latin script to design Persian-based OCR systems however, not only has not resulted in the appropriate optical recognition of Persian characters but it also has simultaneously ended in confusion on the part of both the Persian-speaking users and the systems. This paper, therefore, begins with a short review of the significance of language and linguistics in the field of information technology in connection with OCR systems. Then, it will continue with a short history of Persian/Arabic script, while focusing on the scribal features of Persian writing system and its differences with other scripts. In the next part, for effective utilization of the formal elements of the Persian script, these elements have been categorized according to their application and significance in the process of the user’s interaction with Persian OCR systems. Furthermore, through a step by step discussion and analysis of the processes involved in optical recognition of characters based on the scriptological features of the Persian script, not only the deficiencies and faults of the current Latin-based OCR systems will be pinpointed but also a different aspect of the Persian writing system, in connection with its use in computer software, especially OCR systems, will be used so that the reader will practically notice the potentials and capabilities of this complex script in contrast to the simpler Latin writing system. In the end, in order to upgrade and improve the current algorithms employed in Persian OCR systems, the geometrical process of transferring bi-dimensional specifications into mono-dimensional ones has been utilized. The proposed algorithm, which is based on the scriptological features of Persian script, will simultaneously result in the convenient manipulation of patterns, reduction of the bulk of the database, and acceleration of the data processing rate.
- Subjects
OPTICAL character recognition; INFORMATION technology; IRANIAN history; FAULT currents; COMPUTER software
- Publication
Language Related Research, 2023, Vol 14, Issue 2, p363
- ISSN
2322-3081
- Publication type
Article
- DOI
10.29252/LRR.14.2.11