Abstract
The complexity in machine recognition of Arabic language due to its cursive nature is well known. Urdu is a popular language which is written in Arabic based script but uses a special calligraphic style of writing known as Nastaliq. The calligraphic nature of Nastaliq and other linguistic properties of Urdu introduce many other complexities which must be kept in mind in the development of OCR. This paper introduces all those complexities and open issues which are unique to Urdu language and Nastaliq style or writing from OCR point of view

Danish Altaf Satti, Dr. Khalid Saleem. (2012) Complexities and Implementation Challenges in Offline Urdu Nastaliq OCR, Conference on Language and Technology 2012.
  • Viewed 1461
  • Downloads 179
  Previous Article
Publisher
Center for Language Engineering
Country
Pakistan
City
Lahore
From
09-11-2012
To
10-11-2012