Universiti Teknologi Malaysia Institutional Repository

Contour vs non-contour based word segmentation from handwritten text lines: an experimental analysis

Kurniawan, Fajri and Khan, Amjad Rehman and Mohamad, Dzulkifli (2009) Contour vs non-contour based word segmentation from handwritten text lines: an experimental analysis. International Journal of Digital Content Technology and its Applications (JDCTA), 3 (2). pp. 127-131. ISSN 1975-9339

Full text not available from this repository.

Official URL: http://dx.doi.org/ 10.4156/jdcta.vol3.issue2.kurni...

Abstract

This paper compares contour based and noncontours based techniques for extracting words from unconstrained handwritten text lines. Proposed novel approach is based on contours of the words rather only considering threshold for inter-word gaps as previous studies. In this approach, contour of each word is examined along with threshold for inter-word gaps to extract words with high confidence. Unlike previous studies, preprocessing technique is not applied, that enhance the speed significantly. Furthermore, a simple technique for punctuation detection is proposed to increase accuracy of word extraction. For fair comparison text lines are taken randomly from IAM benchmark database and threshold calculation is kept same for all techniques. Experiments thus performed, exhibit improved results and speed over the conventional word extraction methods. Furthermore, developed techniques and results are compared with the other approaches available in the literature using same benchmark database

Item Type:Article
Uncontrolled Keywords:word segmentation, preprocessing, contour detection, punctuation detection
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System (Formerly known)
ID Code:11840
Deposited By: Siti Anisa Abdul Hamid
Deposited On:21 Jan 2011 10:23
Last Modified:21 Jan 2011 10:23

Repository Staff Only: item control page