Universiti Teknologi Malaysia Institutional Repository

Text extraction from invariant complex image

Al Hashi, Nouri Ali Al Mabrouk (2009) Text extraction from invariant complex image. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems.

[img] PDF - Submitted Version
Restricted to Repository staff only

3783Kb
[img] PDF
27Kb
[img] PDF
33Kb
[img] PDF
41Kb

Abstract

Great progress has been made in Optical Character Recognition (OCR) technology. Most current OCRs, however, can only read characters printed on sheets of paper according to some rigid format restrictions. For that, the detection and extraction of text regions in an image are well known problems in Computer Vision research area. The goal of this project is to extract and recognize the text from an image by using the edge-based and fuzzy logic algorithm respectively. The algorithms are implemented and evaluated by using a set of images of natural scenes that vary along its’ size, scale and orientation. Various kernels can be used for this operation ,the whole set of 8 kernels is produced by taking one of kernels and rotating its coefficient circularly and edgedetection operator is calculated by forming matrix centered on pixel chosen as center of matrix area, then Localization involves further enhancing regions by eliminating nontext regions. Edge-detection works quite well for digital image corrupted with multiscale and multi-orientation whereas its performance of this operator cannot be used in practical image which generally corrupted other types and edge-detection for detection of edge in digital image is that image should contain sharp intensity transition and low noise of the type is present. Moreover the image is colored image .Then, edge detect at eight edges and convolve with Gaussian after that select the strong edge was suitable of detect the text. As known be the project in complex image by using eight kernels to accomplish the task .Then, we used identified pixel of determine the character with use fuzzy logic.

Item Type:Thesis (Masters)
Additional Information:Thesis (Sarjana Sains (Sains Komputer)) - Universiti Teknologi Malaysia, 2009; Supervisor : Prof. Dr. Dzulkifli Mohamad
Uncontrolled Keywords:text processing (Computer science), complex image
Subjects:Q Science > QA Mathematics > QA76 Computer software
Divisions:Computer Science and Information System (Formerly known)
ID Code:12767
Deposited By: Ramli Haron
Deposited On:28 Jun 2011 09:07
Last Modified:07 Aug 2012 00:38

Repository Staff Only: item control page