Text area detection in digital documents images using textural features

No Thumbnail Available

Date

2007

Authors

Ar, İlktan
Karsligil, M. Elif

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Research Projects

Organizational Units

Journal Issue

Abstract

In this paper we propose a new texture-based method for extraction of text areas in a complex document image. Gabor filter motivated by the multi-channel filtering approach of Human Visual System (HVS) has been employed to create energy map of the document. In this energy map we assumed that text areas were rich in high frequency components. Connected components (probable text characters) were extracted by binarization of the energy map with Otsu's adaptive threshold method. First non-text components such as pictures lines frames etc. were eliminated by Gabor filtering. As a novel approach remaining non-text components were then eliminated by using character component interval tracing. Elimination that formed in two stage enhanced the success of detecting text area on different kinds of digital documents. © Springer-Verlag Berlin Heidelberg 2007.

Description

Keywords

Character tracing, Document image, Gabor filter, Text area extraction

Turkish CoHE Thesis Center URL

Citation

1

WoS Q

N/A

Scopus Q

Q2

Source

Volume

4673 LNCS

Issue

Start Page

555

End Page

562