Statistical modeling for the detection, localization and extraction of text from heterogeneous textual images using combined feature scheme

被引:0
|
作者
D. Chitrakala Gopalan
机构
[1] Anna University,Department of Computer Science and Engineering, Easwari Engineering College
[2] Anna University,Department of Computer Science and Engineering, College of Engineering
来源
关键词
Text extraction; Non sub sampled Contourlet Transform; Gray level run length matrix; Caption text; Scene text; Document image;
D O I
暂无
中图分类号
学科分类号
摘要
Discriminating between the text and non text regions of an image is a complex and challenging task. In contrast to Caption text, Scene text can have any orientation and may be distorted by the perspective projection. Moreover, it is often affected by variations in scene and camera parameters such as illumination, focus, etc. These variations make the design of unified text extraction from various kinds of images extremely difficult. This paper proposes a statistical unified approach for the extraction of text from hybrid textual images (both Scene text and Caption text in an image) and Document images with variations in text by using carefully selected features with the help of multi level feature priority (MLFP) algorithm. The selected features are combinedly found to be the good choice of feature vectors and have the efficacy to discriminate between text and non text regions for Scene text, Caption text and Document images and the proposed system is robust to illumination, transformation/perspective projection, font size and radially changing/angular text. MLFP feature selection algorithm is evaluated with three common ML algorithms: a decision tree inducer (C4.5), a naive Bayes classifier, and an instance based K-nearest neighbour learner and effectiveness of MLFP is shown by comparing with three feature selection methods with benchmark dataset. The proposed text extraction system is compared with the Edge based method, Connected component method and Texture based method and shown encouraging result and finds its major application in preprocessing for optical character recognition technique and multimedia processing, mobile robot navigation, vehicle license detection and recognition, page segmentation and text-based image indexing, etc.
引用
收藏
页码:165 / 183
页数:18
相关论文
共 50 条
  • [21] A Feature Extraction Scheme from Region of Interest of Wireless Capsule Endoscopy Images for Automatic Bleeding Detection
    Ghosh, T.
    Bashar, S. K.
    Fattah, S. A.
    Shahnaz, C.
    Wahid, K. A.
    2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2014, : 256 - 260
  • [22] Feature extraction from MR images for detection of brain and breast tumors through mathematical modeling
    Badshah, Noor
    Rabbani, Hena
    Atta, Hadia
    Irfan, Muhammad Abeer
    Ahmad, Ali
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [23] Multilingual Artificial Text Detection and Extraction from Still Images
    Raza, Ahsen
    Abidi, Ali
    Siddiqi, Imran
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [24] Sign text detection in street view images using an integrated feature
    Zhao, Fan
    Yang, Yao
    Zhang, Hai-yan
    Yang, Lin-lin
    Zhang, Lin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (21) : 28049 - 28076
  • [25] Sign text detection in street view images using an integrated feature
    Fan Zhao
    Yao Yang
    Hai-yan Zhang
    Lin-lin Yang
    Lin Zhang
    Multimedia Tools and Applications, 2018, 77 : 28049 - 28076
  • [26] Deep learning based text detection using resnet for feature extraction
    Li-Kun Huang
    Hsiao-Ting Tseng
    Chen-Chiung Hsieh
    Chih-Sin Yang
    Multimedia Tools and Applications, 2023, 82 : 46871 - 46903
  • [27] Deep learning based text detection using resnet for feature extraction
    Huang, Li-Kun
    Tseng, Hsiao-Ting
    Hsieh, Chen-Chiung
    Yang, Chih-Sin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46871 - 46903
  • [28] A Pair-copula Based Scheme for Text Extraction from Digital Images
    Roy, Anandarup
    Parui, Swapan K.
    Roy, Utpal
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 892 - 896
  • [29] Defect detection of castings in radiography images using a robust statistical feature
    Zhao, Xinyue
    He, Zaixing
    Zhang, Shuyou
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2014, 31 (01) : 196 - 205
  • [30] Text extraction from scene images by character appearance and structure modeling
    Yi, Chucai
    Tian, Yingli
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (02) : 182 - 194