Statistical modeling for the detection, localization and extraction of text from heterogeneous textual images using combined feature scheme

Authors
D. Chitrakala Gopalan
Affiliations
[1] Anna University,Department of Computer Science and Engineering, Easwari Engineering College
[2] Anna University,Department of Computer Science and Engineering, College of Engineering
Source
Keywords
Text extraction; Non-subsampled Contourlet Transform; Gray level run length matrix; Caption text; Scene text; Document image
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Discriminating between the text and non-text regions of an image is a complex and challenging task. In contrast to Caption text, Scene text can have any orientation and may be distorted by perspective projection. Moreover, it is often affected by variations in scene and camera parameters such as illumination and focus. These variations make it extremely difficult to design a unified text extraction method for different kinds of images. This paper proposes a unified statistical approach for extracting text from hybrid textual images (images containing both Scene text and Caption text) and from Document images with variations in text, using features carefully selected by a multi-level feature priority (MLFP) algorithm. In combination, the selected features are found to be a good choice of feature vector and can discriminate between text and non-text regions in Scene text, Caption text and Document images. The proposed system is robust to illumination, transformation/perspective projection, font size and radially changing/angular text. The MLFP feature selection algorithm is evaluated with three common machine learning (ML) algorithms: a decision tree inducer (C4.5), a naive Bayes classifier, and an instance-based K-nearest neighbour learner. Its effectiveness is shown by comparison with three feature selection methods on a benchmark dataset. The proposed text extraction system is compared with the Edge-based, Connected component and Texture-based methods and shows encouraging results. Its major applications include preprocessing for optical character recognition and multimedia processing, mobile robot navigation, vehicle license detection and recognition, page segmentation and text-based image indexing.
Pages: 165-183
Number of pages: 18