Automatic text extraction in news images using morphology

被引:1
|
作者
Jang, IY [1 ]
Ko, BC [1 ]
Byun, H [1 ]
Choi, YW [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Visual Informat Proc Lab, Seodaemun Gu, Seoul 120749, South Korea
来源
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2002, PTS 1 AND 2 | 2002年 / 4671卷
关键词
text extraction; video indexing; morphology;
D O I
10.1117/12.453094
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present a new method to extract both superimposed and embedded graphical texts in a freeze-frame of news video. The algorithm is summarized in the following three steps. For the first step, we convert a color image into a gray-level image and apply contrast stretching to enhance the contrast of the input image. Then, a modified local adaptive thresholding is applied to the contrast-stretched image. The second step is divided into three processes: eliminating text-like components by applying erosion, dilation, and (OpenClose + CloseOpen)/2 morphological operations, maintaining text components using (OpenClose + CloseOpen)/2 operation with a new Geo-correction method, and subtracting two result images for eliminating false-positive components further. In the third filtering step, the characteristics of each component such as the ratio of the number of pixels in each candidate component to the number of its boundary pixels and the ratio of the minor to the major axis of each bounding box are used. Acceptable results have been obtained using the proposed method on 300 news images with a recognition rate of 93.6%. Also, our method indicates a good performance on all the various kinds of images by adjusting the size of the structuring element.
引用
收藏
页码:521 / 530
页数:10
相关论文
共 50 条
  • [21] Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text
    Rott, Michal
    Cerva, Petr
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 101 - 108
  • [22] Text Extraction from Scene Images using Statistical Distributions
    Ghoshal, Ranjit
    Roy, Anandarup
    Parui, Swapan K.
    2012 THIRD INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2012, : 187 - 190
  • [23] Text Extraction and Enhancement of Binary Images Using Cellular Automata
    Sahoo, G.
    Kumar, Tapas
    Raina, B. L.
    Bhatia, C. M.
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2009, 6 (03) : 254 - 260
  • [24] Automated Text Extraction from Images using OCR System
    Kaundilya, Chandni
    Chawla, Diksha
    Chopra, Yatin
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 145 - 150
  • [25] Text extraction in document images: highlight on using corner points
    Yadav, Vikas
    Ragot, Nicolas
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 281 - 286
  • [26] Text Extraction and Enhancement of Binary Images Using Cellular Automata
    G. Sahoo
    Tapas Kumar
    B. L. Raina
    C. M. Bhatia
    International Journal of Automation & Computing, 2009, 6 (03) : 254 - 260
  • [27] Automatic Caption Generation for News Images
    Feng, Yansong
    Lapata, Mirella
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) : 797 - 812
  • [28] Text Extraction from Document Images using Edge Information
    Grover, Sachin
    Arora, Kushal
    Mitra, Suman K.
    2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 582 - +
  • [29] Mathematical Morphology and Region Clustering Based Text Information Extraction from Malayalam News Videos
    Anoop, K.
    Gangan, Manjary P.
    Lajish, V. L.
    ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS (SIRS-2015), 2016, 425 : 431 - 442
  • [30] CNsum: Automatic Summarization for Chinese News Text
    Zhao, Yu
    Huang, Songping
    Zhou, Dongsheng
    Ding, Zhaoyun
    Wang, Fei
    Nian, Aixin
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 539 - 547