Identification and removal of advertisements from yellow page documents

被引:0
|
作者
Hashemi, RR [1 ]
Epperson, C [1 ]
Jones, S [1 ]
Jin, L [1 ]
Talburt, J [1 ]
机构
[1] Univ Arkansas, Dept Comp Sci, Little Rock, AR 72204 USA
关键词
OCR of yellow pages; identification of advertisements; hesitation; tracking; removal of advertisements;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
OCRing fails to deliver the information embaded in a yellow page document. Such failure stems from the fact that a yellow page document includes creative advertisements, multiple columns, and decorative graphics. In this research effort we introduce a set of algorithms that enables us to identify and remove advertisements from a scanned yellow page document, Removal of advertisements is a major step in paving the way for successful OCRing of the yellow pages. The scanned image is a gray scaled image with 256 gray levels and the resolution of 3300 x 4400. The experimental test shows 98% correct identification and removal of advertisements from the image.
引用
收藏
页码:94 / 100
页数:7
相关论文
共 50 条
  • [31] Guiding the content of tourism web advertisements on a search engine results page
    Lin, Chin-Feng
    Liao, Yu-Hung
    ONLINE INFORMATION REVIEW, 2010, 34 (02) : 263 - 281
  • [32] Young children's ability to recognize advertisements in web page designs
    Ali, Moondore
    Blades, Mark
    Oates, Caroline
    Blumberg, Fran
    BRITISH JOURNAL OF DEVELOPMENTAL PSYCHOLOGY, 2009, 27 : 71 - 83
  • [33] PREPRINTED PAGE TECHNIQUES FOR PRODUCTION OF ESTATE PLAN DOCUMENTS
    KEYDEL, FR
    REAL PROPERTY PROBATE AND TRUST JOURNAL, 1973, 8 (02): : 300 - 370
  • [34] Web page caricatures: Multimedia summaries for WWW documents
    Wynblatt, M
    Benson, D
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS, 1998, : 194 - 199
  • [35] Page layout analysis and classification for complex scanned documents
    Erkilinc, M. Sezer
    Jaber, Mustafa
    Saber, Eli
    Bauer, Peter
    Depalov, Dejan
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXIV, 2011, 8135
  • [36] THE PAGE IMAGE Towards a Visual History of Digital Documents
    Piper, Andrew
    Wellmon, Chad
    Cheriet, Mohamed
    BOOK HISTORY, 2020, 23 : 365 - 397
  • [37] Mobile Video Capture of Multi-page Documents
    Kumar, Jayant
    Bala, Raja
    Ding, Hengzhou
    Emmett, Phillip
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 35 - 40
  • [38] Page Layout Analysis System for Unconstrained Historic Documents
    Kodym, Oldrich
    Hradis, Michal
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 492 - 506
  • [39] Laser removal of foxing from the pages of books and paper documents
    Dobrusina, S. A.
    Parfenov, V. A.
    Podgornaya, N. I.
    Samsygina, N. D.
    Titov, S., V
    Petrov, A. A.
    Aseev, V. A.
    JOURNAL OF OPTICAL TECHNOLOGY, 2023, 90 (10) : 617 - 625