Identification and removal of advertisements from yellow page documents

Authors
Hashemi, RR [1]
Epperson, C [1]
Jones, S [1]
Jin, L [1]
Talburt, J [1]
Affiliation
[1] Univ Arkansas, Dept Comp Sci, Little Rock, AR 72204 USA
Keywords
OCR of yellow pages; identification of advertisements; hesitation; tracking; removal of advertisements
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
OCR fails to deliver the information embedded in a yellow page document. This failure stems from the fact that a yellow page document includes creative advertisements, multiple columns, and decorative graphics. In this research effort, we introduce a set of algorithms that enables us to identify and remove advertisements from a scanned yellow page document. Removal of advertisements is a major step in paving the way for successful OCR of the yellow pages. The scanned image is a grayscale image with 256 gray levels and a resolution of 3300 x 4400. The experimental test shows 98% correct identification and removal of advertisements from the image.
Pages: 94-100
Number of pages: 7