Robin: Extracting visual and textual features from web pages

Authors
Oka, M [1]
Tsukada, H [1]
Kato, K [1]
Affiliation
[1] Univ Tsukuba, Tsukuba, Ibaraki 305, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Web pages contain information in several forms. These include textual information such as words and visual information such as images, use of color, and layout. We propose a method of extracting the characteristic features from both the textual and visual information in Web pages. Our method enables seamless integration of the two types of information and automatic extraction of their characteristic features. Based on this method, we developed a proof-of-concept system called Robin, which is designed to provide users with an intuitive way of browsing search engine results. The results of an experimental evaluation of the system showed that it has the potential to be practical and effective.
Pages: 765-771
Page count: 7