Robin: Extracting visual and textual features from web pages

被引:0
|
作者
Oka, M [1 ]
Tsukada, H [1 ]
Kato, K [1 ]
机构
[1] Univ Tsukuba, Tsukuba, Ibaraki 305, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web pages contain information in several forms. These include textual information such as words and visual information such as images, use of color, and layout. We propose a method of extracting the characteristic features from both the textual and visual information in Web pages. Our method enables seamless integration of the two types of information and automatic extraction of their characteristic features. Based on this method, we developed a proof-of-concept system called Robin, which is designed to provide users with an intuitive way of browsing search engine results. The results of an experimental evaluation of the system showed that it has the potential to be practical and effective.
引用
收藏
页码:765 / 771
页数:7
相关论文
共 50 条
  • [31] Visual Summarization of Web Pages
    Jiao, Binxing
    Yang, Linjun
    Xu, Jizheng
    Wu, Feng
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 499 - 506
  • [32] Visual Similarity of Web Pages
    Kudelka, Milos
    Takama, Yasufumi
    Snasel, Vaclav
    Klos, Karel
    Pokorny, Jaroslav
    ADVANCES IN INTELLIGENT WEB MASTERING-2, PROCEEDINGS, 2010, 67 : 135 - +
  • [33] Web pages aesthetic evaluation using low-level visual features
    Mirdehghani, Maryam
    Monadjemi, S. Amirhassan
    World Academy of Science, Engineering and Technology, 2009, 37 : 811 - 814
  • [34] Title identification of web article pages using HTML']HTML and visual features
    Fan, Jian
    Luo, Ping
    Joshi, Parag
    IMAGING AND PRINTING IN A WEB 2.0 WORLD II, 2011, 7879
  • [35] Extracting Visual Knowledge from the Web with Multimodal Learning
    Gong, Dihong
    Wang, Daisy Zhe
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1718 - 1724
  • [36] NEIL: Extracting Visual Knowledge from Web Data
    Chen, Xinlei
    Shrivastava, Abhinav
    Gupta, Abhinav
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1409 - 1416
  • [37] Learning page-independent heuristics for extracting data from Web pages
    Cohen, William W.
    Fan, Wei
    Computer Networks, 1999, 31 (11): : 1641 - 1652
  • [38] Extracting Content for News Web Pages based on DOM
    Geng, Hua
    Gao, Qiang
    Pan, Jingui
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (02): : 124 - 129
  • [39] Improving the web text content by extracting significant pages into a Web Site
    Ríos, SA
    Velásquez, JD
    Vera, ES
    Yasuda, H
    Aoki, T
    5th International Conference on Intelligent Systems Design and Applications, Proceedings, 2005, : 32 - 36
  • [40] Learning page-independent heuristics for extracting data from Web pages
    Cohen, WW
    Fan, W
    COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 1999, 31 (11-16): : 1641 - 1652