Organizing WWW images based on the analysis of page layout and web link structure

被引:0
|
作者
Cai, D [1 ]
He, XF [1 ]
Ma, WY [1 ]
Wen, JR [1 ]
Zhang, HJ [1 ]
机构
[1] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper describes a method for clustering and embedding WWW images. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By extracting the page-to-block, block-to-image, block-to-page relationships through link structure and page layout analysis, we construct an image graph. With the image graph model, we use techniques from spectral graph theory for image clustering and embedding. Some experimental results are given in the paper.
引用
收藏
页码:113 / 116
页数:4
相关论文
共 50 条
  • [21] THESUS: Organizing Web document collections based on link semantics
    Maria Halkidi
    Benjamin Nguyen
    Iraklis Varlamis
    Michalis Vazirgiannis
    The VLDB Journal, 2003, 12 : 320 - 332
  • [22] Web page analysis: Experiments based on web patterns
    Klos, Karel
    Kocibova, Jana
    Lehecka, Ondrej
    Kudelka, Milos
    Snasel, Vaclav
    Rezankova, Hana
    2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 655 - +
  • [23] SimiLay: A Developing Web Page Layout Based Visual Similarity Search Engine
    Bozkir, Ahmet Selman
    Sezer, Ebru Akcapinar
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 457 - 470
  • [24] Study on Usability of Agricultural Product Web Page Layout Based on Eye Tracker
    Liu lulu
    He Xiangzhen
    Wan Fucheng
    Xiong Zhangyuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INFORMATION ENGINEERING (ICACIE), 2016, 64 : 78 - 82
  • [25] Computer Vision-based Analysis of Web Page Structure for Assistive Interfaces
    Cormier, Michael
    13TH WEB FOR ALL CONFERENCE MONTREAL, CANADA 2016, 2016,
  • [26] Web Page Content Extraction Method Based on Link Density and Statistic
    Pan, Donghua
    Qiu, Shaogang
    Yin, Dawei
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11452 - 11455
  • [27] Graph theory application and web page ranking for website link structure improvement
    Abedin, Babak
    Sohrabi, Babak
    BEHAVIOUR & INFORMATION TECHNOLOGY, 2009, 28 (01) : 63 - 72
  • [28] An approach of page layout analysis based on active contour model
    Liu, DR
    Guo, BL
    Tian, XD
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1711 - 1714
  • [29] Automatic Summarization of Web Page Based on Statistics and Structure
    Zheng, Shuangyi
    Yu, Junyang
    KNOWLEDGE DISCOVERY AND DATA MINING, 2012, 135 : 643 - +
  • [30] Web Page Classification Method Based on Semantics and Structure
    Li, Huaxin
    Zhang, Zhaoxin
    Xu, Yongdong
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 238 - 243