Organizing WWW images based on the analysis of page layout and web link structure

被引:0
|
作者
Cai, D [1 ]
He, XF [1 ]
Ma, WY [1 ]
Wen, JR [1 ]
Zhang, HJ [1 ]
机构
[1] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper describes a method for clustering and embedding WWW images. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By extracting the page-to-block, block-to-image, block-to-page relationships through link structure and page layout analysis, we construct an image graph. With the image graph model, we use techniques from spectral graph theory for image clustering and embedding. Some experimental results are given in the paper.
引用
收藏
页码:113 / 116
页数:4
相关论文
共 50 条
  • [1] Clustering and searching WWW images using link and page layout analysis
    He, Xiaofei
    Cai, Deng
    Wen, Ji-Rong
    Ma, Wei-Ying
    Zhang, Hong-Jiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2007, 3 (02)
  • [2] Web page scoring based on link analysis of web page sets
    Nakakubo, Hitoshi
    Nakajima, Shinsuke
    Hatano, Kenji
    Miyazaki, Jun
    Uemura, Shunsuke
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 269 - +
  • [3] An adaptive web page layout structure for small devices
    Xing Xie
    Chong Wang
    Li-Qun Chen
    Wei-Ying Ma
    Multimedia Systems, 2005, 11 : 34 - 44
  • [4] An adaptive web page layout structure for small devices
    Xie, X
    Wang, C
    Chen, LQ
    Ma, WY
    MULTIMEDIA SYSTEMS, 2005, 11 (01) : 34 - 44
  • [5] A Web Spam Link Detection Method Based on Web Page Structure and Text Features
    Yang W.
    Jiang Y.-H.
    Zhang S.-F.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2020, 41 (08): : 1091 - 1096
  • [6] Retrieval of document images based on page layout similarity
    Naveen
    Guru, D. S.
    ADAPTIVE MULTIMEDIA RETRIEVAL: USER, CONTEXT, AND FEEDBACK, 2007, 4398 : 136 - +
  • [7] Exploiting link structure for web page genre identification
    Jia Zhu
    Qing Xie
    Shoou-I Yu
    Wai Hung Wong
    Data Mining and Knowledge Discovery, 2016, 30 : 550 - 575
  • [8] Exploiting link structure for web page genre identification
    Zhu, Jia
    Xie, Qing
    Yu, Shoou-I
    Wong, Wai Hung
    DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (03) : 550 - 575
  • [9] Relating Web characteristics with link based Web page ranking
    Baeza-Yates, R
    Castillo, C
    EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2001, : 21 - 32
  • [10] Methods for Automatic Web Page Layout Testing and Analysis: A Review
    Prazina, Irfan
    Becirovic, Seila
    Cogo, Emir
    Okanovic, Vensada
    IEEE ACCESS, 2023, 11 : 13948 - 13964