Determining the most representative image on a Web page

Cited by: 7
Authors
Vyas, Krishna [1]
Frasincar, Flavius [1]
Affiliations
[1] Erasmus Univ, POB 1738, NL-3000 DR Rotterdam, Netherlands
Keywords
Image search; Representative image; Support vector machines; Feature selection; VECTOR MACHINES; RETRIEVAL; CLASSIFICATION
DOI
10.1016/j.ins.2019.10.045
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
We investigate how to determine the most representative image on a Web page. This problem has not been thoroughly investigated and, to date, only expert-based algorithms have been proposed in the literature. We attempt to improve the performance of known algorithms using Support Vector Machines (SVMs). In addition, our algorithm distinguishes itself from the existing literature by introducing novel image features, including previously unused meta-data protocols. We also design and test a less restrictive ranking methodology in the image preprocessing stage of our algorithm. We find that applying the SVM framework with our improved classification methodology increases the F1 score from 27.2% to 38.5% compared with a state-of-the-art method. After introducing novel image features and applying backward feature selection, the F1 score rises to 40.0%. Lastly, we use a class-weighted SVM to address the imbalance in the number of representative images. This final modification improves the classification performance of our algorithm even further to 43.9%, outperforming our benchmark algorithms, including those of Facebook and Google. Suggested beneficiaries are the search engine and image retrieval communities, as well as the commercial sector, owing to the superior performance. (C) 2019 Elsevier Inc. All rights reserved.
Pages: 1234-1248
Page count: 15
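
The abstract reports results as F1 scores and mentions a class-weighted SVM to handle the small share of representative images. The following is a minimal sketch of those two ideas only, not the authors' implementation: the function names, the toy labels, and the "balanced" weighting heuristic (total samples divided by the number of classes times the class count) are assumptions chosen for illustration, and the paper's actual weighting scheme is not given in this record.

```python
from collections import Counter


def f1_score(y_true, y_pred, positive=1):
    """F1 score for the positive class (here: 'representative image')."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def balanced_class_weights(labels):
    """Assumed heuristic: weight = n_samples / (n_classes * class_count).

    Rare classes get larger weights, so errors on them cost more
    when the weights are passed to a class-weighted classifier.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


# Hypothetical page with 10 candidate images, only one of them representative.
y_true = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
# Hypothetical classifier output: finds the right image plus one false alarm.
y_pred = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

weights = balanced_class_weights(y_true)
score = f1_score(y_true, y_pred)
print(weights, round(score, 4))
```

With one representative image among ten candidates, the positive class receives a weight nine times that of the negative class under this heuristic, which illustrates why class weighting can matter for such imbalanced data.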