A Multi-label and Adaptive Genre Classification of Web Pages

被引:5
|
作者
Jebari, Chaker [1 ]
Wani, M. Arif [2 ]
机构
[1] Fac Sci, Comp Sci Dept, Tunis, Tunisia
[2] Calif State Univ Backersfield, Comp & Elect Engn & Comp Sci Dept, Backersfield, CA USA
关键词
Multi-label; classification; genre; centroid; adaptive;
D O I
10.1109/ICMLA.2012.106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new centroid-based approach to classify web pages by genre using character n-grams extracted from different information sources such as URL, title, headings and anchors. To deal with the complexity of web pages and the rapid evolution of web genres, our approach implements a multi-label and adaptive classification scheme in which web pages are classified one by one and each web page can affect more than one genre. According to the similarity between the new page and each genre centroid, our approach either adapts the genre centroid under consideration or considers the new page as noise page and discards it. The experiment results show that our approach is very fast and achieves better results than existing multi-label classifiers.
引用
收藏
页码:578 / 581
页数:4
相关论文
共 50 条
  • [1] Multi-Label Genre Classification of Web Pages Using an Adaptive Centroid-Based Classifier
    Jebari, Chaker
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2016, 15 (01)
  • [2] A Combination based on OWA Operators for Multi-label Genre Classification of web pages
    Jebari, Chaker
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (54): : 13 - 20
  • [3] An Improved Centroid-based Approach for Multi-label Classification of Web Pages by Genre
    Jebari, Chaker
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 889 - 890
  • [4] Web Genre Classification via Hierarchical Multi-label Classification
    Madjarov, Gjorgji
    Vidulin, Vedrana
    Dimitrovski, Ivica
    Kocev, Dragi
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 9 - 17
  • [5] Detecting Musical Genre Borders For Multi-label Genre Classification
    Nakamura, Hiroki
    Huang, Hung-Hsuan
    Kawagoe, Kyoji
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 532 - 533
  • [6] A multimodal approach for multi-label movie genre classification
    Rafael B. Mangolin
    Rodolfo M. Pereira
    Alceu S. Britto
    Carlos N. Silla
    Valéria D. Feltrim
    Diego Bertolini
    Yandre M. G. Costa
    Multimedia Tools and Applications, 2022, 81 : 19071 - 19096
  • [7] A multimodal approach for multi-label movie genre classification
    Mangolin, Rafael B.
    Pereira, Rodolfo M.
    Britto, Alceu S., Jr.
    Silla, Carlos N., Jr.
    Feltrim, Valeria D.
    Bertolini, Diego
    Costa, Yandre M. G.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19071 - 19096
  • [8] Multi-label movie genre classification based on multimodal fusion
    Cai, Zihui
    Ding, Hongwei
    Wu, Jinlu
    Xi, Ying
    Wu, Xuemeng
    Cui, Xiaohui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36823 - 36840
  • [9] Evaluating multimodal strategies for multi-label movie genre classification
    Paulino, Marco Aurelio D.
    Costa, Yandre M. G.
    Feltrim, Valeria D.
    2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,
  • [10] Discriminative Adaptive Sets for Multi-Label Classification
    Ghani, Muhammad Usman
    Rafi, Muhammad
    Tahir, Muhammad Atif
    IEEE ACCESS, 2020, 8 : 227579 - 227595