Scalable text classification as a tool for personalization

Cited by: 0
Authors
Antonellis, Ioannis [1 ,2 ]
Bouras, Christos [1 ,2 ]
Poulopoulos, Vassilis [1 ,2 ]
Affiliations
[1] Res Acad Comp Technol Inst N Kazantzaki, GR-26500 Patras, Greece
[2] Univ Patras, Comp Engn & Informat Dept, GR-26500 Patras, Greece
Keywords
8.3 Data Mining; Web-based information; 14.1 Information Retrieval; Customization and user profiles; 30.1; Web; Inf. Services on the Web;
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
We consider scalability issues in the text classification problem, in which (multi-)labeled training documents are used to build classifiers that assign documents to classes and permit classification into multiple classes. A new class of classification problems, called 'scalable', is introduced, with applications in web mining. Scalable classification uses newly classified instances to improve the accuracy of future classifications and to capture changes in the semantic representation of different topics. In addition, different similarity classes can be defined, resulting in a 'per-user' classification procedure. Such an approach provides a new methodology for building personalized applications, because the user becomes part of the classification procedure. We explore solutions to the scalable text classification problem and introduce an algorithm that exploits a new text analysis technique, which decomposes documents into the vector representations of their sentences according to the user's expertise. Finally, a web-based personalized news categorization system based on this approach is presented.
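The ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, whose details are not given in this record. It assumes term-frequency vectors per sentence, one accumulated centroid per class, cosine similarity, and a single per-user similarity threshold standing in for the 'per-user' similarity classes. The names `ScalableClassifier`, `sentence_vectors`, `threshold`, and the stopword list are all hypothetical.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"the", "a", "an", "of", "to", "and"}


def sentence_vectors(text):
    """Split text into sentences; return one term-frequency vector per sentence."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    return [
        Counter(t for t in re.findall(r"[a-z]+", s.lower()) if t not in STOPWORDS)
        for s in sentences
    ]


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ScalableClassifier:
    """Hypothetical multi-label classifier that keeps learning from its own output."""

    def __init__(self, threshold):
        self.threshold = threshold  # assumed per-user similarity threshold
        self.centroids = {}  # label -> accumulated term counts

    def train(self, text, labels):
        """Add every sentence vector of the document to each labeled class."""
        for label in labels:
            centroid = self.centroids.setdefault(label, Counter())
            for vec in sentence_vectors(text):
                centroid.update(vec)

    def classify(self, text, learn=True):
        """Assign every class whose best-matching sentence reaches the threshold."""
        vectors = sentence_vectors(text)
        labels = []
        for label, centroid in self.centroids.items():
            score = max((cosine(v, centroid) for v in vectors), default=0.0)
            if score >= self.threshold:
                labels.append(label)
        if learn and labels:
            # Feed the newly classified document back into the matched classes.
            self.train(text, labels)
        return sorted(labels)
```

In this sketch, a user with a lower `threshold` receives more labels per document than a user with a higher one, and each call to `classify` with `learn=True` shifts the class centroids toward recently seen vocabulary.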
Pages: 399-408
Page count: 10
Related papers
  • [1] Benchmarking Scalable Predictive Uncertainty in Text Classification
    Van Landeghem, Jordy
    Blaschko, Matthew
    Anckaert, Bertrand
    Moens, Marie-Francine
    [J]. IEEE ACCESS, 2022, 10 : 43703 - 43737
  • [2] Two scalable algorithms for associative text classification
    Yoon, Yongwook
    Lee, Gary G.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (02) : 484 - 496
  • [3] A Scalable Hybrid Ensemble Model for Text Classification
    Singh, Bharat
    Kushwaha, Nidhi
    Vyas, Om Prakash
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 3148 - 3152
  • [4] Integration of text classification techniques and user modeling for personalization in news services
    Diaz Esteban, Alberto
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2007, (38) : 145 - 146
  • [5] BioClass: A Tool for Biomedical Text Classification
    Romero, R.
    Seara Vieira, A.
    Iglesias, E. L.
    Borrajo, L.
    [J]. 8TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS (PACBB 2014), 2014, 294 : 243 - 251
  • [6] Text classification: A preferred tool for audio file classification
    Rompre, Louis
    Biskri, Ismail
    Meunier, Francois
    [J]. 2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 834 - 839
  • [7] Personalized news categorization through scalable text classification
    Antonellis, I
    Bouras, C
    Poulopoulos, V
    [J]. FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 391 - 401
  • [8] Scalable Multi-Label Arabic Text Classification
    Ahmed, Nizar A.
    Shehab, Mohammed A.
    Al-Ayyoub, Mahmoud
    Hmeidi, Ismail
    [J]. 2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2015, : 212 - 217
  • [9] Semi-Automatic Image Personalization Tool for Variable Text Insertion and Replacement
    Ding, Hengzhou
    Bala, Raja
    Fan, Zhigang
    Eschbach, Reiner
    Bouman, Charles A.
    Allebach, Jan P.
    [J]. IMAGING AND PRINTING IN A WEB 2.0 WORLD; AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS IV, 2010, 7540
  • [10] A Scalable Text Classification Using Naive Bayes with Hadoop Framework
    Temesgen, Mulualem Mheretu
    Lemma, Dereje Teferi
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR DEVELOPMENT FOR AFRICA (ICT4DA 2019), 2019, 1026 : 291 - 300