Documents, Topics, and Authors: Text Mining of Online News

被引:0
|
作者
Sertkan, Mete [1 ]
Neidhardt, Julia [1 ]
Werthner, Hannes [1 ]
机构
[1] TU Wien, Res Unit ECommerce, Vienna, Austria
关键词
Recommender Systems; Online News; Text Mining; Topic Modelling; Co-occurence Networks;
D O I
10.1109/CBI.2019.00053
中图分类号
F [经济];
学科分类号
02 ;
摘要
The goal of recommender systems is, in essence, to help people to discover items they might like, i.e., items that fit their preferences, personality, and needs. Depending on the respective domain, those items can be books, movies, music, hotels, and much more. Typically, recommendations are based on past user interactions (e.g., movies a user saw, hotels a user booked, etc.). This work in progress paper focuses on news recommender systems. Because of the nature of news (e.g., constantly new items, short item lifetime, etc.), recommendations based on past interactions are especially hard to make. Hence, news recommender systems heavily rely on the actual content of news. While previous work mainly considers one aspect of the content of news articles, we jointly analyse and discuss in this work a given corpora of news articles on three different levels (i.e., document-level, topic-level, and author-level). The overall aim is to set to provide the basis for a comprehensive news recommender system, which reaches beyond accuracy and considers also diversity and serendipity. We demonstrate that relevant information can be extracted out of a given corpora, and differences in author, time, and topic can be shown. Furthermore, the author-level analysis shows that documents can be clustered based on the writing style of authors. Finally, our findings show that author-level analysis has the potential to recommend the most diverse items compared to the other approaches.
引用
收藏
页码:405 / 413
页数:9
相关论文
共 50 条
  • [1] Hierarchical classification in text mining for sentiment analysis of online news
    Jinyan Li
    Simon Fong
    Yan Zhuang
    Richard Khoury
    Soft Computing, 2016, 20 : 3411 - 3420
  • [2] Hierarchical classification in text mining for sentiment analysis of online news
    Li, Jinyan
    Fong, Simon
    Zhuang, Yan
    Khoury, Richard
    SOFT COMPUTING, 2016, 20 (09) : 3411 - 3420
  • [3] Analysis of Online News Coverage on Earthquakes Through Text Mining
    Camilleri, Stephen
    Agius, Matthew R.
    Azzopardi, Joel
    FRONTIERS IN EARTH SCIENCE, 2020, 8
  • [4] Automated Mining of Relevant N-grams in Relation to Predominant Topics of Text Documents
    Zizka, Jan
    Darena, Frantisek
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 461 - 469
  • [5] Topics Discovery in Text Mining
    Correia, Anacleto
    Goncalves, Antonio
    RECENT ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2017, 569 : 251 - 256
  • [6] Text Mining of Online News and Social Data About Chatbot Service
    Jeong, Yunjik
    Suk, Jaehye
    Hong, Jihyung
    Kim, Dongmin
    Kim, Kee Ok
    Hwang, Hyesun
    HCI INTERNATIONAL 2018 - POSTERS' EXTENDED ABSTRACTS, PT I, 2018, 850 : 429 - 434
  • [7] Factors for diffusion of autonomous vehicles technology: text mining of online news
    Silva, Joao Paulo Nascimento
    Lima, Paulo de Oliveira
    Gruetzmann, Andre
    Antunes, Luiz Guilherme R.
    Pedrosa, Gabriel
    Oliveira, Cledison Carlos
    Sugano, Joel Yutaka
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY AND MANAGEMENT, 2022, 22 (04) : 424 - 449
  • [8] Mining coherent topics in documents using word embeddings and large-scale text data
    Yao, Liang
    Zhang, Yin
    Chen, Qinfei
    Qian, Hongze
    Wei, Baogang
    Hu, Zhifeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 64 : 432 - 439
  • [9] Protecting Sensitive Topics in Text Documents with PROTEXTOR
    Cumby, Chad
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 714 - 717
  • [10] Mining online text
    Knight, K
    COMMUNICATIONS OF THE ACM, 1999, 42 (11) : 58 - 61