An Ensemble Method for the Credibility Assessment of User-Generated Content

被引:4
|
作者
Fontanarava, Julien [1 ]
Pasi, Gabriella [2 ]
Viviani, Marco [2 ]
机构
[1] Ecole Polytech, Route Saclay, F-91128 Palaiseau, France
[2] Univ Milano Bicocca, DISCo, Viale Sarca 336, I-20126 Milan, Italy
关键词
Credibility; Social Web; Social Media; Classification; Ensemble Learning; Text Mining; Language Models; INFORMATION;
D O I
10.1145/3106426.3106464
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Social Web supports and fosters social interactions by means of different social media, which allow the spread of the so called User-Generated Content (UGC). In this context, characterized by the absence of trusted third parties that verify the reliability of the sources and the believability of the content generated, the issue of assessing the credibility of the information diffused by means of social media is receiving increasing attention. In the literature, this issue has been mainly tackled as a classification problem; information is categorized into genuine and fake, usually by implementing or applying classifiers that consider multiple kinds of features (mainly textual and non-textual) to be evaluated in terms of credibility. In this article, unlike prior research, textual features are considered separately with respect to other kinds of features during the classification process. In particular, an Ensemble Method that combines the results produced by two text classifiers and the ones returned by another classifier acting on non-textual features is proposed. This allows to have better results with respect to the use of a single classifier on multiple features together. The effectiveness of the Ensemble Method has been assessed in the context of review sites, by means of a labeled dataset gathered from the Yelp.com site, where on-line reviews are already classified as recommended and not recommended.
引用
收藏
页码:863 / 868
页数:6
相关论文
共 50 条
  • [1] A Supervised Machine Learning Approach for the Credibility Assessment of User-Generated Content
    Jain, Praphula Kumar
    Pamula, Rajendra
    Ansari, Sarfraj
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (04) : 2469 - 2485
  • [2] A Supervised Machine Learning Approach for the Credibility Assessment of User-Generated Content
    Praphula Kumar Jain
    Rajendra Pamula
    Sarfraj Ansari
    [J]. Wireless Personal Communications, 2021, 118 : 2469 - 2485
  • [3] Beyond Text: Multimodal Credibility Assessment Approaches for Online User-Generated Content
    Choudhary, Monika
    Chouhan, Satyendra Singh
    Rathore, Santosh Singh
    [J]. ACM Transactions on Intelligent Systems and Technology, 2024, 15 (05)
  • [4] ScoreTree: A Decentralised Framework for Credibility Management of User-Generated Content
    Liao, Yang
    Harwood, Aaron
    Ramamohanarao, Kotagiri
    [J]. DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, 2011, 6723 : 249 - 256
  • [5] Consumption and Production of User-Generated Content, Credibility, and Political Participation
    Yamamoto, Masahiro
    Nah, Seungahn
    Choung, Hyesun
    [J]. COMMUNICATION STUDIES, 2022, 73 (01) : 1 - 16
  • [6] The Challenge of Improving Credibility of User-Generated Content in Online Social Networks
    Haralabopoulos, Giannis
    Anagnostopoulos, Ioannis
    Zeadally, Sherali
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 7 (03):
  • [7] User-generated content
    Greenfield, David
    [J]. CONTROL ENGINEERING, 2009, 56 (10) : 2 - 2
  • [8] User-generated content
    Wofford, Jennifer
    [J]. NEW MEDIA & SOCIETY, 2012, 14 (07) : 1236 - 1239
  • [9] Application of Aggregation Operators to Assess the Credibility of User-Generated Content in Social Media
    Pasi, Gabriella
    Viviani, Marco
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, IPMU 2018, PT I, 2018, 853 : 342 - 353
  • [10] User-Generated Content Introduction
    Krumm, John
    Davies, Nigel
    Narayanaswami, Chandra
    [J]. IEEE PERVASIVE COMPUTING, 2008, 7 (04) : 10 - 11