Mining user-generated comments

被引:1
|
作者
Subercaze, Julien [1 ]
Gravier, Christophe
Laforest, Frederique
机构
[1] Univ Lyon, F-42023 St Etienne, France
关键词
D O I
10.1109/WI-IAT.2015.138
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Social-media websites, such as newspapers, blogs, and forums, are the main places of generation and exchange of user-generated comments. These comments are viable sources for opinion mining, descriptive annotations and information extraction. User-generated comments are formatted using a HTML template, they are therefore entwined with the other information in the HTML document Their unsupervised extraction is thus a taxing issue even greater when considering the extraction of nested answers by different users. This paper presents a novel technique (CommentsMiner) for unsupervised users comments extraction. Our approach uses both the theoretical framework of frequent subtree mining and data extraction techniques. We demonstrate that the comment mining task can be modelled as a constrained closed induced subtree mining problem followed by a learning-to-rank problem. Our experimental evaluations show that Comment sMiner solves the plain comments and nested comments extraction problems for 84% of a representative and accessible dataset, while outperforming existing baselines techniques.
引用
收藏
页码:45 / 52
页数:8
相关论文
共 50 条
  • [1] Argumentation Mining in User-Generated Web Discourse
    Habernal, Ivan
    Gurevych, Iryna
    [J]. COMPUTATIONAL LINGUISTICS, 2017, 43 (01) : 125 - 179
  • [2] Categorizing Quality Determinants in Mining User-Generated Contents
    Barravecchia, Federico
    Mastrogiacomo, Luca
    Franceschini, Fiorenzo
    [J]. SUSTAINABILITY, 2020, 12 (23) : 1 - 12
  • [3] Mining User-generated Content of Mobile Patient Portal
    Al-Ramahi, Mohammad
    Noteboom, Cherie
    [J]. ACM Transactions on Social Computing, 2020, 3 (03)
  • [4] Temporal pattern mining from user-generated content
    Ali, Adnan
    Li, Jinlong
    Chen, Huanhuan
    Bashir, Ali Kashif
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (06) : 1027 - 1039
  • [5] Get into the spirit of a location by mining user-generated travelogues
    Zhu, Zhu
    Shou, Lidan
    Chen, Ke
    [J]. NEUROCOMPUTING, 2016, 204 : 61 - 69
  • [6] Analysis of User-Generated Comments on Rumor Correction YouTube Videos
    Majid, Gilang Maulana
    Pal, Anjan
    Wardani, Siska Premida
    Banerjee, Snehasish
    [J]. Proceedings of the 2021 15th International Conference on Ubiquitous Information Management and Communication, IMCOM 2021, 2021,
  • [7] Analysis of User-Generated Comments on Rumor Correction YouTube Videos
    Majid, Gilang Maulana
    Pal, Anjan
    Wardani, Siska Premida
    Banerjee, Snehasish
    [J]. PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [8] Mining Opinions in User-Generated Contents to Improve Course Evaluation
    El-Halees, Alaa
    [J]. SOFTWARE ENGINEERING AND COMPUTER SYSTEMS, PT 2, 2011, 180 : 107 - 115
  • [9] Mining User-generated Path Traversal Patterns in an Information Network
    Takes, Frank W.
    Kosters, Walter A.
    [J]. 2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2013, : 284 - 289
  • [10] Introduction to the Special Section on Search and Mining User-Generated Content
    Carlos Cortizo, Jose
    Carrero, Francisco
    Cantador, Ivan
    Antonio Troyano, Jose
    Rosso, Paolo
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (04)