Argument Extraction from News, Blogs, and the Social Web

被引:10
|
作者
Goudas, Theodosis [1 ]
Louizos, Christos [2 ]
Petasis, Georgios [3 ]
Karkaletsis, Vangelis [3 ]
机构
[1] Univ Piraeus, Dept Digital Syst, Athens, Greece
[2] Univ Athens, Dept Informat & Telecommun, Athens, Greece
[3] Natl Ctr Sci Res NCSR Demokritos, Software & Knowledge Engn Lab, Inst Informat & Telecommun, GR-15310 Athens, Greece
关键词
Argument mining; argument extraction; argument matching;
D O I
10.1142/S0218213015400242
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Argument extraction is the task of identifying arguments, along with their components in text. Arguments can be usually decomposed into a claim and one or more premises justifying it. Among the novel aspects of this work is the thematic domain itself which relates to Social Media, in contrast to traditional research in the area, which concentrates mainly on law documents and scientific publications. The huge increase of social media communities, along with their user tendency to debate, makes the identification of arguments in these texts a necessity. Argument extraction from Social Media is more challenging because texts may not always contain arguments, as is the case of legal documents or scientific publications usually studied. In addition, being less formal in nature, texts in Social Media may not even have proper syntax or spelling. This paper presents a two-step approach for argument extraction from social media texts. During the first step, the proposed approach tries to classify the sentences into "sentences that contain arguments" and "sentences that don't contain arguments". In the second step, it tries to identify the exact fragments that contain the premises from the sentences that contain arguments, by utilizing conditional random fields. The results exceed significantly the base line approach, and according to literature, are quite promising.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] News from the Extraction
    Hlawitschka, Mark W.
    CHEMIE INGENIEUR TECHNIK, 2015, 87 (12) : 1660 - 1661
  • [22] Dependency Trigram Model for Social Relation Extraction from News Articles
    Choi, Maengsik
    Kim, Harksoo
    Croft, Bruce W.
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1047 - 1048
  • [23] POLYPHONET: An advanced social network extraction system from the Web
    Matsuo, Yutaka
    Mori, Junichiro
    Hamasaki, Masahiro
    Nishimura, Takuichi
    Takeda, Hideaki
    Hasida, Koiti
    Ishizuka, Mitsuru
    JOURNAL OF WEB SEMANTICS, 2007, 5 (04): : 262 - 278
  • [24] Agent Reasoning with Semantic Web in Web Blogs
    Dinh Que Tran
    Tuan Nha Hoang
    INTELLIGENT AGENTS AND MULTI-AGENT SYSTEMS, PROCEEDINGS, 2008, 5357 : 389 - 396
  • [25] Automatic Web News Extraction Using Blocking Tag
    Lin Ziyi
    Shen Beijun
    Tang Xinhuai
    Chen Delai
    2009 SECOND INTERNATIONAL CONFERENCE ON MACHINE VISION, PROCEEDINGS, ( ICMV 2009), 2009, : 74 - +
  • [26] A novel chinese web news source extraction algorithm
    Liu Z.
    Liu L.
    Journal of Convergence Information Technology, 2011, 6 (08) : 99 - 106
  • [27] News item extraction for text mining in web newspapers
    Norvåg, K
    Oyri, R
    International Workshop on Challenges in Web Information Retrieval and Integration, Proceedings, 2005, : 195 - 204
  • [28] No place for news in social network web sites?
    Thelwall, Mike
    ONLINE INFORMATION REVIEW, 2008, 32 (06) : 726 - 744
  • [29] TESTING THE REVENUE DIVERSITY ARGUMENT ON INDEPENDENT WEB-NATIVE NEWS VENTURES
    Massey, Brian L.
    DIGITAL JOURNALISM, 2018, 6 (10) : 1333 - 1348
  • [30] News Search, Blogs and Feeds: A Toolkit
    Wiley, Deborah Lynne
    ONLINE, 2011, 35 (03): : 62 - 62