Argument Extraction from News, Blogs, and the Social Web

被引:10
|
作者
Goudas, Theodosis [1 ]
Louizos, Christos [2 ]
Petasis, Georgios [3 ]
Karkaletsis, Vangelis [3 ]
机构
[1] Univ Piraeus, Dept Digital Syst, Athens, Greece
[2] Univ Athens, Dept Informat & Telecommun, Athens, Greece
[3] Natl Ctr Sci Res NCSR Demokritos, Software & Knowledge Engn Lab, Inst Informat & Telecommun, GR-15310 Athens, Greece
关键词
Argument mining; argument extraction; argument matching;
D O I
10.1142/S0218213015400242
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Argument extraction is the task of identifying arguments, along with their components in text. Arguments can be usually decomposed into a claim and one or more premises justifying it. Among the novel aspects of this work is the thematic domain itself which relates to Social Media, in contrast to traditional research in the area, which concentrates mainly on law documents and scientific publications. The huge increase of social media communities, along with their user tendency to debate, makes the identification of arguments in these texts a necessity. Argument extraction from Social Media is more challenging because texts may not always contain arguments, as is the case of legal documents or scientific publications usually studied. In addition, being less formal in nature, texts in Social Media may not even have proper syntax or spelling. This paper presents a two-step approach for argument extraction from social media texts. During the first step, the proposed approach tries to classify the sentences into "sentences that contain arguments" and "sentences that don't contain arguments". In the second step, it tries to identify the exact fragments that contain the premises from the sentences that contain arguments, by utilizing conditional random fields. The results exceed significantly the base line approach, and according to literature, are quite promising.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Argument Extraction from News, Blogs, and Social Media
    Goudas, Theodosis
    Louizos, Christos
    Petasis, Georgios
    Karkaletsis, Vangelis
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 287 - 299
  • [2] Mining on Terms Extraction from Web News
    Hsu, Li-Fu
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT I, 2010, 6421 : 188 - 194
  • [3] Multiple sclerosis research dissemination in the web: news, blogs, or tweets?
    Heydarpour, P.
    Shirazi, A. H.
    Sahraian, M. A.
    MULTIPLE SCLEROSIS JOURNAL, 2017, 23 : 459 - 460
  • [4] Extraction of web news from web pages using a ternary tree approach
    Laishram, Debina
    Sebastian, Merin
    2015 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATION ENGINEERING ICACCE 2015, 2015, : 628 - 633
  • [5] Corporate social responsibility as argument on the web
    Coupland, C
    JOURNAL OF BUSINESS ETHICS, 2005, 62 (04) : 355 - 366
  • [6] Hybrid method for automated news content extraction from the web
    Li, Yu
    Meng, Xiaofeng
    Li, Qing
    Wang, Liping
    WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 327 - 338
  • [7] Corporate Social Responsibility as Argument on the Web
    C Coupland
    Journal of Business Ethics, 2005, 62 : 355 - 366
  • [8] Automatic Extraction of Textual Elements from News Web Pages
    Ibrahim, Hossam
    Darwish, Kareem
    Abdel-sabor, Abdel-Rahim
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1600 - 1603
  • [9] Automated metadata and instance extraction from news Web sites
    Vadrevu, S
    Nagarajan, S
    Gelgi, F
    Davulcu, H
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2005, : 38 - 41
  • [10] Blogs, news and credibility
    Gunter, Barrie
    Campbell, Vincent
    Touri, Maria
    Gibson, Rachel
    ASLIB PROCEEDINGS, 2009, 61 (02): : 185 - 204