Language features in extractive summarization: Humans Vs. Machines

被引:4
|
作者
Arroyo-Fernandez, Ignacio [1 ]
Curiel, Arturo [2 ]
Mendez-Cruz, Carlos-Francisco [3 ]
机构
[1] Univ Nacl Autonoma Mexico, Ciudad Univ, Mexico City, DF, Mexico
[2] Univ Veracruzana, CONACYT, Fac Estadist & Informat, Ave Xalapa Esq Manuel Avila Camacho S-N, Xalapa 91020, Veracruz, Mexico
[3] Univ Nacl Autonoma Mexico, Ctr Ciencias Genom, Ave Univ S-N, Cuernavaca 62100, Morelos, Mexico
关键词
Automatic text summarization; Statistical feature analysis; Natural language processing; Artificial intelligence; RELEVANCE CRITERIA;
D O I
10.1016/j.knosys.2019.05.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a comparative statistical analysis of the language features most commonly used for Automatic Text Summarization (ATS), namely: Parts of Speech (PoS) (unigrams and bigrams), sentiments (by token and sentence), and Rhetorical Structure Theory (RTS) relations. The analyses were carried out on both human-made and machine-made summaries, in order to determine whether current ATS systems capture the same kind of information as humans do. Our results show that there are some marked differences between machine and human-made summaries, which at times may seem counterintuitive. For instance, named entities were usually frequent in machine-made summaries, but not in human-made ones. Similarly, words perceived to hold a "neutral" sentiment were systematically favored by machines, but not always by humans. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] Abstractive vs. Extractive Summarization: An Experimental Review
    Giarelis, Nikolaos
    Mastrokostas, Charalampos
    Karacapilidis, Nikos
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [2] Humans vs. Machines in Malware Classification
    Aonzo, Simone
    Han, Yufei
    Mantovani, Alessandro
    Balzarotti, Davide
    [J]. PROCEEDINGS OF THE 32ND USENIX SECURITY SYMPOSIUM, 2023, : 1145 - 1162
  • [3] Extractive Summarization for Myanmar Language
    Lwin, Soe Soe
    Nwet, Khin Thandar
    [J]. 2018 INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2018), 2018, : 138 - 143
  • [4] Extractive Text Summarization Models for Urdu Language
    Nawaz, Ali
    Bakhtyar, Maheen
    Baber, Junaid
    Ullah, Ihsan
    Noor, Waheed
    Basit, Abdul
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [5] Extractive Text Summarization Using Topological Features
    Kumar, Ankit
    Sarkar, Apurba
    [J]. COMBINATORIAL IMAGE ANALYSIS, IWCIA 2022, 2023, 13348 : 105 - 121
  • [6] Integrating Prosodic Features in Extractive Meeting Summarization
    Xie, Shasha
    Hakkani-Tuer, Dilek
    Favre, Benoit
    Liu, Yang
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 387 - +
  • [7] Visual sense of number vs. sense of magnitude in humans and machines
    Alberto Testolin
    Serena Dolfi
    Mathijs Rochus
    Marco Zorzi
    [J]. Scientific Reports, 10
  • [8] Visual sense of number vs. sense of magnitude in humans and machines
    Testolin, Alberto
    Dolfi, Serena
    Rochus, Mathijs
    Zorzi, Marco
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [9] HUMANS VS MACHINES
    BUTLIN, DM
    [J]. BYTE, 1992, 17 (04): : 18 - 18
  • [10] Optimal Features Set For Extractive Automatic Text Summarization
    Meena, Yogesh Kumar
    Deolia, Peeyush
    Gopalani, Dinesh
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION TECHNOLOGIES ACCT 2015, 2015, : 35 - 40