Language features in extractive summarization: Humans Vs. Machines

Cited by: 4
Authors
Arroyo-Fernandez, Ignacio [1 ]
Curiel, Arturo [2 ]
Mendez-Cruz, Carlos-Francisco [3 ]
Affiliations
[1] Univ Nacl Autonoma Mexico, Ciudad Univ, Mexico City, DF, Mexico
[2] Univ Veracruzana, CONACYT, Fac Estadist & Informat, Ave Xalapa Esq Manuel Avila Camacho S-N, Xalapa 91020, Veracruz, Mexico
[3] Univ Nacl Autonoma Mexico, Ctr Ciencias Genom, Ave Univ S-N, Cuernavaca 62100, Morelos, Mexico
Keywords
Automatic text summarization; Statistical feature analysis; Natural language processing; Artificial intelligence; RELEVANCE CRITERIA;
DOI
10.1016/j.knosys.2019.05.014
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a comparative statistical analysis of the language features most commonly used for Automatic Text Summarization (ATS), namely: Parts of Speech (PoS) (unigrams and bigrams), sentiments (by token and sentence), and Rhetorical Structure Theory (RST) relations. The analyses were carried out on both human-made and machine-made summaries, in order to determine whether current ATS systems capture the same kind of information as humans do. Our results show that there are some marked differences between machine-made and human-made summaries, which at times may seem counterintuitive. For instance, named entities were usually frequent in machine-made summaries, but not in human-made ones. Similarly, words perceived to hold a "neutral" sentiment were systematically favored by machines, but not always by humans. (C) 2019 Elsevier B.V. All rights reserved.
Pages: 1-11
Number of pages: 11
Related papers
50 in total
  • [41] Features in extractive supervised single-document summarization: case of Persian news
    Rezaei, Hosein
    Mirhosseini, Seyed Amid Moeinzadeh
    Shahgholian, Azar
    Saraee, Mohamad
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [42] The Impact of ASR on Abstractive vs. Extractive Meeting Summaries
    Murray, Gabriel
    Carenini, Giuseppe
    Ng, Raymond
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1688 - 1691
  • [43] Significance of Learner Dependent Features for Improving Text Readability using Extractive Summarization
    Nandhini, K.
    Balasundaram, S. R.
    [J]. 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012), 2012,
  • [44] DETECTION OF TV IMAGERY BY HUMANS VS MACHINES
    THOMAS, CE
    TISDALE, G
    SPINK, T
    KAHN, A
    [J]. APPLIED OPTICS, 1972, 11 (05): : 1047 - &
  • [45] Ultimatum bargaining: Algorithms vs. Humans
    Ozkes, Ali I.
    Hanaki, Nobuyuki
    Vanderelst, Dieter
    Willems, Jurgen
    [J]. ECONOMICS LETTERS, 2024, 244
  • [46] Appetitive vs. Aversive conditioning in humans
    Andreatta, Marta
    Pauli, Paul
    [J]. FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2015, 9
  • [47] AVERSIVE VS. APPETITIVE CONDITIONING IN HUMANS
    Andreatta, Marta
    Pauli, Paul
    [J]. PSYCHOPHYSIOLOGY, 2014, 51 : S64 - S64
  • [48] Vision and grasping: Humans vs. robots
    Chinellato, E
    del Pobil, AP
    [J]. MECHANISMS, SYMBOLS AND MODELS UNDERLYING COGNITION, PT 1, PROCEEDINGS, 2005, 3561 : 366 - 375
  • [49] Risks in features vs. assurance
    Acar, T
    Michener, JR
    [J]. COMMUNICATIONS OF THE ACM, 2002, 45 (08) : 112 - 112
  • [50] A Position-Aware Language Modeling Framework for Extractive Broadcast News Speech Summarization
    Liu, Shih-Hung
    Chen, Kuan-Yu
    Hsieh, Yu-Lun
    Chen, Berlin
    Wang, Hsin-Min
    Yen, Hsu-Chun
    Hsu, Wen-Lian
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2017, 16 (04)