Language-independent extractive automatic text summarization based on automatic keyword extraction

被引:4
|
作者
Hernandez-Castaneda, Angel [1 ,2 ]
Arnulfo Garcia-Hernandez, Rene [2 ]
Ledeneva, Yulia [2 ]
Eduardo Millan-Hernandez, Christian [2 ]
机构
[1] Catedras CONACyT, Ave Insurgentes Sur 1582, Col Credito Constructor 03940, Mexico
[2] Autonomous Univ State Mexico, Inst Literario 100, Col Ctr 50000, Mexico State, Mexico
来源
关键词
Automatic summarization; Genetic algorithm; Topic modeling; Extractive summaries; Keywords;
D O I
10.1016/j.csl.2021.101267
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study proposes a language and domain independent approach for automatic extractive text summarization (EATS) tasks, which is based on a clustering scheme supported by a genetic algorithm (GA), to find an optimal grouping of sentences. Furthermore, our approach includes a topic modeling algorithm to find the key sentences in clusters based on automatically generated keywords. Our experimental results show that our system outperforms previous methods through the application of two general steps: clustering, which helps to increase coverage, and the addition of semantic information to the model, which facilitates the detection of the key sentences in the clusters and improves precision.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Supporting Data Driven Access through Automatic Keyword Extraction and Summarization
    Xu, Weijia
    Luo, Wei
    Woodward, Nicholas
    Zhang, Yan
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 704 - 707
  • [42] Language-Independent Automatic Evaluation of Intelligibility of Chronically Hoarse Persons
    Haderlein, Tino
    Middag, Catherine
    Martens, Jean-Pierre
    Doellinger, Michael
    Noeth, Elmar
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2014, 66 (06) : 219 - 226
  • [43] Language-Independent Text Lines Extraction Using Seam Carving
    Saabni, Raid
    El-Sana, Jihad
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 563 - 568
  • [44] Efficient Voting-Based Extractive Automatic Text Summarization Using Prominent Feature Set
    Meena, Yogesh Kumar
    Gopalani, Dinesh
    [J]. IETE JOURNAL OF RESEARCH, 2016, 62 (05) : 581 - 590
  • [45] AUTOMATIC TEXT SUMMARIZATION BASED ON TEXTUAL COHESION
    Chen Yanmin Liu Bingquan Wang Xiaolong (Dept of Computer Science and Engineering
    [J]. Journal of Electronics(China), 2007, (03) : 338 - 346
  • [46] Automatic text summarization based on lexical chains
    Chen, YM
    Wang, XL
    Guan, Y
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 947 - 951
  • [47] Automatic Language-Independent Indexing of Documents using Image Processing
    Rait, Aishanou Osha
    Venkatesh, K. S.
    [J]. MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 817 - +
  • [48] An Algebraic Approach for Sentence Based Feature Extraction Applied for Automatic Text Summarization
    Batcha, Nowshath Kadhar
    Aziz, Normaziah Abdul
    [J]. ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 139 - 143
  • [49] Word-sentence co-ranking for automatic extractive text summarization
    Fang, Changjian
    Mu, Dejun
    Deng, Zhenghong
    Wu, Zhiang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 189 - 195
  • [50] Use of Fuzzy Logic and WordNet for Improving Performance of Extractive Automatic Text Summarization
    Yadav, Jyoti
    Meena, Yogesh Kumar
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2071 - 2077