Language-independent extractive automatic text summarization based on automatic keyword extraction

Cited by: 4
Authors
Hernandez-Castaneda, Angel [1, 2]
Arnulfo Garcia-Hernandez, Rene [2]
Ledeneva, Yulia [2]
Eduardo Millan-Hernandez, Christian [2]
Affiliations
[1] Catedras CONACyT, Ave Insurgentes Sur 1582, Col Credito Constructor 03940, Mexico
[2] Autonomous Univ State Mexico, Inst Literario 100, Col Ctr 50000, Mexico State, Mexico
Keywords
Automatic summarization; Genetic algorithm; Topic modeling; Extractive summaries; Keywords
DOI
10.1016/j.csl.2021.101267
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
This study proposes a language- and domain-independent approach to extractive automatic text summarization (EATS), based on a clustering scheme supported by a genetic algorithm (GA) that searches for an optimal grouping of sentences. The approach also includes a topic modeling algorithm that identifies the key sentences in each cluster from automatically generated keywords. The experimental results show that the system outperforms previous methods through two general steps: clustering, which helps to increase coverage, and the addition of semantic information to the model, which facilitates detection of the key sentences in the clusters and improves precision.
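The two steps named in the abstract (GA-supported sentence clustering, then keyword-driven selection of one key sentence per cluster) can be illustrated with a minimal sketch. It is not the authors' implementation: the fitness function, the GA operators, the parameter values, and every function name here are assumptions made for illustration, and plain term frequency is swapped in for the topic modeling algorithm, which the abstract does not specify.

```python
import math
import random
import re
from collections import Counter


def tokenize(sentence):
    # Lowercased word tokens; no language-specific resources are used.
    return re.findall(r"\w+", sentence.lower())


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def fitness(assignment, vectors, k):
    # Assumed objective: total similarity of each sentence to its cluster centroid.
    centroids = [Counter() for _ in range(k)]
    for label, vec in zip(assignment, vectors):
        centroids[label].update(vec)
    return sum(cosine(vec, centroids[label]) for label, vec in zip(assignment, vectors))


def evolve_clusters(vectors, k, rng, pop_size=30, generations=40, mutation_rate=0.1):
    # Each individual is a list of cluster labels, one per sentence.
    n = len(vectors)
    population = [[rng.randrange(k) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda ind: fitness(ind, vectors, k), reverse=True)
        parents = population[: pop_size // 2]  # truncation selection: keep the best half
        children = []
        while len(parents) + len(children) < pop_size:
            mother, father = rng.sample(parents, 2)
            cut = rng.randrange(1, n) if n > 1 else 0  # one-point crossover
            child = mother[:cut] + father[cut:]
            child = [rng.randrange(k) if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=lambda ind: fitness(ind, vectors, k))


def summarize(sentences, k=2, n_keywords=5, seed=0):
    rng = random.Random(seed)  # seeded so that runs are repeatable
    vectors = [Counter(tokenize(s)) for s in sentences]
    assignment = evolve_clusters(vectors, k, rng)
    # Stand-in for topic modeling (assumption): most frequent tokens longer than 3 characters.
    totals = Counter()
    for vec in vectors:
        totals.update(t for t in vec.elements() if len(t) > 3)
    keywords = {t for t, _ in totals.most_common(n_keywords)}
    chosen = []
    for label in range(k):
        members = [i for i, a in enumerate(assignment) if a == label]
        if members:  # a cluster may end up empty
            chosen.append(max(members, key=lambda i: sum(vectors[i][t] for t in keywords)))
    return [sentences[i] for i in sorted(chosen)]


document = [
    "Genetic algorithms search for a good grouping of sentences.",
    "A genetic search evaluates each grouping with a fitness function.",
    "Topic models produce keywords for every document.",
    "Keywords from topic models point to the key sentences.",
]
summary = summarize(document, k=2)
```

The summary contains at most `k` sentences, returned in their original order. Clustering is meant to spread the selection across different parts of the document, while the keyword score decides which sentence represents each cluster.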
Pages: 15
Related papers
50 in total
  • [1] Chinese Automatic Text Summarization Based on Keyword Extraction
    Jiang Xiao-yu
    [J]. FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009: 225-228
  • [2] Automatic text summarization based on keyword derivation
    Ando, K
    Yamasaki, T
    Shishibori, M
    Aoe, JI
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002: 464-469
  • [3] Automatic Keyword Extraction for Text Summarization in e-Newspapers
    Thomas, Justine Raju
    Bharti, Santosh Kumar
    Babu, Korra Sathya
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016
  • [4] Text Summarization with Automatic Keyword Extraction in Telugu e-Newspapers
    Naidu, Reddy
    Bharti, Santosh Kumar
    Babu, Korra Sathya
    Mohapatra, Ramesh Kumar
    [J]. SMART COMPUTING AND INFORMATICS, 2018, 77: 555-564
  • [5] Automatic Summarization and Keyword Extraction from Web Page or Text File
    You, Xiangdong
    [J]. 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2019: 154-158
  • [6] Evolutionary Algorithms for Extractive Automatic Text Summarization
    Meena, Yogesh Kumar
    Gopalani, Dinesh
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONVERGENCE (ICCC 2015), 2015, 48: 244-249
  • [7] Supervised Automatic Text Summarization of Konkani Texts Using Linear Regression-Based Feature Weighing and Language-Independent Features
    D'Silva, Jovi
    Sharma, Uzzal
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473: 439-457
  • [8] A Novel Optimized Language-Independent Text Summarization Technique
    Mahmoud, Hanan A. Hosni
    Hafez, Alaaeldin M.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73(03): 5121-5136
  • [9] Automatic text summarization based on sentences clustering and extraction
    Zhang Pei-ying
    Li Cun-he
    [J]. 2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 1, 2009: 167-170
  • [10] Optimal Features Set For Extractive Automatic Text Summarization
    Meena, Yogesh Kumar
    Deolia, Peeyush
    Gopalani, Dinesh
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION TECHNOLOGIES ACCT 2015, 2015: 35-40