Automatic Persian Text Summarization Using Linguistic Features from Text Structure Analysis

被引:0
|
作者
Heidary, Ebrahim [1 ]
Parvin, Hamid [2 ,3 ,4 ]
Nejatian, Samad [5 ,6 ]
Bagherifard, Karamollah [1 ,6 ]
Rezaie, Vahideh [6 ,7 ]
机构
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[3] Duy Tan Univ, Fac Informat Technol, Da Nang 550000, Vietnam
[4] Islamic Azad Univ, Nourabad Mamasani Branch, Dept Comp Sci, Mamasani, Iran
[5] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[6] Islamic Azad Univ, Yasooj Branch, Young Res & Elite Club, Yasuj, Iran
[7] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 69卷 / 03期
关键词
Natural language processing; extractive summarization; linguistic feature; text structure analysis;
D O I
10.32604/cmc.2021.014361
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the remarkable growth of textual data sources in recent years, easy, fast, and accurate text processing has become a challenge with significant payoffs. Automatic text summarization is the process of compressing text documents into shorter summaries for easier review of its core contents, which must be done without losing important features and information. This paper introduces a new hybrid method for extractive text summarization with feature selection based on text structure. The major advantage of the proposed summarization method over previous systems is the modeling of text structure and relationship between entities in the input text, which improves the sentence feature selection process and leads to the generation of unambiguous, concise, consistent, and coherent summaries. The paper also presents the results of the evaluation of the proposed method based on precision and recall criteria. It is shown that the method produces summaries consisting of chains of sentences with the aforementioned characteristics from the original text.
引用
收藏
页码:2845 / 2861
页数:17
相关论文
共 50 条
  • [1] Automatic Text Summarization with Statistical and Linguistic Features using Successive Thresholds
    PadmaLahari, E.
    Kumar, D. V. N. Siva
    Prasad, S. Shiva
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 1519 - 1524
  • [2] Psychological Features for Automatic Text Summarization
    Losada, David E.
    Parapar, Javier
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2017, 25 : 129 - 149
  • [3] Persian Text Summarization Using Fractal Theory
    Tofighy, Mohsen
    Kashefi, Omid
    Zamanifar, Azadeh
    Javadi, Hamid Haj Seyyed
    [J]. INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II, 2011, 252 : 651 - +
  • [4] The Impact of Features and Preprocessing on Automatic Text Summarization
    Bal, Salih
    Sora Gunal, Efnan
    [J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2022, 25 (02): : 117 - 132
  • [5] Automatic Text Summarization Using Latent Semantic Analysis
    Mashechkin, I. V.
    Petrovskiy, M. I.
    Popov, D. S.
    Tsarev, D. V.
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2011, 37 (06) : 299 - 305
  • [6] Automatic text summarization using latent semantic analysis
    I. V. Mashechkin
    M. I. Petrovskiy
    D. S. Popov
    D. V. Tsarev
    [J]. Programming and Computer Software, 2011, 37 : 299 - 305
  • [7] Persian Automatic Text Summarization Based on Named Entity Recognition
    Khademi, Mohammad Ebrahim
    Fakhredanesh, Mohammad
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2020,
  • [8] Structural Features for Predicting the Linguistic Quality of Text Applications to Machine Translation, Automatic Summarization and Human-Authored Text
    Nenkova, Ani
    Chae, Jieun
    Louis, Annie
    Pitler, Emily
    [J]. EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION, 2010, 5790 : 222 - 241
  • [9] Automatic Text Summarization
    Soumya, S.
    Kumar, Geethu S.
    Naseem, Rasia
    Mohan, Saumya
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 787 - 789
  • [10] Automatic Text Summarization
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 192 - +