Automatic Persian Text Summarization Using Linguistic Features from Text Structure Analysis

被引:0
|
作者
Heidary, Ebrahim [1 ]
Parvin, Hamid [2 ,3 ,4 ]
Nejatian, Samad [5 ,6 ]
Bagherifard, Karamollah [1 ,6 ]
Rezaie, Vahideh [6 ,7 ]
机构
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[3] Duy Tan Univ, Fac Informat Technol, Da Nang 550000, Vietnam
[4] Islamic Azad Univ, Nourabad Mamasani Branch, Dept Comp Sci, Mamasani, Iran
[5] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[6] Islamic Azad Univ, Yasooj Branch, Young Res & Elite Club, Yasuj, Iran
[7] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 69卷 / 03期
关键词
Natural language processing; extractive summarization; linguistic feature; text structure analysis;
D O I
10.32604/cmc.2021.014361
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the remarkable growth of textual data sources in recent years, easy, fast, and accurate text processing has become a challenge with significant payoffs. Automatic text summarization is the process of compressing text documents into shorter summaries for easier review of its core contents, which must be done without losing important features and information. This paper introduces a new hybrid method for extractive text summarization with feature selection based on text structure. The major advantage of the proposed summarization method over previous systems is the modeling of text structure and relationship between entities in the input text, which improves the sentence feature selection process and leads to the generation of unambiguous, concise, consistent, and coherent summaries. The paper also presents the results of the evaluation of the proposed method based on precision and recall criteria. It is shown that the method produces summaries consisting of chains of sentences with the aforementioned characteristics from the original text.
引用
收藏
页码:2845 / 2861
页数:17
相关论文
共 50 条
  • [32] new graph based text segmentation using Wikipedia for automatic text summarization
    Pourvali, Mohsen
    Abadeh, Mohammad Saniee
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (01) : 35 - 39
  • [33] Using LSA and text segmentation to improve automatic Chinese dialogue text summarization
    Chuan-han Liu
    Yong-cheng Wang
    Fei Zheng
    De-rong Liu
    [J]. Journal of Zhejiang University-SCIENCE A, 2007, 8 : 79 - 87
  • [34] Using LSA and text segmentation to improve automatic Chinese dialogue text summarization
    Liu Chuan-han
    Wang Yong-cheng
    Zheng Fei
    Liu De-rong
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2007, 8 (01): : 79 - 87
  • [35] Open information extraction as an intermediate semantic structure for Persian text summarization
    Rahat, Mahmoud
    Talebpour, Alireza
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2018, 19 (04) : 339 - 352
  • [36] Text analysis for Bengali Text Summarization using Deep Learning
    Al Munzir, Abdullah
    Rahman, Md. Lutfor
    Abujar, Sheikh
    Ohidujjaman
    Hossain, Syed Akhter
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [37] Automatic Arabic Text Summarization Using Analogical Proportions
    Bilel Elayeb
    Amina Chouigui
    Myriam Bounhas
    Oussama Ben Khiroun
    [J]. Cognitive Computation, 2020, 12 : 1043 - 1069
  • [38] Improving the Performance of Text Categorization using Automatic Summarization
    Jiang Xiao-Yu
    Fan Xiao-Zhong
    Wang Zhi-Fei
    Jia Ke-Liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, PROCEEDINGS, 2009, : 347 - +
  • [39] AUTOMATIC TEXT SUMMARIZATION USING SUPPORT VECTOR MACHINE
    Begum, Nadira
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (07): : 1987 - 1996
  • [40] Automatic text summarization using a machine learning approach
    Neto, JL
    Freitas, AA
    Kaestner, CAA
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2507 : 205 - 215