Automatic text summarization based on latent semantic indexing

被引:6
|
作者
Ai, Dongmei [1 ,2 ]
Zheng, Yuchao [2 ]
Zhang, Dezheng [2 ]
机构
[1] Univ Sci & Technol, Sch Appl Sci, Beijing, Peoples R China
[2] Univ Sci & Technol, Sch Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Automatic text summarization; Latent semantic indexing; Vector space model;
D O I
10.1007/s10015-010-0759-x
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Automatic summarization is a topic of common concern in computational linguistics and information science, since a computer system of text summarization is considered to be an effective means of processing information resources. A method of text summarization based on latent semantic indexing (LSI), which uses semantic indexing to calculate the sentence similarity, is proposed in this article. It improves the accuracy of sentence similarity calculations and subject delineation, and helps the abstracts generated to cover the documents comprehensively as well as reducing redundancies. The effectiveness of the method is proved by the experimental results. Compared with the traditional keyword-based vector space model method of automatic text summarization, the quality of the abstracts generated was significantly improved.
引用
收藏
页码:25 / 29
页数:5
相关论文
共 50 条
  • [1] Latent Topic-semantic Indexing based Automatic Text Summarization
    Yu, Jiangsheng
    Chen, Xue-wen
    [J]. 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 120 - 126
  • [2] Automatic text summarization using latent semantic analysis
    I. V. Mashechkin
    M. I. Petrovskiy
    D. S. Popov
    D. V. Tsarev
    [J]. Programming and Computer Software, 2011, 37 : 299 - 305
  • [3] Automatic Text Summarization Using Latent Semantic Analysis
    Mashechkin, I. V.
    Petrovskiy, M. I.
    Popov, D. S.
    Tsarev, D. V.
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2011, 37 (06) : 299 - 305
  • [4] Hybrid Latent Semantic Analysis and Random Indexing Model for Text Summarization
    Chatterjee, Niladri
    Yadav, Nidhika
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGY FOR COMPETITIVE STRATEGIES, 2019, 40 : 149 - 156
  • [5] Automatic Text Summarization of Konkani Texts Using Latent Semantic Analysis
    D'Silva, Jovi
    Sharma, Uzzal
    More, Chaitali
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 425 - 437
  • [6] Text segmentation by latent semantic indexing
    Ishioka, T
    [J]. NEW DEVELOPMENTS IN PSYCHOMETRICS, 2003, : 689 - 696
  • [7] Research on multi-document summarization based on latent semantic indexing
    Qin, Bing
    Liu, Ting
    Zhang, Yu
    Li, Sheng
    [J]. Journal of Harbin Institute of Technology (New Series), 2005, 12 (01) : 91 - 94
  • [8] Research on multi-document summarization based on latent semantic indexing
    秦兵
    刘挺
    张宇
    李生
    [J]. Journal of Harbin Institute of Technology(New series), 2005, (01) : 91 - 94
  • [9] NLP Based Latent Semantic Analysis for Legal Text Summarization
    Merchant, Kaiz
    Pande, Yash
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1803 - 1807
  • [10] A Comprehensive Method for Text Summarization Based on Latent Semantic Analysis
    Wang, Yingjie
    Ma, Jun
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 394 - 401