Random Indexing and Modified Random Indexing based approach for extractive text summarization

被引:8
|
作者
Chatterjee, Niladri [1 ]
Sahoo, Pramod Kumar [1 ,2 ]
机构
[1] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India
[2] Def Res & Dev Org, Inst Syst Studies & Anal, Delhi 110054, India
来源
COMPUTER SPEECH AND LANGUAGE | 2015年 / 29卷 / 01期
关键词
Word Space Model; Random Indexing; PageRank; Convolution; Modified Random Indexing; INFORMATION;
D O I
10.1016/j.csl.2014.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Indexing based extractive text summarization has already been proposed in literature. This paper looks at the above technique in detail, and proposes several improvements. The improvements are both in terms of formation of index (word) vectors of the document, and construction of context vectors by using convolution instead of addition operation on the index vectors. Experiments have been conducted using both angular and linear distances as metrics for proximity. As a consequence, three improved versions of the algorithm, viz. RISUM, RISUM+ and MRISUM were obtained. These algorithms have been applied on DUC 2002 documents, and their comparative performance has been studied. Different ROUGE metrics have been used for performance evaluation. While RISUM and RISUM+ perform almost at par, MRISUM is found to outperform both RISUM and RISUM+ significantly. MRISUM also outperforms LSA+TRM based summarization approach. The study reveals that all the three Random Indexing based techniques proposed in this study produce consistent results when linear distance is used for measuring proximity. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 44
页数:13
相关论文
共 50 条
  • [21] Random indexing K-tree
    De Vries, Christopher M.
    De Vine, Lance
    Geva, Shlomo
    ADCS 2009 - Proceedings of the Fourteenth Australasian Document Computing Symposium, 2009, : 43 - 50
  • [22] Simple recurrent networks and random indexing
    Sakurai, A
    Hyodo, D
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 35 - 39
  • [23] On the Usability of Random Indexing in Patent Retrieval
    Lupu, Mihai
    GRAPH-BASED REPRESENTATION AND REASONING, 2014, 8577 : 202 - 216
  • [24] Language Geometry Using Random Indexing
    Joshi, Aditya
    Halseth, Johan T.
    Kanerva, Pentti
    QUANTUM INTERACTION, QI 2016, 2017, 10106 : 265 - 274
  • [25] Malayalam Text Summarization: An Extractive Approach
    Krishnaprasad, P.
    Sooryanarayanan, A.
    Ramanujan, Ajeesh
    2016 INTERNATIONAL CONFERENCE ON NEXT GENERATION INTELLIGENT SYSTEMS (ICNGIS), 2016, : 40 - 43
  • [26] Random indexing for comparing path-based chemical fingerprints
    Devaney, Patrick
    Lancia, David
    Milbank, Jared
    Bradley, Mary
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [27] Summarization graph indexing: Beyond frequent structure-based approach
    Zou, Lei
    Chen, Lei
    Zhang, Huaming
    Lu, Yansheng
    Lou, Qiang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 141 - +
  • [28] A weighted word embedding based approach for extractive text summarization
    Rani, Ruby
    Lobiyal, Daya K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [29] Extractive Arabic Text Summarization-Graph-Based Approach
    AL-Khassawneh, Yazan Alaya
    Hanandeh, Essam Said
    ELECTRONICS, 2023, 12 (02)
  • [30] Extractive Odia Text Summarization System: An OCR Based Approach
    Pattnaik, Priyanka
    Mallick, Debasish Kumar
    Parida, Shantipriya
    Dash, Satya Ranjan
    BIOLOGICALLY INSPIRED TECHNIQUES IN MANY-CRITERIA DECISION MAKING, 2020, 10 : 136 - 143