Query-oriented Unsupervised Multi-document Summarization on Big Data

被引:0
|
作者
Sunaina [1 ]
Kamath, Sowmya S. [1 ]
机构
[1] Natl Inst Technol Karnataka, Dept Informat Technol, Surathkal, India
关键词
multi-document summarization; natural language processing; dynamic programming; map-reduce;
D O I
10.1145/2967878.2967919
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Real time document summarization is a critical need nowadays, owing to the large volume of information available for our reading, and our inability to deal with this entirely due to limitations of time and resources. Oftentimes, information is available in multiple sources, offering multiple contexts and viewpoints on a single topic of interest. Automated multi-document summarization (MDS) techniques aim to address this problem. However, current techniques for automated MDS suffer from low precision and accuracy with reference to a given subject matter, when compared to those summaries prepared by humans and takes large time to create the summary when the input given is too huge. In this paper, we propose a hybrid MDS technique combining feature based algorithms and dynamic programming for generating a summary from multiple documents based on user provided query. Further, in real-world scenarios, Web search serves up a large number of URLs to users, and the work of making sense of these with reference to a particular query is left to the user. In this context, an efficient parallelized MDS technique based on Hadoop is also presented, for serving a concise summary of multiple Webpage contents for a given user query in reduced time duration.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Query-oriented unsupervised multi-document summarization via deep learning model
    Zhong, Sheng-hua
    Liu, Yan
    Li, Bin
    Long, Jing
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 8146 - 8155
  • [2] A cluster-sensitive graph model for query-oriented multi-document summarization
    Wei, Furu
    Li, Wenjie
    Lu, Qin
    He, Yanxiang
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 446 - +
  • [3] A Query-Sensitive Graph-Based Sentence Ranking Algorithm for Query-Oriented Multi-Document Summarization .
    Wei, Furu
    He, Yanxiang
    Li, Wenjie
    Lu, Qin
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 9 - +
  • [4] Unsupervised Multi-document Summarization with Holistic Inference
    Zhang, Haopeng
    Cho, Sangwoo
    Song, Kaiqiang
    Wang, Xiaoyang
    Wang, Hongwei
    Zhang, Jiawei
    Yu, Dong
    [J]. 13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 123 - 133
  • [5] Multi-document summarization based on unsupervised clustering
    Ji, Paul
    [J]. INFORMATION RETRIEVAL TECHNOLOLGY, PROCEEDINGS, 2006, 4182 : 560 - 566
  • [6] Exploiting the Role of Named Entities in Query-Oriented Document Summarization
    Li, Wenjie
    Wei, Furu
    You, Ouyang
    Lu, Qin
    He, Yanxiang
    [J]. PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 740 - +
  • [7] A Spectral Method for Unsupervised Multi-Document Summarization
    Wang, Kexiang
    Chang, Baobao
    Sui, Zhifang
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 435 - 445
  • [8] Data Augmentation for Abstractive Query-Focused Multi-Document Summarization
    Pasunuru, Ramakanth
    Celikyilmaz, Asli
    Galley, Michel
    Xiong, Chenyan
    Zhang, Yizhe
    Bansal, Mohit
    Gao, Jianfeng
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13666 - 13674
  • [9] Unsupervised Query-Focused Multi-Document Summarization using the Cross Entropy Method
    Feigenblat, Guy
    Roitman, Haggai
    Boni, Odellia
    Konopnicki, David
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 961 - 964
  • [10] Query-Oriented Summarization of RDF Graphs
    Cebiric, Sejla
    Goasdoue, Francois
    Manolescu, Ioana
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 2013 - 2016