OnSeS: A Novel Online Short Text Summarization based on BM25 and Neural Network

被引:0
|
作者
Niu, Jianwei [1 ]
Zhao, Qingjuan [1 ]
Wang, Lei [1 ]
Chen, Huan [1 ]
Atiquzzaman, Mohammed [2 ]
Peng, Fei [3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Univ Oklahoma, Sch Comp Sci, Norman, OK 73019 USA
[3] Shanghai Res Inst Aerosp Comp Technol, Shanghai 200050, Peoples R China
基金
中国国家自然科学基金;
关键词
short text clustering; text ranking; opinion extraction; short text summarization; neural machine translation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The last decade has witnessed a dramatic growth of social networks, such as Twitter, Sina Microblog, etc. Messages/ short texts on these platforms are generally of limited length, causing difficulties for machines to understand. Moreover, it is rarely possible for users to read and understand all the content due to the large quantity. So it is imperative to cluster and extract the viewpoints of these short texts. To solve this, the representation of a word is enriched with additional features from external, but it is demanding in terms of computational and time resources. In this paper, we proposed OnSeS, a novel short text summarization method which makes full use of word2vec to represent a word and utilizes neural network model to generate each word of the summary. OnSeS consists of three phrases: 1) clustering short texts using the k-means algorithm; 2) ranking content of each cluster by building a graph-based ranking model using BM25; 3) generating main point of each cluster with the help of neural machine translation model on the top ranked sentence. The experimental results reveal that our proposed fully data-driven approach outperforms state-of-the-art method.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Opinion Summarization for Short Texts based on BM25 and Syntactic Parsing
    Niu, Jianwei
    Zhao, Qingjuan
    Wang, Lei
    Chen, Huan
    Zheng, Shichao
    [J]. 2016 IEEE 14TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2016, : 1177 - 1180
  • [2] INCREMENTAL CLUSTERING IN SHORT TEXT STREAMS BASED ON BM25
    Xu, Lixin
    Chen, Guang
    Yang, Lei
    [J]. 2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 8 - 12
  • [3] Probabilistic Neural Network Based Text Summarization
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 43 - 48
  • [4] Injecting the BM25 Score as Text Improves BERT-Based Re-rankers
    Askari, Arian
    Abolghasemi, Amin
    Pasi, Gabriella
    Kraaij, Wessel
    Verberne, Suzan
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT I, 2023, 13980 : 66 - 83
  • [5] Convolutional Neural Network based for Automatic Text Summarization
    Alquliti, Wajdi Homaid
    Ghani, Norjihan Binti Abdul
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 200 - 211
  • [6] Field-weighted XML retrieval based on BM25
    Lu, Wei
    Robertson, Stephen
    MacFarlane, Andrew
    [J]. ADVANCES IN XML INFORMATION RETRIEVAL AND EVALUATION, 2006, 3977 : 161 - 171
  • [7] Bug report quality detection based on the BM25 algorithm
    Chen L.
    Huang S.
    Sun J.
    Hui Z.
    Wu K.
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2020, 60 (10): : 829 - 836
  • [8] A Review on Neural network based Abstractive Text Summarization models
    Tandel, Jinal
    Mistree, Kinjal
    Shah, Parth
    [J]. 2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [9] Text Summarization Method Based on Gated Attention Graph Neural Network
    Huang, Jingui
    Wu, Wenya
    Li, Jingyi
    Wang, Shengchun
    [J]. SENSORS, 2023, 23 (03)
  • [10] A Generative Text Summarization Model Based on Document Structure Neural Network
    Huang, Haihui
    Zha, Maohong
    [J]. APPLIED INTELLIGENCE AND INFORMATICS, AII 2021, 2021, 1435 : 176 - 187