Content curation algorithm on blog posts using hybrid computing

被引:4
|
作者
Khatter, Harsh [1 ]
Ahlawat, Anil Kumar [1 ]
机构
[1] Dr APJ Abdul Kalam Tech Univ, KIET Grp Inst, Delhi NCR, Lucknow, Uttar Pradesh, India
关键词
Blog; Cosine similarity; Fuzzy logic self-attention; Bi-directional Long short term memory auto encoder;
D O I
10.1007/s11042-022-12105-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Content curation is a significant step to identify the relevant content for the searched topics. There are many methods introduced to generate summarized contents but those methods focussed only on generating precise contents that lacked the key essence of the input texts. Therefore, we propose a hybrid model with the integration of self-attention to the bi-directional long short-term memory auto-encoder (Bi-LSTM-AE) to generate information-rich abstracts. Initially, the dataset is pre-processed and then the major word-level and sentence-level features are extracted. Then, based on the similarities between the contents, the extractive summary is generated which is then given to the auto-encoder for final abstraction. The efficiency of the model has been proved through simulations with the CNN/Daily Mail dataset in terms of ROUGE metrics. The proposed model outperformed the other compared models with a score of 0.59 for ROUGE 1, 0.39 for ROUGE 2 and 0.71 for ROUGE L with high generalization.
引用
收藏
页码:7589 / 7609
页数:21
相关论文
共 50 条
  • [1] Content curation algorithm on blog posts using hybrid computing
    Harsh Khatter
    Anil Kumar Ahlawat
    Multimedia Tools and Applications, 2022, 81 : 7589 - 7609
  • [2] A blog ranking algorithm using analysis of both blog influence and characteristics of blog posts
    Jiwon Kim
    Unil Yun
    Gwangbum Pyun
    Heungmo Ryang
    Gangin Lee
    Eunchul Yoon
    Keun Ho Ryu
    Cluster Computing, 2015, 18 : 157 - 164
  • [3] A blog ranking algorithm using analysis of both blog influence and characteristics of blog posts
    Kim, Jiwon
    Yun, Unil
    Pyun, Gwangbum
    Ryang, Heungmo
    Lee, Gangin
    Yoon, Eunchul
    Ryu, Keun Ho
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (01): : 157 - 164
  • [4] Content analysis of cancer blog posts
    Kim, Sujin
    JOURNAL OF THE MEDICAL LIBRARY ASSOCIATION, 2009, 97 (04) : 260 - 266
  • [5] EXTRACTING MAIN CONTENT-BLOCKS FROM BLOG POSTS
    Akbar, Saiful
    Slaughter, Laura
    Nytro, Oystein
    KDIR 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2010, : 438 - 443
  • [6] Influence detection between blog posts through blog features, content analysis, and community identity
    Tan, Luke Kien-Weng
    Na, Jin-Cheon
    Theng, Yin-Leng
    ONLINE INFORMATION REVIEW, 2011, 35 (03) : 425 - 442
  • [7] Ranking Algorithm Based on Blog Posts Features and Link Analysis
    Zhang, Yong
    Wang, Fang
    2011 INTERNATIONAL CONFERENCE ON FUTURE SOFTWARE ENGINEERING AND MULTIMEDIA ENGINEERING (FSME 2011), 2011, 7 : 126 - 131
  • [8] Gender Clustering of Blog Posts using Distinguishable Features
    HaCohen-Kerner, Yaakov
    Tzach, Yarden
    Asis, Ori
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 384 - 391
  • [9] Blog posts recommendation based on PLSA and Naive bayesian classification algorithm
    Cui, Lin
    Wang, Caiyin
    Wu, Xiaoyin
    Journal of Chemical and Pharmaceutical Research, 2013, 5 (12) : 851 - 858
  • [10] Web Blog Content Curation Using Fuzzy-Related Capsule Network-Based Auto Encoder
    Khatter, Harsh
    Ahlawat, Anil
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (01)