Text feature weighting for summarization of documents in bahasa Indonesia using genetic algorithm

Cited by: 1
Authors
Aristoteles [1]
Herdiyeni, Yeni [2]
Ridha, Ahmad [2]
Adisantoso, Julio [2]
Affiliations
[1] Department of Computer Science, University of Lampung, Bandar Lampung, 35145, Indonesia
[2] Department of Computer Science, Bogor Agricultural University, Bogor, 16680, Indonesia
Keywords
Semantics; Text processing
DOI
Not available
Abstract
This paper performs text feature weighting for summarization of documents in Bahasa Indonesia using a genetic algorithm. There are eleven text features, i.e., sentence position (f1), positive keywords in sentence (f2), negative keywords in sentence (f3), sentence centrality (f4), sentence resemblance to the title (f5), sentence inclusion of named entities (f6), sentence inclusion of numerical data (f7), sentence relative length (f8), bushy path of the node (f9), summation of similarities for each node (f10), and latent semantic feature (f11). We first investigate the effect of the first ten sentence features on the summarization task, then add the latent semantic feature to increase accuracy. All feature score functions are used to train a genetic algorithm model to obtain a suitable combination of feature weights. The summaries are evaluated using the F-measure, which is directly related to the compression rate. The results showed that adding f11 increases the F-measure by 3.26% and 1.55% for compression rates of 10% and 30%, respectively. On the other hand, it decreases the F-measure by 0.58% for a compression rate of 20%. Analysis of the feature weights showed that using only f2, f4, f5, and f11 delivers performance similar to using all eleven features. © 2012 International Journal of Computer Science Issues.
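The pipeline described in the abstract (score each sentence as a weighted sum of its feature scores, keep the top sentences for a given compression rate, and let a genetic algorithm search for weights that maximize the F-measure) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names, the sentence-level F-measure against a reference extract, and the genetic operators (truncation selection, one-point crossover, random-reset mutation) are all hypothetical choices, and the feature values are invented toy data.

```python
import random


def score_sentences(features, weights):
    """Score each sentence as the weighted sum of its feature scores."""
    return [sum(w * f for w, f in zip(weights, row)) for row in features]


def extract_summary(features, weights, compression_rate):
    """Return indices of the top-scoring sentences, in document order."""
    scores = score_sentences(features, weights)
    k = max(1, round(len(features) * compression_rate))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def f_measure(selected, reference):
    """Harmonic mean of precision and recall over sentence indices."""
    overlap = len(set(selected) & set(reference))
    if overlap == 0:
        return 0.0
    precision = overlap / len(selected)
    recall = overlap / len(reference)
    return 2 * precision * recall / (precision + recall)


def evolve_weights(features, reference, compression_rate, n_features,
                   pop_size=20, generations=30, mutation_rate=0.1, seed=0):
    """Search for feature weights in [0, 1] with a simple genetic algorithm.

    Assumed operators (not taken from the paper): keep the fitter half of
    the population, one-point crossover, random-reset mutation.
    Requires n_features >= 2 so that a crossover point exists.
    """
    rng = random.Random(seed)

    def fitness(w):
        return f_measure(extract_summary(features, w, compression_rate), reference)

    population = [[rng.random() for _ in range(n_features)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)
            child = a[:cut] + b[cut:]
            child = [rng.random() if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=fitness)


if __name__ == "__main__":
    # Toy data: 5 sentences with 3 made-up feature scores each.
    toy_features = [
        [0.9, 0.1, 0.2],
        [0.2, 0.8, 0.1],
        [0.1, 0.2, 0.9],
        [0.5, 0.5, 0.5],
        [0.3, 0.1, 0.1],
    ]
    toy_reference = [1, 3]  # sentences a human chose for the summary
    best = evolve_weights(toy_features, toy_reference, 0.4, n_features=3)
    print(best, extract_summary(toy_features, best, 0.4))
```

In this sketch a weight near zero means the corresponding feature contributes little to the ranking, which is the kind of reading that would support dropping features, as the abstract does when it reduces the set to f2, f4, f5, and f11.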
Pages: 1 / 6