STEMMING IMPACT ANALYSIS ON INDONESIAN QURAN TRANSLATION AND THEIR EXEGESIS CLASSIFICATION FOR ONTOLOGY INSTANCES

被引:3
|
作者
Utomo, Fandy Setyo [1 ,2 ]
Suryana, Nanna [2 ]
Azmi, Mohd Sanusi [2 ]
机构
[1] Univ AMIKOM Purwokerto, Dept Informat Syst, Fac Comp Sci, Purwokerto, Indonesia
[2] Univ Teknikal Malaysia Melaka, Ctr Adv Comp Technol, Fac Informat & Commun Technol, Melaka, Malaysia
来源
IIUM ENGINEERING JOURNAL | 2020年 / 21卷 / 01期
关键词
K-nearest neighbor; neural network; ontology learning; ontology population; support vector machine;
D O I
10.31436/iiumej.v21i1.1170
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The current gap that appears in the Quran ontology population domain is stemming impact analysis on Indonesian Quran translation and its exegesis (Tafsir) to develop ontology instances. The existing studies of stemming effect analysis were performed in various languages, datasets, stemming methods, cases, and classifiers. However, there is a lack of literature that studies the stemming influence on instance classification for Quran ontology with different datasets, classifiers, Quran translations, and their exegesis in Indonesian. Based on this problem, our study aims to investigate and analyse the stemming impact on instance classification results using Indonesian Quran translation and their exegesis as datasets with multiple supervised classifiers. Our classification framework consists of text pre-processing, feature extraction, and text classification stage. Sastrawi stemmer was used to perform stemming operation in the text pre-processing stage. Based on our experiment results, it was found that Support Vector Machine (SVM) with Term Frequency-Inverse Document Frequency (TF-IDF) and stemming operation owns the best classification performance, i.e., 70.75% for average accuracy and 71.55% for average precision in Indonesian Quran translation dataset on 20% test data size. While in 30% test data size, SVM and TF-IDF with stemming process own the best classification performance, i.e., 67.30% for average accuracy and 68.10% for average precision in Ministry of Religious Affairs Indonesia dataset. Furthermore, in this study, it was also discovered that the Backpropagation Neural Network has the most precision and accuracy reduction due to the negative impact of stemming operations.
引用
收藏
页码:33 / 50
页数:18
相关论文
共 1 条
  • [1] Analysis of Stemming Influence on Indonesian Tweet Classification
    Hidayatullah A.F.
    Ratnasari C.I.
    Wisnugroho S.
    [J]. Telkomnika (Telecommunication Computing Electronics and Control), 2016, 14 (02) : 665 - 673