QSST: A Quranic Semantic Search Tool based on word embedding

被引:9
|
作者
Mohamed, Ensaf Hussein [1 ]
Shokry, Eyad Mohamed [1 ]
机构
[1] Helwan Univ, Fac Comp & Artificial Intelligence, Comp Sci Dept, Cairo, Egypt
关键词
Information Retrieval; Word Embedding; Concept-based Search; Ontology; Semantic Search; Arabic Natural Language Processing; Holy Quran;
D O I
10.1016/j.jksuci.2020.01.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Retrieving information from the Quran is an important field for Quran scholars and Arabic researchers. There are two types of Quran searching techniques: semantic or concept-based and keyword-based. Concept-based search is a challenging task, especially in a complex corpus such as Quran. This paper presents a concept-based searching tool (QSST) for the Holy Quran. It consists of four phases. In the first phase, the Quran dataset is built by manually annotating Quran verses based on the ontology of Mushaf Al-Tajweed. The second phase is word Embedding, this phase generates features' vectors for words by training a Continuous Bag of Words (CBOW) architecture on large Quranic and Classic Arabic corpus. The third phase includes calculating the features' vectors of both input query and Quranic topics. Finally, retrieving the most relevant verses by computing the cosine similarity between both topic and query vectors. The performance of the proposed QSST is measured by comparing results against Mushaf Al-Tajweed. Then, precision, recall, and F-score are computed and their percentages were 76.91%, 72.23% 69.28% respectively. In addition, the results are evaluated by three Islamic experts and the average precision was 91.95%. Finally, QSST results are compared with the recent existing tools; QSST outperformed them. (C) 2020 The Authors. Production and hosting by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:934 / 945
页数:12
相关论文
共 50 条
  • [31] SEMANTIC CATEGORY EFFECTS IN VISUAL WORD SEARCH
    KARLIN, MB
    BOWER, GH
    PERCEPTION & PSYCHOPHYSICS, 1976, 19 (05): : 417 - 424
  • [32] Combining Word Embedding and Lexical Database for Semantic Relatedness Measurement
    Lee, Yang-Yin
    Ke, Hao
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 73 - 74
  • [33] Predicting Gene Functional Interactions Using Semantic Word Embedding
    Roy, Arpita
    Pan, Shimei
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 509 - 513
  • [34] Embedding memoization to the semantic tree search for deciding QBFs
    GhasemZadeh, M
    Klotz, V
    Meinel, C
    AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 681 - 693
  • [35] Semantic Proximity Search on Heterogeneous Graph by Proximity Embedding
    Liu, Zemin
    Zheng, Vincent W.
    Zhao, Zhou
    Zhu, Fanwei
    Chang, Kevin Chen-Chuan
    Wu, Minghui
    Ying, Jing
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 154 - 160
  • [36] Enhancing Domain Word Embedding via Latent Semantic Imputation
    Yao, Shibo
    Yu, Dantong
    Xiao, Keli
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 557 - 565
  • [37] Modelling the Semantic Change Dynamics using Diachronic Word Embedding
    Boukhaled, Mohamed Amine
    Fagard, Benjamin
    Poibeau, Thierry
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 944 - 951
  • [38] The Semantic Similarity Relation of Entities Discovery: Using Word Embedding
    Ruan, Dong-ru
    Mao, Yu-xin
    Pan, Hong-yan
    Gao, Kai
    2017 9TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC 2017), 2017, : 845 - 850
  • [39] GoNTogle: A Tool for Semantic Annotation and Search
    Giannopoulos, Giorgos
    Bikakis, Nikos
    Dalamagas, Theodore
    Sellis, Timos
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT 2, PROCEEDINGS, 2010, 6089 : 376 - +
  • [40] Semantic proximity assessment in Bhojpuri and Maithili: a word embedding perspective
    Yadav, Arun Kumar
    Kumar, Abhishek
    Kumar, Mohit
    Yadav, Divakar
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)