Novel Approach towards Arabic Question Similarity Detection

被引:0
|
作者
Daoud, Mohammad [1 ]
机构
[1] Amer Univ Madaba, CS Dept, Fac IT, Madaba, Jordan
来源
2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS) | 2019年
关键词
text similarity; question analysis; question similarity; semantic similarity; data science; Natural Language Processing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we are addressing the automatic detection of Arabic question similarity, which is an essential issue in a variety of NLP/NLU applications such as question answering systems, virtual assistants, chatbots.etc. We are proposing and experimenting a rule-based approach that relies on lexical and semantic similarity between questions with the utilization of supervised learning algorithms. Our approach categorizes questions semantically according to their type and scope; this categorization is based on hypothetical rules that have been validated empirically, for example, a Timex Factoid question (a question asking about time) is less likely similar to an Enamex Factoid question (a question asking about a named entity). This article details the procedures of question pairs preprocessing, lexical analysis, feature extraction and selection and most importantly the similarity measures. According to the experiment we have conducted, our approach achieved promising precision and accuracy based on a test data of 1450 question pairs.
引用
收藏
页码:158 / 163
页数:6
相关论文
共 50 条
  • [1] A Text Semantic Similarity Approach for Arabic Paraphrase Detection
    Mahmoud, Adnen
    Zrigui, Ahmed
    Zrigui, Mounir
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 338 - 349
  • [2] FArSS: Fast and Efficient Semantic Question Similarity in Arabic
    Alkaoud, Mohamed
    IEEE Access, 2025, 13 : 10944 - 10953
  • [3] FArSS: Fast and Efficient Semantic Question Similarity in Arabic
    Alkaoud, Mohamed
    IEEE ACCESS, 2025, 13 : 10944 - 10953
  • [4] FArSS: Fast and Efficient Semantic Question Similarity in Arabic
    Alkaoud, Mohamed
    IEEE ACCESS, 2025, 13 : 10944 - 10953
  • [5] Towards Building an Arabic Plagiarism Detection System: Plagiarism Detection in Arabic
    Khan, Imtiaz Hussain
    Siddiqui, Muazzam Ahmed
    Jambi, Kamal M.
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2019, 9 (03) : 12 - 22
  • [6] Arabic Semantic Similarity Approach for Farmers' Complaints
    Farouk, Rehab Ahmed
    Khafagy, Mohammed H.
    Ali, Mostafa
    Munir, Kamran
    Badry, Rasha M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (10) : 348 - 358
  • [7] Learning English and Arabic question similarity with Siamese Neural Networks in community question answering services
    Othman, Nouha
    Faiz, Rim
    Smaili, Kamel
    DATA & KNOWLEDGE ENGINEERING, 2022, 138
  • [8] Question similarity calculating method towards medical question answering system
    Wan, Fucheng
    Zhang, Dongjiao
    Zhang, Lei
    Zhu, Ao
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 278 - 278
  • [9] Novel Approach to Phishing Detection Using ML and Visual Similarity
    Sanghavi, Preet
    Kunchapu, Achyuth
    Kulkarni, Apeksha
    Solani, Devansh
    Anson, A.
    MACHINE LEARNING AND AUTONOMOUS SYSTEMS, 2022, 269 : 117 - 131
  • [10] Towards a Passages Extraction Method for Arabic Question Answering Systems
    Lahbari, Imane
    Alami, Hamza
    Zidani, Khalid Alaoui
    Ouatik, Said El Alaoui
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 230 - 237