NGram Approach for Semantic Similarity on Arabic Short Text

被引:0
|
作者
Al-Mahmoud, Rana Husni [1 ]
Sharieh, Ahmad [2 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman, Jordan
[2] Univ Jordan, Comp Sci Dept, King Abdullah II Sch Informat Technol, Amman, Jordan
关键词
-Arabic text; Ngram; semantic sentences similarity; short text; ALMaany; natural language; semantic similarity of words; corpus-based measures; TWEETS;
D O I
10.14569/IJACSA.2022.0131199
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Measuring the semantic similarity between words requires a method that can simulate human thought. The use of computers to quantify and compare semantic similarities has become an important research area in various fields, including artificial intelligence, knowledge management, information re-trieval, and natural language processing. Computational seman-tics require efficient measures for computing concept similarity, which still need to be developed. Several computational measures quantify semantic similarity based on knowledge resources such as the WordNet taxonomy. Several measures based on taxonom-ical parameters have been applied to optimize the expression for content semantics. This paper presents a new similarity measure for quantifying the semantic similarity between concepts, words, sentences, short text, and long text based on NGram features and Synonyms of NGram related to the same domain. The proposed algorithm was tested on 700 tweets, and the semantic similarity values were compared with cosine similarity on the same dataset. The results were analyzed manually by a domain expert who concluded that the values provided by the proposed algorithm were better than the cosine similarity values within the selected domain regarding the semantic similarity between the datasets' short texts.
引用
收藏
页码:857 / 866
页数:10
相关论文
共 50 条
  • [41] Arabic Text Classification based on Semantic Relations
    Hijazi, Musab
    Zeki, Akram
    Ismail, Amelia
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (02): : 937 - 946
  • [42] Context-based Arabic Word Sense Disambiguation using Short Text Similarity Measure
    Bekkali, Mohammed
    Lachkar, Abdelmonaime
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA'18), 2018,
  • [43] Learning short-text semantic similarity with word embeddings and external knowledge sources
    Nguyen, Hien T.
    Duong, Phuc H.
    Cambria, Erik
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [44] Short Text Similarity Measurement Based on Coupled Semantic Relation and Strong Classification Features
    Ma, Huifang
    Liu, Wen
    Li, Zhixin
    Lin, Xianghong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 135 - 147
  • [45] Bridging the Gap Between Relevance Matching and Semantic Matching for Short Text Similarity Modeling
    Rao, Jinfeng
    Liu, Linqing
    Tay, Yi
    Yang, Wei
    Shi, Peng
    Lin, Jimmy
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5370 - 5381
  • [46] A Chinese Short Text Semantic Similarity Computation Model Based on Stop Words and TongyiciCilin
    Tang Shancheng
    Bai Yunyue
    Ma Fuyu
    [J]. PROCEEDINGS OF 2017 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2017), 2017, : 310 - 314
  • [47] Automatic Bangla Text Summarization Using Term Frequency and Semantic Similarity Approach
    Sarkar, Avik
    Hossen, Md Sharif
    [J]. 2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [48] A Distributed Arabic Text Classification Approach Using Latent Semantic Analysis for Big data
    Alazzam, Hadeel
    Alsmady, Abdulsalam
    [J]. PROCEEDINGS OF THE 2017 12TH INTERNATIONAL SCIENTIFIC AND TECHNICAL CONFERENCE ON COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES (CSIT 2017), VOL. 1, 2017, : 58 - 61
  • [49] FArSS: Fast and Efficient Semantic Question Similarity in Arabic
    Alkaoud, Mohamed
    [J]. IEEE Access, 2025, 13 : 10944 - 10953
  • [50] AWSS: An Algorithm for Measuring Arabic Word Semantic Similarity
    Almarsoomi, Faaza A.
    O'Shea, James D.
    Bandar, Zuhair
    Crockett, Keeley
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 504 - 509