NGram Approach for Semantic Similarity on Arabic Short Text

被引:0
|
作者
Al-Mahmoud, Rana Husni [1 ]
Sharieh, Ahmad [2 ]
机构
[1] Appl Sci Private Univ, Fac Informat Technol, Amman, Jordan
[2] Univ Jordan, Comp Sci Dept, King Abdullah II Sch Informat Technol, Amman, Jordan
关键词
-Arabic text; Ngram; semantic sentences similarity; short text; ALMaany; natural language; semantic similarity of words; corpus-based measures; TWEETS;
D O I
10.14569/IJACSA.2022.0131199
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Measuring the semantic similarity between words requires a method that can simulate human thought. The use of computers to quantify and compare semantic similarities has become an important research area in various fields, including artificial intelligence, knowledge management, information re-trieval, and natural language processing. Computational seman-tics require efficient measures for computing concept similarity, which still need to be developed. Several computational measures quantify semantic similarity based on knowledge resources such as the WordNet taxonomy. Several measures based on taxonom-ical parameters have been applied to optimize the expression for content semantics. This paper presents a new similarity measure for quantifying the semantic similarity between concepts, words, sentences, short text, and long text based on NGram features and Synonyms of NGram related to the same domain. The proposed algorithm was tested on 700 tweets, and the semantic similarity values were compared with cosine similarity on the same dataset. The results were analyzed manually by a domain expert who concluded that the values provided by the proposed algorithm were better than the cosine similarity values within the selected domain regarding the semantic similarity between the datasets' short texts.
引用
收藏
页码:857 / 866
页数:10
相关论文
共 50 条
  • [1] A Text Semantic Similarity Approach for Arabic Paraphrase Detection
    Mahmoud, Adnen
    Zrigui, Ahmed
    Zrigui, Mounir
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 338 - 349
  • [2] Short Text Semantic Similarity Measurement Approach Based on Semantic Network
    Hameed, Naamah Hussien
    Alimi, Adel M.
    Sadiq, Ahmed T.
    [J]. BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1581 - 1591
  • [3] Benchmarking short text semantic similarity
    O'Shea, James
    Bandar, Zuhair
    Crockett, Keeley
    McLean, David
    [J]. International Journal of Intelligent Information and Database Systems, 2010, 4 (02) : 103 - 120
  • [4] A New Alignment Word-Space Approach for Measuring Semantic Similarity for Arabic Text
    Ismail, Shimaa
    Shishtawy, Tarek E. L.
    Alsammak, Abdelwahab Kamel
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2022, 18 (01)
  • [5] Arabic Semantic Similarity Approach for Farmers' Complaints
    Farouk, Rehab Ahmed
    Khafagy, Mohammed H.
    Ali, Mostafa
    Munir, Kamran
    Badry, Rasha M.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (10) : 348 - 358
  • [6] An Approach to Semantic Text Similarity Computing
    Akermi, Imen
    Faiz, Rim
    [J]. MODERN TRENDS AND TECHNIQUES IN COMPUTER SCIENCE (CSOC 2014), 2014, 285 : 383 - 393
  • [7] A Fast and Efficient Semantic Short Text Similarity Metric
    Croft, David
    Coupland, Simon
    Shell, Jethro
    Brown, Stephen
    [J]. 2013 13TH UK WORKSHOP ON COMPUTATIONAL INTELLIGENCE (UKCI), 2013, : 221 - 227
  • [8] An algorithm for semantic similarity of short text based on WordNet
    Zhai, Yan-Dong
    Wang, Kang-Ping
    Zhang, Dong-Na
    Hunag, Lan
    Zhou, Chun-Guang
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2012, 40 (03): : 617 - 620
  • [9] Short Text Similarity Calculation Using Semantic Information
    Pu, Haoyu
    Fei, Gaolei
    Zhao, Hailin
    Hu, Guangmin
    Jiao, Chengbo
    Xu, Zhoujun
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 144 - 150
  • [10] Semantic similarity based approach for reducing Arabic texts dimensionality
    Awajan, Arafat
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (02) : 191 - 201