TAG RECOMMENDATION FOR SHORT ARABIC TEXT BY USING LATENT SEMANTIC ANALYSIS OF WIKIPEDIA

Cited by: 3
Authors
AlAgha, Iyad [1]
Abu-Samra, Yousef [1]
Affiliation
[1] Islamic Univ Gaza, Dept Comp Sci, Gaza, Palestine
Keywords
Tag recommendation; Arabic; Short text; Latent semantic analysis; Wikipedia; Apache Spark
DOI
10.5455/jjcit.71-1575827721
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Text tagging has gained growing attention as a way of associating metadata that supports information retrieval and classification. To reduce the difficulty of manual tagging, tag recommendation has emerged as a way to assist users by presenting a list of relevant tags. However, most existing approaches to tag recommendation focus on domain-specific tagging and on long-form text. Open-domain tagging is challenging because comprehensive knowledge is lacking and the computations involved are intensive. Tagging short text is also problematic because statistical features are difficult to extract from it. In terms of language, most efforts have focused on English text, while the tagging of Arabic text has been hindered by the difficulty of processing the Arabic language and the lack of knowledge sources in Arabic. This work proposes an approach to tag recommendation for short Arabic text. It uses the Arabic Wikipedia as background knowledge to generate tags in response to short input text. Latent semantic analysis is used to analyze Wikipedia content and find the articles relevant to the input text. Tags are then selected from the titles and categories of these articles and ranked according to relevance. The approach was evaluated using experts' relevance ratings of 993 tags. It achieved a mean average precision of 84.39% and a mean reciprocal rank of 96.53%. A thorough discussion of the results highlights the limitations and strengths of the approach.
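The pipeline described in the abstract (project articles into a latent semantic space, retrieve the articles closest to the input text, then rank tags drawn from their titles and categories) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy English corpus, the raw term counts, the rank `k=2`, the names `build_lsa` and `recommend_tags`, and the rule of scoring a tag by summing the similarities of the articles it comes from are all assumptions made for illustration, and NumPy stands in for the Apache Spark setup named in the keywords.

```python
import numpy as np

# Hypothetical stand-in for Wikipedia: title -> (article text, categories).
ARTICLES = {
    "Football": ("football team goal match player league",
                 ["Sports", "Ball games"]),
    "Basketball": ("basketball team player court match league",
                   ["Sports", "Ball games"]),
    "Python (language)": ("python programming language code software",
                          ["Programming languages", "Software"]),
    "Java (language)": ("java programming language code software compiler",
                        ["Programming languages", "Software"]),
}

def build_lsa(articles, k=2):
    """Build a rank-k LSA model from a term-document count matrix."""
    titles = list(articles)
    vocab = sorted({w for text, _ in articles.values() for w in text.split()})
    index = {w: i for i, w in enumerate(vocab)}
    A = np.zeros((len(vocab), len(titles)))
    for j, title in enumerate(titles):
        for w in articles[title][0].split():
            A[index[w], j] += 1.0  # raw counts (assumed weighting)
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    U_k = U[:, :k]
    doc_vecs = Vt[:k, :].T * s[:k]  # each row: one article in latent space
    return titles, index, U_k, doc_vecs

def recommend_tags(text, articles, model, top_articles=2):
    """Rank candidate tags (titles and categories) for a short input text."""
    titles, index, U_k, doc_vecs = model
    q = np.zeros(len(index))
    for w in text.lower().split():
        if w in index:  # words outside the vocabulary are ignored
            q[index[w]] += 1.0
    q_vec = q @ U_k  # fold the query into the same latent space
    norms = np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q_vec)
    sims = doc_vecs @ q_vec / np.where(norms == 0, 1.0, norms)  # cosine
    scores = {}
    for j in np.argsort(-sims)[:top_articles]:
        for tag in [titles[j]] + articles[titles[j]][1]:
            # Assumed ranking rule: sum similarities over source articles.
            scores[tag] = scores.get(tag, 0.0) + float(sims[j])
    return sorted(scores, key=lambda t: (-scores[t], t))

model = build_lsa(ARTICLES, k=2)
print(recommend_tags("player scored a goal in the match", ARTICLES, model))
```

In this toy setting the two sports articles are retrieved, so the category tags they share rank above the individual article titles. A real system would need Arabic-specific preprocessing and a term weighting scheme, neither of which is specified in the abstract.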
Pages: 165-181
Number of pages: 17