Conceptual Search for Arabic Web Content

Cited by: 5
Authors
Al-Zoghby, Aya M. [1]
Shaalan, Khaled [2]
Affiliations
[1] Mansoura Univ, Fac Comp & Informat Syst, Mansoura, Egypt
[2] British Univ, Dubai, U Arab Emirates
Keywords
Semantic Web (SW); Arabic Language; Arabic web content; Semantic Search; Vector Space Model (VSM); Universal WordNet (UWN); Wikipedia; Concept indexing
DOI
10.1007/978-3-319-18117-2_30
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The main reason for adopting Semantic Web technology in information retrieval is to improve retrieval performance. A semantic search system locates web content that is semantically related to the query's concepts rather than relying on exact matches with the query's keywords. Interest in Arabic web content is growing worldwide because of its cultural, political, strategic, and economic importance. Arabic is linguistically rich at every level, which makes effective search of Arabic text a challenge. In the literature, studies that address searching Arabic web content with Semantic Web technology remain scarce relative to the importance of the language. In this research, we propose an Arabic semantic search approach and apply it to Arabic web content. The approach is based on the Vector Space Model (VSM), which has proved successful and whose traditional version many studies have sought to improve. Our approach uses the Universal WordNet (UWN) to build a rich concept-space index in place of the traditional term-space index. This index enables Semantic VSM capabilities. Moreover, we introduce a new incidence measurement that calculates the semantic significance degree of a concept in a document and that fits our model better than the traditional term frequency. Furthermore, to determine the semantic similarity of two vectors, we introduce a new formula for calculating the semantic weight of a concept. Because documents are indexed by their topics and classified semantically, we were able to search Arabic documents effectively. In the experiments, Precision, Recall, and F-measure changed from 77%, 56%, and 63% to 71%, 96%, and 81%, respectively: Recall and F-measure improved substantially, while Precision decreased slightly.
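The idea of replacing a term-space index with a concept-space index can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `TERM_TO_CONCEPT` lexicon is a hypothetical stand-in for Universal WordNet lookups, the transliterated tokens are placeholders for Arabic surface forms, and plain concept counts with cosine similarity stand in for the paper's incidence measurement and semantic weight formula, neither of which is specified in the abstract.

```python
import math
from collections import Counter

# Hypothetical lexicon standing in for Universal WordNet lookups.
# Keys are transliterated placeholder tokens; values are made-up concept IDs.
TERM_TO_CONCEPT = {
    "kitab": "c:book",
    "mujallad": "c:book",   # a different word mapped to the same concept
    "sayyara": "c:car",
    "markaba": "c:car",
}


def concept_vector(tokens, lexicon=TERM_TO_CONCEPT):
    """Count occurrences per concept (not per term); unknown tokens are skipped."""
    return Counter(lexicon[t] for t in tokens if t in lexicon)


def cosine(u, v):
    """Cosine similarity between two sparse concept vectors."""
    dot = sum(u[c] * v[c] for c in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query_tokens, docs):
    """Rank documents by concept-space similarity to the query, best first."""
    q = concept_vector(query_tokens)
    scored = [(doc_id, cosine(q, concept_vector(tokens)))
              for doc_id, tokens in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


docs = {
    "d1": ["mujallad", "mujallad", "sayyara"],
    "d2": ["markaba"],
}
ranking = search(["kitab"], docs)
```

In this toy corpus the query token `kitab` appears in neither document, so exact keyword matching would retrieve nothing. In concept space, `d1` ranks first because `mujallad` maps to the same concept as the query, which is the kind of recall gain the abstract attributes to concept indexing.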
Pages: 405-416
Page count: 12