Information Retrieval from Unstructured Arabic Legal Data

被引:1
|
作者
Mezghanni, Imen Bouaziz [1 ]
Gargouri, Faiez [1 ]
机构
[1] ISIM Sfax, MIRACL Lab, Sfax, Tunisia
关键词
Information retrieval; Arabic information retrieval; Unstructured data; Structured data;
D O I
10.1007/978-3-319-42911-3_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the steady increase of published and stored information in the form of Arabic unstructured texts, current Information Retrieval (IR) systems must be able to suit the nature and requirements of this language for an accurate and efficient search. This paper sheds light on the challenges in Arabic IR (AIR) and proposes an approach for enhancing the process of AIR based on transforming these texts into structured documents in XML format through a document ontology as well as a set of linguistic grammars. The IR system hence is done on the XML documents. The aim of such system is to incorporate the knowledge on the document structure and on specific content elements in computing the relevance of an information element. A query expansion module mainly based on domain ontology as well as user profile is proposed for the enhancement of the search results.
引用
收藏
页码:44 / 54
页数:11
相关论文
共 50 条
  • [1] Challenges in Information Retrieval from Unstructured Arabic Data
    Khalil, Hussein
    Osman, Taha
    [J]. 2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 456 - 461
  • [2] Arabic information retrieval
    Abu El-Khair, Ibrahim
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2007, 41 : 505 - 533
  • [3] Arabic Information Retrieval
    Darwish, Kareem
    Magdy, Walid
    [J]. FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2013, 7 (04): : I - 342
  • [4] The Use of Arabic WordNet in Arabic Information Retrieval
    Abbache, Ahmed
    Barigou, Fatiha
    Belkredim, Fatma Zohra
    Belalem, Ghalem
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2014, 4 (03) : 54 - 65
  • [5] Information Extraction from Unstructured Recipe Data
    Silva, Nuno
    Ribeiro, David
    Ferreira, Liliana
    [J]. PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND TECHNOLOGY APPLICATIONS (ICCTA 2019), 2019, : 165 - 168
  • [6] Sampling Process Information from Unstructured Data
    Popp, J.
    Ortloff, D.
    Schmidt, T.
    Hahn, K.
    Mielke, M.
    Brueck, R.
    [J]. 2011 22ND ANNUAL IEEE/SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE (ASMC), 2011,
  • [7] CONDOR, AN INTEGRATED DATA-BASE INFORMATION-RETRIEVAL SYSTEM FOR STRUCTURED AND UNSTRUCTURED DATA
    FISCHER, HG
    [J]. SIEMENS FORSCHUNGS-UND ENTWICKLUNGSBERICHTE-SIEMENS RESEARCH AND DEVELOPMENT REPORTS, 1981, 10 (03): : 179 - 187
  • [8] Arabic Information Retrieval: Stemming or Lemmatization?
    Zeroual, Imad
    Lakhouaja, Abdelhak
    [J]. 2017 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2017,
  • [9] Semantic Boolean Arabic Information Retrieval
    Elabd, Emad
    Alshari, Eissa
    Abdulkader, Hatem
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (03) : 311 - 316
  • [10] Arabic Studies' Progress in Information Retrieval
    Hanandeh, Essam
    Khafajah, Hayel
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (02) : 234 - 238