Corpus-based semantic role approach in information retrieval

被引:24
|
作者
Moreda, Palorna [1 ]
Navarro, Borja [1 ]
Palomar, Manuel [1 ]
机构
[1] Univ Alicante, Dept Software & Comp Syst, Nat Language Proc & Informat Syst Grp, E-03080 Alicante, Spain
关键词
semantic roles; information retrieval systems; corpus-based methods; feature selection procedure; word sense disambiguation; shallow parsing; PoS tag; lemma;
D O I
10.1016/j.datak.2006.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a method to determine the semantic role for the constituents of a sentence is presented. This method, named SemRol, is a corpus-based approach that uses two different statistical models, conditional Maximum Entropy (ME) Probability Models and the TiMBL program, a Memory-based Learning. It consists of three phases that make use of features using words, lemmas, PoS tags and shallow parsing information. Our method introduces a new phase in the Semantic Role Labeling task which has usually been approached as a two phase procedure consisting of recognition and labeling arguments. From our point of view, firstly the sense of the verbs in the sentences must be disambiguated. That is why depending on the sense of the verb a different set of roles must be considered. Regarding the labeling arguments phase, a tuning procedure is presented. As a result of this procedure one of the best sets of features for the labeling arguments task is detected. With this set, that is different for TiMBL and ME, precisions of 76.71% for TiMBL or 70.55% for ME, are obtained. Furthermore, the semantic role information provided by our SemRol method could be used as an extension of Information Retrieval or Question Answering systems. We propose using this semantic information as an extension of an Information Retrieval system in order to reduce the number of documents or passages retrieved by the system. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:467 / 483
页数:17
相关论文
共 50 条
  • [1] An axiomatic approach to corpus-based cross-language information retrieval
    Rahimi, Razieh
    Montazeralghaem, Ali
    Shakery, Azadeh
    [J]. INFORMATION RETRIEVAL JOURNAL, 2020, 23 (03): : 191 - 215
  • [2] An axiomatic approach to corpus-based cross-language information retrieval
    Razieh Rahimi
    Ali Montazeralghaem
    Azadeh Shakery
    [J]. Information Retrieval Journal, 2020, 23 : 191 - 215
  • [3] A Corpus-based Approach to the Semantic Prosody of DOG
    周美芝
    [J]. 海外英语, 2012, (04) : 273 - 274
  • [4] Using Corpus-Based Approaches in a System for Multilingual Information Retrieval
    Martin Braschler
    Peter Schäuble
    [J]. Information Retrieval, 2000, 3 : 273 - 284
  • [5] Using corpus-based approaches in a system for multilingual information retrieval
    Braschler, M
    Schäuble, P
    [J]. INFORMATION RETRIEVAL, 2000, 3 (03): : 273 - 284
  • [6] Behavioral profiles A corpus-based approach to cognitive semantic analysis
    Gries, Stefan Th.
    Divjak, Dagmar
    [J]. NEW DIRECTIONS IN COGNITIVE LINGUISTICS, 2009, 24 : 57 - 75
  • [7] Corpus-based cross-language information retrieval in retrieval of highly relevant documents
    Talvensaari, Tuomas
    Juhola, Martti
    Laurikkala, Jorma
    Jarvelin, Kalervo
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (03): : 322 - 334
  • [8] Complementing WordNet with Roget's and corpus-based thesauri for information retrieval
    Mandala, R
    Tokunaga, T
    Tanaka, H
    [J]. NINTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS, 1999, : 94 - 101
  • [9] A Corpus-based Semantic Study of Possibly
    Wu, Guoliang
    Feng, Chuncan
    [J]. PROCEEDINGS OF 2011 INTERNATIONAL SYMPOSIUM ON COGNITIVE LINGUISTICS AND ENGLISH LEARNING, 2012, : 190 - 197
  • [10] Corpus-based learning of analogies and semantic relations
    Turney, PD
    Littman, ML
    [J]. MACHINE LEARNING, 2005, 60 (1-3) : 251 - 278