Automatic acquisition of qualia structure from corpus data

Cited by: 1
Authors
Yamada, Ichiro [1]
Baldwin, Timothy [2]
Sumiyoshi, Hideki [1]
Shibata, Masahiro [1]
Yagi, Nobuyuki [1]
Affiliations
[1] Science and Technical Research Laboratories, NHK, Tokyo, 157-8510, Japan
[2] Department of Computer Science and Software Engineering, University of Melbourne, VIC 3010, Australia
DOI
10.1093/ietisy/e90-d.10.1534
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a method to automatically acquire a given noun's telic and agentive roles from corpus data. These relations form part of the qualia structure assumed in the generative lexicon, where the telic role represents a typical purpose of the entity and the agentive role represents the origin of the entity. Our proposed method employs a supervised machine-learning technique which makes use of template-based contextual features derived from token instances of each noun. The output of our method is a ranked list of verbs for each noun, across the different qualia roles. We also propose a variant of Spearman's rank correlation to evaluate the correlation of two top-N ranked lists. Using this correlation method, we demonstrate the ability of the proposed method to identify qualia structure relative to a conventional template-based method. Copyright © 2007 The Institute of Electronics, Information and Communication Engineers.
Pages: 1534-1541
Related papers
  • [1] Automatic acquisition of qualia structure from corpus data
    Yamada, Ichiro
    Baldwin, Timothy
    Sumiyoshi, Hideki
    Shibata, Masahiro
    Yagi, Nobuyuki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (10): 1534-1541
  • [2] Automatic acquisition of a Slovak lexicon from a raw corpus
    Sagot, B
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2005, 3658: 156-163
  • [3] Automatic acquisition of basic Katakana lexicon from a given corpus
    Nakazawa, T
    Kawahara, D
    Kurohashi, S
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651: 682-693
  • [4] Automatic acquisition of basic Katakana lexicon from a given corpus
    University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
    Lect. Notes Comput. Sci.: 682-693
  • [5] Automatic acquisition of terminological relations from a corpus for query expansion
    Jean-David, Sta
    SIGIR Forum (ACM Special Interest Group on Information Retrieval), 1998: 371-372
  • [6] Automatic acquisition of Chinese-English parallel corpus from the web
    Zhang, Ying
    Wu, Ke
    Gao, Jianfeng
    Vines, Phil
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936: 420-431
  • [7] AUTOMATIC ACQUISITION OF DATA FROM LABORATORY INSTRUMENTS
    PEZZI, G
    VALLINI, G
    ZUCKERINDUSTRIE, 1994, 119 (06): 482-484
  • [8] AUTOMATIC ACQUISITION OF DATA FROM TENSIOMETERS WITH MERCURY MANOMETERS
    ATTEIA, O
    DUBOIS, JP
    SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1993, 57 (03): 689-690
  • [9] Automatic discovery of telic and agentive roles from corpus data
    Yamada, Ichiro
    Baldwin, Timothy
    PACLIC 18: PROCEEDINGS OF THE 18TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2004: 115-125
  • [10] Automatic Acquisition of Large-scale Academic Bilingual Parallel Corpus from the Web
    Han Yong
    Li Yu
    He Xiaoning
    Yang Muyun
    Lei Guohua
    2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009: 318-321