Construction of a probabilistic hierarchical structure based on a Japanese corpus and a Japanese thesaurus

Cited by: 0
Authors
Terai, Asuka [1]
Liu, Bin [2]
Nakagawa, Masanori [1]
Affiliations
[1] Tokyo Inst Technol, Meguro Ku, 2-12-1 Ookayama, Tokyo 152, Japan
[2] Nissay Informat Technol Co Ltd, Tokyo, Japan
Funding
Japan Society for the Promotion of Science
DOI
10.1007/978-3-540-78159-2_13
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The purpose of this study is to construct a probabilistic hierarchical structure of categories based on a statistical analysis of Japanese corpus data and to verify the validity of the structure by conducting a psychological experiment. First, the co-occurrence frequencies of adjectives and nouns within modification relations were extracted from a Japanese corpus. Second, a probabilistic hierarchical structure was constructed based on the probability P(category | noun), which represents the category membership of the nouns, using categorization information from a thesaurus and a soft clustering method (Rose's method [1]) with co-occurrence frequencies as initial values. This method makes it possible to identify the constructed hierarchical structure. To examine the validity of the constructed hierarchy, a psychological experiment was conducted. The results of the experiment verified the psychological validity of the hierarchical structure.
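The pipeline described in the abstract (co-occurrence counts, thesaurus-based initialization, soft assignment of nouns to categories) can be illustrated with a minimal sketch. This is not the authors' implementation: the counts, the nouns and adjectives, the two thesaurus categories, the squared Euclidean distance, and the single fixed temperature are all assumptions made for illustration. Rose's method proper lowers the temperature gradually (deterministic annealing), which this sketch omits.

```python
import math

# Toy noun-adjective co-occurrence counts; invented for illustration only.
COUNTS = {
    "dog":   {"loyal": 8, "fast": 3, "sweet": 0},
    "cat":   {"loyal": 5, "fast": 4, "sweet": 1},
    "apple": {"loyal": 0, "fast": 0, "sweet": 9},
    "peach": {"loyal": 0, "fast": 1, "sweet": 7},
}
ADJECTIVES = ["loyal", "fast", "sweet"]
# Hypothetical thesaurus categories, used only to initialise the centroids.
THESAURUS = {"animal": ["dog", "cat"], "fruit": ["apple", "peach"]}


def normalise(row):
    """Turn a row of counts into a probability distribution."""
    total = sum(row)
    return [v / total for v in row]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def soft_cluster(temperature=0.05, iterations=20):
    """Return P(category | noun) for every noun at one fixed temperature."""
    dims = range(len(ADJECTIVES))
    # Represent each noun by its estimated P(adjective | noun).
    vectors = {n: normalise([c[a] for a in ADJECTIVES]) for n, c in COUNTS.items()}
    # Initial centroid of a category: mean vector of its thesaurus members.
    centroids = {
        cat: [sum(vectors[n][i] for n in members) / len(members) for i in dims]
        for cat, members in THESAURUS.items()
    }
    membership = {}
    for _ in range(iterations):
        # Soft assignment: P(category | noun) is proportional to exp(-d / T).
        for noun, vec in vectors.items():
            weights = {
                cat: math.exp(-sq_dist(vec, y) / temperature)
                for cat, y in centroids.items()
            }
            z = sum(weights.values())
            membership[noun] = {cat: w / z for cat, w in weights.items()}
        # Centroid update: membership-weighted mean of the noun vectors.
        for cat in centroids:
            mass = sum(membership[n][cat] for n in vectors)
            centroids[cat] = [
                sum(membership[n][cat] * vectors[n][i] for n in vectors) / mass
                for i in dims
            ]
    return membership


if __name__ == "__main__":
    for noun, probs in soft_cluster().items():
        print(noun, {cat: round(p, 3) for cat, p in probs.items()})
```

Each noun receives a probability for every category rather than a single label, which is what allows a membership value such as P(category | noun) to be read off directly. Raising `temperature` makes the assignments softer; lowering it pushes them toward a hard partition.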
Pages: 132 / +
Page count: 3