Dependency parsing of Japanese monologue using clause boundaries

被引:0
|
作者
Tomohiro Ohno
Shigeki Matsubara
Hideki Kashioka
Takehiko Maruyama
Hideki Tanaka
Yasuyoshi Inagaki
机构
[1] Nagoya University,Department of Information Engineering, Graduate School of Information Science
[2] Nagoya University,Information Technology Center
[3] ATR Spoken Language Communication Research Laboratories,Faculty of Information Science and Technology
[4] The National Institute for Japanese Language,undefined
[5] NHK Science & Technical Research Laboratories,undefined
[6] Aichi Prefectural University,undefined
来源
关键词
Dependency structure; Parsing accuracy; Parsing time; Sentence segmentation; Speech corpus; Speech understanding; Spoken language; Stochastic parsing; Syntactically annotated corpus;
D O I
暂无
中图分类号
学科分类号
摘要
Spoken monologues feature greater sentence length and structural complexity than spoken dialogues. To achieve high-parsing performance for spoken monologues, simplifying the structure by dividing a sentence into suitable language units could prove effective. This paper proposes a method for dependency parsing of Japanese spoken monologues based on sentence segmentation. In this method, dependency parsing is executed in two stages: at the clause level and the sentence level. First, dependencies within a clause are identified by dividing a sentence into clauses and executing stochastic dependency parsing for each clause. Next, dependencies across clause boundaries are identified stochastically, and the dependency structure of the entire sentence is thus completed. An experiment using a spoken monologue corpus shows the effectiveness of this method for efficient dependency parsing of Japanese monologue sentences.
引用
收藏
页码:263 / 279
页数:16
相关论文
共 50 条
  • [1] Dependency parsing of Japanese monologue using clause boundaries
    Ohno, Tomohiro
    Matsubara, Shigeki
    Kashioka, Hideki
    Maruyama, Takehiko
    Tanaka, Hideki
    Inagaki, Yasuyoshi
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2006, 40 (3-4) : 263 - 279
  • [2] Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries
    Ohno, Tomohiro
    Matsubara, Shigeki
    Kashioka, Hideki
    Maruyama, Takehiko
    Inagaki, Yasuyoshi
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 169 - 176
  • [3] Dependency Parsing of Japanese Spoken Monologue Based on Clause-Starts Detection
    Ohno, Tomohiro
    Matsubara, Shigeki
    Kashioka, Hideki
    Inagaki, Yasuyoshi
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2454 - +
  • [4] Annotated Clause Boundaries' Influence on Parsing Results
    Sarg, Dage
    Muischnek, Kadri
    Muurisep, Kaili
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 171 - 179
  • [5] Relative Clause Attachment: Nondeterminism in Japanese Parsing
    Yuki Kamide
    Don C. Mitchell
    [J]. Journal of Psycholinguistic Research, 1997, 26 : 247 - 254
  • [6] Relative clause attachment: Nondeterminism in Japanese parsing
    Kamide, Y
    Mitchell, DC
    [J]. JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 1997, 26 (02) : 247 - 254
  • [7] Intraclausal Coordination and Clause Detection as a Preprocessing Step to Dependency Parsing
    Marincic, Domen
    Gams, Matjaz
    Sef, Tomaz
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 147 - 153
  • [8] Extracting Experiences Using Dependency Parsing on Japanese E-commerce Websites
    Hagiwara, Kazuki
    Ono, Kazuki
    Hatano, Kenji
    [J]. 2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 813 - 818
  • [9] Robust dependency parsing of spontaneous Japanese spoken language
    Ohno, T
    Matsubara, S
    Kawaguchi, N
    Inagaki, Y
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 545 - 552
  • [10] Using Smaller Constituents Rather Than Sentences in Active Learning for Japanese Dependency Parsing
    Sassano, Manabu
    Kurohashi, Sadao
    [J]. ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 356 - 365