NATURAL LANGUAGE GENERATION METHOD USING AUTOMATICALLY CONSTRUCTED LEXICAL RESOURCES

被引:0
|
作者
Ito, Naho [1 ]
Hagiwara, Masafumi [1 ]
机构
[1] Keio Univ, Fac Sci & Technol, Kohoku Ku, 3-14-1 Hiyoshi, Yokohama, Kanagawa 2238522, Japan
关键词
Sentence generation; N-gram; Case frame;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a natural language generation method based on automatically constructed lexical resources. Many conventional approaches in sentence generation use manually constructed templates. Therefore, the variety of available sentences depends heavily on the quality and quantity of the templates, and the cost to construct these templates is very high. The proposed sentence generation method uses large-scale case frames and Google N-gram, which both are compiled automatically from Web documents. The proposed method uses words as an input. It generates a sentence from case frames, using Google N-gram as to consider co-occurrence frequency between words. Since we only use lexical resources which are constructed automatically, the proposed method has high coverage compared with the other methods using manually constructed templates. We carried out experiments to examine the quality of generated sentences and obtained satisfactory results.
引用
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [1] Natural Language Generation Using Automatically Constructed Lexical Resources
    Ito, Naho
    Hagiwara, Masafumi
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 980 - 987
  • [2] Natural language database access using semi-automatically constructed translation knowledge
    Kang, IS
    Bae, JHJ
    Lee, JH
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 280 - 289
  • [3] Automatically Creating Multilingual Lexical Resources
    Khang Nhut Lam
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 3077 - 3078
  • [4] Using Empirically Constructed Lexical Resources for Named Entity Recognition
    Jonnalagadda, Siddhartha
    Cohen, Trevor
    Wu, Stephen
    Liu, Hongfang
    Gonzalez, Graciela
    [J]. BIOMEDICAL INFORMATICS INSIGHTS, 2013, 6 : 17 - 27
  • [5] The contribution of lexical resources to natural language processing of CJK languages
    Halpern, Jack
    [J]. Chinese Spoken Language Processing, Proceedings, 2006, 4274 : 768 - 780
  • [6] Corpus-based lexical choice in natural language generation
    Bangalore, S
    Rambow, O
    [J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 464 - 471
  • [7] Natural language processing using lexical and logical combinators
    Ortiz, Juan Fernandez
    Villadsen, Jorgen
    [J]. LOGIC PROGRAMMING, PROCEEDINGS, 2006, 4079 : 444 - 446
  • [8] A Repository of Data and Evaluation Resources for Natural Language Generation
    Belz, Anja
    Gatt, Albert
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 4027 - 4032
  • [9] An Investigation into the Validity of Some Metrics for Automatically Evaluating Natural Language Generation Systems
    Reiter, Ehud
    Belz, Anja
    [J]. COMPUTATIONAL LINGUISTICS, 2009, 35 (04) : 529 - 558
  • [10] IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation
    Cahyawijaya, Samuel
    Winata, Genta Indra
    Wilie, Bryan
    Vincentio, Karissa
    Li, Xiaohong
    Kuncoro, Adhiguna
    Ruder, Sebastian
    Lim, Zhi Yuan
    Bahar, Syafri
    Khodra, Masayu Leylia
    Purwarianti, Ayu
    Fung, Pascale
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8875 - 8898