A RE-RANKER SCHEME FOR INTEGRATING LARGE SCALE NLU MODELS

被引:0
|
作者
Su, Chengwei [1 ]
Gupta, Rahul [1 ]
Ananthakrishnan, Shankar [1 ]
Matsoukas, Spyros [1 ]
机构
[1] Amazon Com, Seattle, WA 98109 USA
关键词
Re-ranking; calibration; multi-task learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large scale Natural Language Understanding (NLU) systems are typically trained on large quantities of data, requiring a fast and scalable training strategy. A typical design for NLU systems consists of domain-level NLU modules (domain classifier, intent classifier and named entity recognizer). Hypotheses (NLU interpretations consisting of various intent+slot combinations) from these domain specific modules are typically aggregated with another downstream component. The re-ranker integrates outputs from domain-level recognizers, returning a scored list of cross domain hypotheses. An ideal re-ranker will exhibit the following two properties: (a) it should prefer the most relevant hypothesis for the given input as the top hypothesis and, (b) the interpretation scores corresponding to each hypothesis produced by the re-ranker should be calibrated. Calibration allows the final NLU interpretation score to be comparable across domains. We propose a novel re-ranker strategy that addresses these aspects, while also maintaining domain specific modularity. We design optimization loss functions for such a modularized re-ranker and present results on decreasing the top hypothesis error rate as well as maintaining the model calibration. We also experiment with an extension involving training the domain specific re-rankers on datasets curated independently by each domain to allow further asynchronization.
引用
收藏
页码:670 / 676
页数:7
相关论文
共 50 条
  • [41] Integrating the functional encryption and proxy re-cryptography to secure DRM scheme
    Abdalla H.
    Hu X.
    Wahaballa A.
    Abdalla A.
    Ramadan M.
    Zhiguang Q.
    Abdalla, Hisham (hisham_awaw@hotmail.com), 1600, Femto Technique Co., Ltd. (19): : 27 - 38
  • [42] BiGG Models: A platform for integrating, standardizing and sharing genome-scale models
    King, Zachary A.
    Lu, Justin
    Draeger, Andreas
    Miller, Philip
    Federowicz, Stephen
    Lerman, Joshua A.
    Ebrahim, Ali
    Palsson, Bernhard O.
    Lewis, Nathan E.
    NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D515 - D522
  • [43] Delayed Difference Scheme for Large Scale Scientific Simulations
    Mudigere, Dheevatsa
    Sherlekar, Sunil D.
    Ansumali, Santosh
    PHYSICAL REVIEW LETTERS, 2014, 113 (21)
  • [44] Localization Scheme for Large Scale Wireless Sensor Networks
    Tinh, Pham Doan
    Noguchi, Taku
    Kawai, Makoto
    ISSNIP 2008: PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS, AND INFORMATION PROCESSING, 2008, : 25 - 30
  • [45] Multicast scheme for large scale input queued switch
    Kawarai, K
    Matsuoka, N
    Tomonaga, H
    Kato, T
    Hakata, A
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 316 - 319
  • [46] A scalable key agreement scheme for large scale networks
    Zhou, Yun
    Fang, Yuguang
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, 2006, : 631 - 636
  • [47] A scheme for efficient construction of large scale cluster state
    Diao, Da-Sheng
    Zhang, Jin-Juan
    Zhang, Yong-Sheng
    OPTIK, 2017, 146 : 33 - 37
  • [48] A couple scheme for a large-scale fluid simulation
    Wu, Xiaolong
    Wu, Enhua
    Zhang, Hui
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2011, 23 (06): : 1028 - 1033
  • [49] Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia
    Ponzetto, Simone Paolo
    Navigli, Roberto
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 2083 - 2088
  • [50] Integrating Large-Scale Photovoltaic Power Plants into the Grid
    Jansson, Peter Mark
    Michelfelder, Richard A.
    Udo, Victor E.
    Sheehan, Gary
    Hetznecker, Sarah
    Freeman, Michael
    2008 IEEE ENERGY 2030 CONFERENCE, 2008, : 120 - +