A RE-RANKER SCHEME FOR INTEGRATING LARGE SCALE NLU MODELS

被引:0
|
作者
Su, Chengwei [1 ]
Gupta, Rahul [1 ]
Ananthakrishnan, Shankar [1 ]
Matsoukas, Spyros [1 ]
机构
[1] Amazon Com, Seattle, WA 98109 USA
关键词
Re-ranking; calibration; multi-task learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large scale Natural Language Understanding (NLU) systems are typically trained on large quantities of data, requiring a fast and scalable training strategy. A typical design for NLU systems consists of domain-level NLU modules (domain classifier, intent classifier and named entity recognizer). Hypotheses (NLU interpretations consisting of various intent+slot combinations) from these domain specific modules are typically aggregated with another downstream component. The re-ranker integrates outputs from domain-level recognizers, returning a scored list of cross domain hypotheses. An ideal re-ranker will exhibit the following two properties: (a) it should prefer the most relevant hypothesis for the given input as the top hypothesis and, (b) the interpretation scores corresponding to each hypothesis produced by the re-ranker should be calibrated. Calibration allows the final NLU interpretation score to be comparable across domains. We propose a novel re-ranker strategy that addresses these aspects, while also maintaining domain specific modularity. We design optimization loss functions for such a modularized re-ranker and present results on decreasing the top hypothesis error rate as well as maintaining the model calibration. We also experiment with an extension involving training the domain specific re-rankers on datasets curated independently by each domain to allow further asynchronization.
引用
收藏
页码:670 / 676
页数:7
相关论文
共 50 条
  • [1] Long Document Re-ranking with Modular Re-ranker
    Gao, Luyu
    Callan, Jamie
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2371 - 2376
  • [2] Semantic Relatedness Based Re-ranker for Text Spotting
    Sabir, Ahmed
    Moreno-Noguer, Francesc
    Padro, Lluis
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3451 - 3457
  • [3] A Syntax-Aware Re-ranker for Microblog Retrieval
    Severyn, Aliaksei
    Moschitti, Alessandro
    Tsagkias, Manos
    Berendsen, Richard
    de Rijke, Maarten
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1067 - 1070
  • [4] A swarm-inspired re-ranker system for statistical machine translation
    Farzi, Saeed
    Faili, Heshaam
    COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01): : 45 - 62
  • [5] AN EFFICIENT DP-SGD MECHANISM FOR LARGE SCALE NLU MODELS
    Dupuy, Christophe
    Arava, Radhika
    Gupta, Rahul
    Rumshisky, Anna
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4118 - 4122
  • [6] Grounded Dialogue Generation with Cross-encoding Re-ranker, Grounding Span Prediction, and Passage Dropout
    Li, Kun
    Zhang, Tianhua
    Tang, Liping
    Li, Junan
    Lu, Hongyuan
    Wu, Xixin
    Meng, Helen
    PROCEEDINGS OF THE SECOND DIALDOC WORKSHOP ON DOCUMENT-GROUNDED DIALOGUE AND CONVERSATIONAL QUESTION ANSWERING (DIALDOC 2022), 2022, : 123 - 129
  • [7] Retrieval for Extremely LongQueries and Documents with RPRS: A Highly Efficient and Effective Transformer-based Re-Ranker
    Askari, Arian
    Verberne, Suzan
    Abolghasemi, Amin
    Kraaij, Wessel
    Pasi, Gabriella
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (05)
  • [8] MergeRUCB: A Method for Large-Scale Online Ranker Evaluation
    Zoghi, Masrour
    Whiteson, Shimon
    de Rijke, Maarten
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 17 - 26
  • [9] MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation
    Li, Chang
    Markov, Ilya
    De Rijke, Maarten
    Zoghi, Masrour
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (04)
  • [10] A splitting scheme for large-scale atmosphere dynamics models
    Bourchtein, Andrei
    Bourchtein, Ludmila
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 2, PROCEEDINGS, 2008, 5073 : 627 - 640