A RE-RANKER SCHEME FOR INTEGRATING LARGE SCALE NLU MODELS

Cited by: 0
Authors
Su, Chengwei [1]
Gupta, Rahul [1]
Ananthakrishnan, Shankar [1]
Matsoukas, Spyros [1]
Affiliations
[1] Amazon.com, Seattle, WA 98109 USA
Keywords
Re-ranking; calibration; multi-task learning
DOI
Not available
CLC Number
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Large scale Natural Language Understanding (NLU) systems are typically trained on large quantities of data, requiring a fast and scalable training strategy. A typical NLU system consists of domain-level modules (a domain classifier, an intent classifier, and a named entity recognizer). Hypotheses (NLU interpretations consisting of various intent+slot combinations) from these domain-specific modules are aggregated by a downstream component, the re-ranker, which integrates the outputs of the domain-level recognizers and returns a scored list of cross-domain hypotheses. An ideal re-ranker exhibits two properties: (a) it places the most relevant hypothesis for the given input at the top of the list, and (b) the interpretation scores it produces for each hypothesis are calibrated. Calibration makes the final NLU interpretation scores comparable across domains. We propose a novel re-ranker strategy that addresses both aspects while maintaining domain-specific modularity. We design optimization loss functions for such a modularized re-ranker and present results showing a reduced top-hypothesis error rate while maintaining model calibration. We also experiment with an extension in which the domain-specific re-rankers are trained on datasets curated independently by each domain, allowing further asynchronization of development across domains.
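For illustration, the architecture the abstract describes can be pictured as follows: each domain keeps its own trainable scorer over its hypotheses, and a shared calibration link maps the raw scores into a common (0, 1) range so a single top hypothesis can be chosen across domains. The sketch below is a minimal hypothetical example, not the paper's implementation; the Hypothesis feature layout, the per-domain linear scorers, and the logistic calibration are all assumptions made for the example.

```python
# Minimal sketch of a modular cross-domain re-ranker with calibrated scores.
# Not the authors' implementation: the Hypothesis fields, the per-domain
# linear scorers, and the shared logistic calibration are illustrative
# assumptions based on the abstract.
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple

@dataclass
class Hypothesis:
    domain: str            # e.g. "Music", "Books"
    intent: str            # intent proposed by the domain NLU
    slots: Dict[str, str]  # named-entity / slot fills
    features: List[float]  # recognizer confidences, slot coverage, ...

class DomainReranker:
    """Per-domain linear scorer; each domain can train its own weights
    independently, preserving domain-specific modularity."""
    def __init__(self, weights: List[float], bias: float = 0.0):
        self.weights = weights
        self.bias = bias

    def raw_score(self, hyp: Hypothesis) -> float:
        return sum(w * f for w, f in zip(self.weights, hyp.features)) + self.bias

def calibrated_score(raw: float) -> float:
    """Map a raw domain score to (0, 1) through a shared logistic link,
    one simple way to make scores comparable across domains."""
    return 1.0 / (1.0 + math.exp(-raw))

def rerank(hypotheses: List[Hypothesis],
           rerankers: Dict[str, DomainReranker]) -> List[Tuple[float, Hypothesis]]:
    """Score every cross-domain hypothesis and return them best-first."""
    scored = [(calibrated_score(rerankers[h.domain].raw_score(h)), h)
              for h in hypotheses]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

# Toy usage: two domains each propose an interpretation of "play hello".
hyps = [
    Hypothesis("Music", "PlayMusic", {"song": "hello"}, [0.9, 0.7]),
    Hypothesis("Books", "ReadBook", {"title": "hello"}, [0.6, 0.8]),
]
rankers = {
    "Music": DomainReranker([1.5, 1.0]),
    "Books": DomainReranker([1.2, 0.9]),
}
for score, hyp in rerank(hyps, rankers):
    print(f"{score:.3f}  {hyp.domain}/{hyp.intent}")
```

Because each DomainReranker owns its weights, domains could train on independently curated data, mirroring the asynchronous extension mentioned in the abstract; the paper's actual loss functions for joint ranking and calibration are not reproduced here.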
Pages: 670-676
Page count: 7