Learning a Better Negative Sampling Policy with Deep Neural Networks for Search

Cited by: 6

Authors:

Cohen, Daniel [1]

Jordan, Scott M. [2]

Croft, W. Bruce [1]

Affiliations:

[1] Univ Massachusetts Amherst, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA

[2] Univ Massachusetts Amherst, Autonomous Learning Lab, Amherst, MA 01003 USA

Source:

PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19) | 2019

DOI:

10.1145/3341981.3344220

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

In information retrieval, sampling methods used to select documents for neural models must often deal with large class imbalances during training. This issue necessitates careful selection of negative instances when training neural models to avoid the risk of overfitting. For most work, heuristic sampling approaches, or policies, are created based off of domain experts, such as choosing samples with high BM25 scores or a random process over candidate documents. However, these sampling approaches are done with the test distribution in mind. In this paper, we demonstrate that the method chosen to sample negative documents during training plays a critical role in both the stability of training, as well as overall performance. Furthermore, we establish that using reinforcement learning to optimize a policy over a set of sampling functions can significantly improve performance over standard training practices with respect to IR metrics and is robust to hyperparameters and random seeds.

Pages: 19-26

Page count: 8
