Top-K Ranking Deep Contextual Bandits for Information Selection Systems

Cited by: 2

Authors:

Freeman, Jade [1]

Rawson, Michael [2]

Affiliations:

[1] DEVCOM Army Res Lab, 2800 Powder Mill Rd, Adelphi, MD 20883 USA

[2] Univ Maryland, Dept Math, College Pk, MD 20742 USA

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2021

DOI:

10.1109/SMC52423.2021.9658912

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

In today's technology environment, information is abundant, dynamic, and heterogeneous in nature. Automated filtering and prioritization of information depends on whether the information adds substantial value toward one's goal. Contextual multi-armed bandits have been widely used for learning to filter content and prioritize it according to user interest or relevance. Learning-to-rank techniques optimize the relevance ranking of items, allowing content to be selected accordingly. We propose a novel approach to top-K ranking under the contextual multi-armed bandit framework. We model the stochastic reward function with a neural network, whose non-linear approximation can learn the relationship between rewards and contexts. We demonstrate the approach and evaluate its learning performance in experiments using real-world data sets in simulated scenarios. Empirical results show that this approach performs well under a complex reward structure and high-dimensional contextual features.
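
The abstract's idea (score each candidate item from its context with a neural reward model, then present the K best) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' method: the abstract does not state the network architecture or the exploration strategy, so the one-hidden-layer network, the epsilon-greedy exploration, and the names `TinyRewardNet` and `select_top_k` are all assumptions made here.

```python
import math
import random


class TinyRewardNet:
    """Hypothetical one-hidden-layer network: context vector -> predicted reward."""

    def __init__(self, n_features, n_hidden=8, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_features)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0
        self.lr = lr

    def _hidden(self, x):
        # tanh activations of the hidden layer
        return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
                for row, b in zip(self.w1, self.b1)]

    def predict(self, x):
        h = self._hidden(x)
        return sum(w * hj for w, hj in zip(self.w2, h)) + self.b2

    def update(self, x, reward):
        """One SGD step on half the squared error for a single observation."""
        h = self._hidden(x)
        err = sum(w * hj for w, hj in zip(self.w2, h)) + self.b2 - reward
        for j, hj in enumerate(h):
            # Gradient w.r.t. the hidden pre-activation (uses the old w2[j]).
            grad_pre = err * self.w2[j] * (1.0 - hj * hj)
            self.w2[j] -= self.lr * err * hj
            self.b1[j] -= self.lr * grad_pre
            for i, xi in enumerate(x):
                self.w1[j][i] -= self.lr * grad_pre * xi
        self.b2 -= self.lr * err


def select_top_k(model, contexts, k, epsilon, rng):
    """Return k distinct item indices, best predicted reward first.

    Assumed exploration rule: each slot is filled by a random remaining
    item with probability epsilon, otherwise by the best remaining item.
    """
    remaining = list(range(len(contexts)))
    scores = {i: model.predict(contexts[i]) for i in remaining}
    ranking = []
    for _ in range(min(k, len(remaining))):
        if rng.random() < epsilon:
            pick = rng.choice(remaining)
        else:
            pick = max(remaining, key=scores.__getitem__)
        remaining.remove(pick)
        ranking.append(pick)
    return ranking
```

In a bandit loop under these assumptions, one would call `select_top_k` each round, observe a reward for each presented item, and feed each `(context, reward)` pair to `update`. Setting `epsilon` to 0 gives a purely greedy ranking by predicted reward.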

Pages: 2209-2214

Page count: 6
