Top-K Ranking Deep Contextual Bandits for Information Selection Systems

被引:2
|
作者
Freeman, Jade [1 ]
Rawson, Michael [2 ]
机构
[1] DEVCOM Army Res Lab, 2800 Powder Mill Rd, Adelphi, MD 20883 USA
[2] Univ Maryland, Dept Math, College Pk, MD 20742 USA
关键词
D O I
10.1109/SMC52423.2021.9658912
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In today's technology environment, information is abundant, dynamic, and heterogeneous in nature. Automated filtering and prioritization of information is based on the distinction between whether the information adds substantial value toward one's goal or not. Contextual multi-armed bandit has been widely used for learning to filter contents and prioritize according to user interest or relevance. Learn-to-Rank technique optimizes the relevance ranking on items, allowing the contents to be selected accordingly. We propose a novel approach to top-K rankings under the contextual multi-armed bandit framework. We model the stochastic reward function with a neural network to allow non-linear approximation to learn the relationship between rewards and contexts. We demonstrate the approach and evaluate the the performance of learning from the experiments using real world data sets in simulated scenarios. Empirical results show that this approach performs well under the complexity of a reward structure and high dimensional contextual features.
引用
收藏
页码:2209 / 2214
页数:6
相关论文
共 50 条
  • [31] A Top-K Retrieval algorithm based on a decomposition of ranking functions
    Madrid, Nicolas
    Rusnok, Pavel
    [J]. INFORMATION SCIENCES, 2019, 474 : 136 - 153
  • [32] Ranking uncertain sky: The probabilistic top-k skyline operator
    Zhang, Ying
    Zhang, Wenjie
    Lin, Xuemin
    Jiang, Bin
    Pei, Jian
    [J]. INFORMATION SYSTEMS, 2011, 36 (05) : 898 - 915
  • [33] Oneshot Differentially Private Top-k Selection
    Qiao, Gang
    Su, Weijie J.
    Zhang, Li
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [34] Indexable Bayesian Personalized Ranking for Effiicient Top-k Recommendation
    Le, Dung D.
    Lauw, Hady W.
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1389 - 1398
  • [35] Accelerating Top-k ListNet Training for Ranking Using FPGA
    Li, Qiang
    Fleming, Shane T.
    Thomas, David B.
    Cheung, Peter Y. K.
    [J]. 2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 245 - 248
  • [36] Optimal Instance Adaptive Algorithm for the Top-K Ranking Problem
    Chen, Xi
    Gopi, Sivakanth
    Mao, Jieming
    Schneider, Jon
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (09) : 6139 - 6160
  • [37] A Rating-Ranking Method for Crowdsourced Top-k Computation
    Li, Kaiyu
    Zhang, Xiaohang
    Li, Guoliang
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 975 - 990
  • [38] Top-K Query Retrieval of Combinations with Sum-of-Subsets Ranking
    Majumder, Subhashis
    Sanyal, Biswajit
    Gupta, Prosenjit
    Sinha, Soumik
    Pande, Shiladitya
    Hon, Wing-Kai
    [J]. COMBINATORIAL OPTIMIZATION AND APPLICATIONS (COCOA 2014), 2014, 8881 : 490 - 505
  • [39] Affirmative Action Policies for Top-k Candidates Selection
    Mathioudakis, Michael
    Castillo, Carlos
    Barnabo, Giorgio
    Celis, Sergio
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 440 - 449
  • [40] Time-homogeneous top-K ranking using tensor decompositions
    Ataei, Masoud
    Chen, Shengyuan
    Yang, Zijiang
    Peyghami, M. Reza
    [J]. Optimization Methods and Software, 2020, 35 (06) : 1119 - 1143