STATISTICAL ANALYSIS OF k-NEAREST NEIGHBOR COLLABORATIVE RECOMMENDATION

被引:19
|
作者
Biau, Gerard [1 ,2 ]
Cadre, Benoit [3 ]
Rouviere, Laurent [4 ]
机构
[1] Univ Paris 06, LSTA, F-75013 Paris, France
[2] Univ Paris 06, LPMA, F-75013 Paris, France
[3] UEB, IRMAR, ENS CACHAN BRETAGNE, CNRS, F-35170 Bruz, France
[4] UEB, IRMAR, CREST ENSAI, F-35172 Bruz, France
来源
ANNALS OF STATISTICS | 2010年 / 38卷 / 03期
关键词
Collaborative recommendation; cosine-type similarity; nearest neighbor estimate; consistency; rate of convergence; SYSTEMS;
D O I
10.1214/09-AOS759
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Collaborative recommendation is an information-filtering technique that attempts to present information items that are likely of interest to an Internet user. Traditionally, collaborative systems deal with situations with two types of variables, users and items. In its most common form, the problem is framed as trying to estimate ratings for items that have not yet been consumed by a user. Despite wide-ranging literature, little is known about the statistical properties of recommendation systems. In fact, no clear probabilistic model even exists which would allow us to precisely describe the mathematical forces driving collaborative filtering. To provide an initial contribution to this, we propose to set out a general sequential stochastic model for collaborative recommendation. We offer an in-depth analysis of the so-called cosine-type nearest neighbor collaborative method, which is one of the most widely used algorithms in collaborative filtering, and analyze its asymptotic performance as the number of users grows. We establish consistency of the procedure under mild assumptions on the model. Rates of convergence and examples are also provided.
引用
收藏
页码:1568 / 1592
页数:25
相关论文
共 50 条
  • [41] An optimal k-nearest neighbor for density estimation
    Kung, Yi-Hung
    Lin, Pei-Sheng
    Kao, Cheng-Hsiung
    STATISTICS & PROBABILITY LETTERS, 2012, 82 (10) : 1786 - 1791
  • [42] Enhanced Weighted K-nearest Neighbor Positioning
    Li, Xinze
    Al-Tous, Hanan
    Hajri, Salah Eddine
    Tirkkonen, Olav
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [43] Privacy preserving K-nearest neighbor classification
    Zhan, Justin
    Chang, Li Wu
    Matwin, Stan
    International Journal of Network Security, 2005, 1 (01) : 46 - 51
  • [44] MULTIVARIATE K-NEAREST NEIGHBOR DENSITY ESTIMATES
    MACK, YP
    ROSENBLATT, M
    JOURNAL OF MULTIVARIATE ANALYSIS, 1979, 9 (01) : 1 - 15
  • [45] Intrusion Detection Using k-Nearest Neighbor
    Govindarajan, M.
    Chandrasekaran, R. M.
    FIRST INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING 2009 (ICAC 2009), 2009, : 13 - +
  • [46] Graph Clustering with K-Nearest Neighbor Constraints
    Jakawat, Wararat
    Makkhongkaew, Raywat
    2019 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2019), 2019, : 309 - 313
  • [47] A "soft" K-Nearest Neighbor voting scheme
    Mitchell, HB
    Schaefer, PA
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2001, 16 (04) : 459 - 468
  • [48] Map Reduce by K-Nearest Neighbor Joins
    Bethu, Srikanth
    Babu, B. Sankara
    Rao, S. Govinda
    Florence, R. Aruna
    2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 222 - 231
  • [49] Asymptotics of k-nearest Neighbor Riesz Energies
    Douglas P. Hardin
    Edward B. Saff
    Oleksandr Vlasiuk
    Constructive Approximation, 2024, 59 : 333 - 383
  • [50] k-nearest neighbor estimation of entropies with confidence
    Sricharan, Kumar
    Raich, Raviv
    Hero, Alfred O., III
    2011 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2011, : 1205 - 1209