Non-parametric regression for networks

被引:4
|
作者
Severn, Katie E. [1 ]
Dryden, Ian L. [1 ]
Preston, Simon P. [1 ]
机构
[1] Univ Nottingham, Sch Math Sci, Univ Pk, Nottingham NG7 2RD, England
来源
STAT | 2021年 / 10卷 / 01期
基金
英国工程与自然科学研究理事会;
关键词
consistency; dynamic network; graph Laplacian; manifold; metric; Nadaraya-Watson; STATISTICS; MATRICES;
D O I
10.1002/sta4.373
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Network data are becoming increasingly available, and so there is a need to develop a suitable methodology for statistical analysis. Networks can be represented as graph Laplacian matrices, which are a type of manifold-valued data. Our main objective is to estimate a regression curve from a sample of graph Laplacian matrices conditional on a set of Euclidean covariates, for example, in dynamic networks where the covariate is time. We develop an adapted Nadaraya-Watson estimator which has uniform weak consistency for estimation using Euclidean and power Euclidean metrics. We apply the methodology to the Enron email corpus to model smooth trends in monthly networks and highlight anomalous networks. Another motivating application is given in corpus linguistics, which explores trends in an author's writing style over time based on word co-occurrence networks.
引用
收藏
页数:11
相关论文
共 50 条