Outlying Sequence Detection in Large Data Sets

被引:41
|
作者
Tajer, Ali [1 ]
Veeravalli, Venugopal V. [2 ]
Poor, H. Vincent [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Univ Illinois, Urbana, IL USA
基金
美国国家科学基金会;
关键词
ANOMALY DETECTION; OUTLIER DETECTION; ALGORITHMS; CLASSIFICATION; FREQUENCY; TESTS;
D O I
10.1109/MSP.2014.2329428
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Outliers refer to observations that do not conform to the expected patterns in high-dimensional data sets. When such outliers signify risks (e.g., in fraud detection) or opportunities (e.g., in spectrum sensing), harnessing the costs associated with the risks or missed opportunities necessitates mechanisms that can identify them effectively. Designing such mechanisms involves striking an appropriate balance between reliability and cost of sensing, as two opposing performance measures, where improving one tends to penalize the other. This article poses and analyzes outlying sequence detection in a hypothesis testing framework under different outlier recovery objectives and different degrees of knowledge about the underlying statistics of the outliers. © 2014 IEEE.
引用
收藏
页码:44 / 56
页数:13
相关论文
共 50 条
  • [1] Large outlying stable constant mean curvature spheres in initial data sets
    Simon Brendle
    Michael Eichmair
    [J]. Inventiones mathematicae, 2014, 197 : 663 - 682
  • [2] Large outlying stable constant mean curvature spheres in initial data sets
    Brendle, Simon
    Eichmair, Michael
    [J]. INVENTIONES MATHEMATICAE, 2014, 197 (03) : 663 - 682
  • [3] OUTLYING SEQUENCE DETECTION IN LARGE DATASETS: COMPARISON OF UNIVERSAL HYPOTHESIS TESTING AND CLUSTERING
    Li, Yun
    Veeravalli, Venugopal V.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6180 - 6184
  • [4] UNIVERSAL OUTLYING SEQUENCE DETECTION FOR CONTINUOUS OBSERVATIONS
    Bu, Yuheng
    Zou, Shaofeng
    Liang, Yingbin
    Veeravalli, Venugopal V.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4254 - 4258
  • [5] Outlier detection in large data sets
    Buzzi-Ferraris, Guido
    Manenti, Flavio
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2011, 35 (02) : 388 - 390
  • [6] Efficient Mining of Outlying Sequence Patterns for Analyzing Outlierness of Sequence Data
    Wang, Tingting
    Duan, Lei
    Dong, Guozhu
    Bao, Zhifeng
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (05)
  • [7] DETECTION OF OUTLYING DATA IN BIOAVAILABILITY BIOEQUIVALENCE STUDIES
    LIU, JP
    WENG, CS
    [J]. STATISTICS IN MEDICINE, 1991, 10 (09) : 1375 - 1389
  • [8] Identification of outlying observations for large-dimensional data
    Wang, Tao
    Yang, Xiaona
    Guo, Yunfei
    Li, Zhonghua
    [J]. JOURNAL OF APPLIED STATISTICS, 2023, 50 (02) : 370 - 386
  • [9] Tractable group detection on large link data sets
    Kubica, J
    Moore, A
    Schneider, J
    [J]. THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 573 - 576
  • [10] An Effective Filter for IBD Detection in Large Data Sets
    Huang, Lin
    Bercovici, Sivan
    Rodriguez, Jesse M.
    Batzoglou, Serafim
    [J]. PLOS ONE, 2014, 9 (03):