Model-based Outlier Detection for Object-Relational Data

Cited by: 5
Authors
Riahi, Fatemeh [1]
Schulte, Oliver [1]
Affiliations
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC, Canada
DOI
10.1109/SSCI.2015.224
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper extends unsupervised statistical outlier detection to the case of object-relational data. Object-relational data represent a complex heterogeneous network [9], which comprises objects of different types, links among these objects, also of different types, and attributes of these links. This special structure prohibits a direct vectorial data representation. We apply state-of-the-art probabilistic modelling techniques for object-relational data that construct a graphical model (Bayesian network), which compactly represents probabilistic associations in the data. We propose a new metric, based on the learned object-relational model, that quantifies the extent to which the individual association pattern of a potential outlier deviates from that of the whole population. The metric is based on the likelihood ratio of two parameter vectors: one that represents the population associations, and another that represents the individual associations. Our method is validated on synthetic datasets and on real-world datasets about soccer matches and movies. Compared to baseline methods, our novel transformed likelihood ratio achieved the best detection accuracy on all datasets.
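The likelihood-ratio idea in the abstract can be illustrated with a minimal sketch. It is not the paper's metric: it assumes a single categorical attribute instead of a learned Bayesian network, it omits the transformation the abstract mentions, and the function name, the `smoothing` parameter, and the toy data are all hypothetical. It scores an individual's observations under parameters estimated from that individual and under parameters estimated from the whole population, and returns the difference in log-likelihood.

```python
import math
from collections import Counter


def log_likelihood_ratio(individual_obs, population_obs, smoothing=1.0):
    """Log-likelihood of the individual's data under individual-specific
    parameters minus its log-likelihood under population parameters.

    Illustrative only: one categorical variable, add-`smoothing` estimates.
    """
    values = sorted(set(individual_obs) | set(population_obs))
    ind_counts = Counter(individual_obs)
    pop_counts = Counter(population_obs)

    def estimate(counts):
        # Smoothed relative frequencies over the shared value set.
        total = sum(counts.values()) + smoothing * len(values)
        return {v: (counts[v] + smoothing) / total for v in values}

    theta_ind = estimate(ind_counts)   # individual parameter vector
    theta_pop = estimate(pop_counts)   # population parameter vector

    # Sum over the individual's observed values, weighted by their counts.
    return sum(
        n * (math.log(theta_ind[v]) - math.log(theta_pop[v]))
        for v, n in ind_counts.items()
    )


# Hypothetical toy data: match outcomes for a population and two individuals.
population = ["win"] * 50 + ["loss"] * 50
typical = ["win"] * 5 + ["loss"] * 5
atypical = ["win"] * 10

print("typical:", log_likelihood_ratio(typical, population))
print("atypical:", log_likelihood_ratio(atypical, population))
```

In this sketch a larger score means the individual's own parameters explain its data much better than the population's do, which is the sense in which the individual deviates from the population.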
Pages: 1590-1598
Number of pages: 9