Mining Query-Based Subnetwork Outliers in Heterogeneous Information Networks

被引:11
|
作者
Zhuang, Honglei [1 ]
Zhang, Jing [2 ]
Brova, George [1 ]
Tang, Jie [2 ]
Cam, Hasan [3 ]
Yan, Xifeng [4 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[3] US Army, Res Lab, Adelphi, MD USA
[4] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
D O I
10.1109/ICDM.2014.85
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining outliers in a heterogeneous information network is a challenging problem: It is even unclear what should be outliers in a large heterogeneous network (e.g., outliers in the entire bibliographic network consisting of authors, titles, papers and venues). In this study, we propose an interesting class of outliers, query-based subnetwork outliers: Given a heterogeneous network, a user raises a query to retrieve a set of task-relevant subnetworks, among which, subnetwork outliers are those that significantly deviate from others (e.g., outliers of author groups among those studying "topic modeling"). We formalize this problem and propose a general framework, where one can query for finding subnetwork outliers with respect to different semantics. We introduce the notion of subnetwork similarity that captures the proximity between two subnetworks by their membership distributions. We propose an outlier detection algorithm to rank all the subnetworks according to their outlierness without tuning parameters. Our quantitative and qualitative experiments on both synthetic and real data sets show that the proposed method outperforms other baselines.
引用
收藏
页码:1127 / 1132
页数:6
相关论文
共 50 条
  • [1] Intertopic Information Mining for Query-Based Summarization
    Ouyang, You
    Li, Wenjie
    Li, Sujian
    Lu, Qin
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (05): : 1062 - 1072
  • [2] QANet: Tensor Decomposition Approach for Query-Based Anomaly Detection in Heterogeneous Information Networks
    Ranjbar, Vahid
    Salehi, Mostafa
    Jandaghi, Pegah
    Jalili, Mahdi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2178 - 2189
  • [3] Mining data by query-based error-propagation
    Lai, LB
    Chang, RI
    Kouh, JS
    ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 1224 - 1233
  • [4] Query-based association rule mining supporting user perspective
    Song, Seung-Jae
    Kim, Eung-Hee
    Kim, Hong-Gee
    Kumar, Harshit
    COMPUTING, 2011, 93 (01) : 1 - 25
  • [5] Disease diagnosis using query-based neural networks
    Chang, RI
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 3, PROCEEDINGS, 2005, 3498 : 767 - 773
  • [6] Query-Based Information, Self-Construals and Persuasion
    Ku, Hsuan-Hsuan
    NTU MANAGEMENT REVIEW, 2021, 31 (02): : 101 - 128
  • [7] Query-based association rule mining supporting user perspective
    Seung-Jae Song
    Eung-Hee Kim
    Hong-Gee Kim
    Harshit Kumar
    Computing, 2011, 93 : 1 - 25
  • [8] On Detecting Association-Based Clique Outliers In Heterogeneous Information Networks
    Gupta, Manish
    Gao, Jing
    Yan, Xifeng
    Cam, Hasan
    Han, Jiawei
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 114 - 121
  • [9] Maximizing the Lifetime of Query-Based Wireless Sensor Networks
    Degirmenci, Guvenc
    Kharoufeh, Jeffrey P.
    Prokopyev, Oleg A.
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2014, 10 (04)
  • [10] On the performance evaluation of query-based wireless sensor networks
    Degirmenci, Guvenc
    Kharoufeh, Jeffrey P.
    Baldwin, Rusty O.
    PERFORMANCE EVALUATION, 2013, 70 (02) : 124 - 147