Mining the Best Observational Window to Model Social Phenomena

被引:0
|
作者
Yan, Chao [1 ]
Yin, Zhijun [1 ]
Xiang, Stanley [1 ]
Chen, You [1 ]
Vorobeychik, Yevgeniy [2 ]
Fabbri, Daniel [1 ]
Kho, Abel [3 ]
Liebovitz, David [4 ]
Malin, Bradley [1 ]
机构
[1] Vanderbilt Univ, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Washington Univ, St Louis, MO 63110 USA
[3] Northwestern Univ, Evanston, IL USA
[4] Univ Chicago, Chicago, IL 60637 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Data Mining; Organizational modeling; Temporal optimization; Anomaly detection; CARE; NETWORKS;
D O I
10.1109/CIC.2018.00-41
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The structure and behavior of organizations can be learned by mining the event logs of the information systems they manage. This supports numerous applications, such as inferring the structure of social relations, uncovering implicit workflows, and detecting illicit behavior. However, to date, no clear guidelines regarding how to select an appropriate time period to perform organizational modeling have been articulated. This is a significant concern because an inaccurately defined period can lead to incorrect models and poor performance in data-driven applications. In this paper, we introduce a data-driven approach to infer the optimal time period for organizational modeling. Our approach 1) represents the system as a social network, 2) decomposes it into its respective principal components, and 3) optimizes the signal-to-noise ratio over varying temporal observation windows. In doing so, we minimize the variance in the organizational structure while maximizing its patterns. We assess the capability of this approach using an anomaly detection scenario, which is based on the patterns learned from the interactions documented in audit logs. The classification performance of two known algorithms is investigated over a range of time periods in two representative datasets. First, we use the electronic health record access logs from Northwestern Memorial Hospital to demonstrate that our framework detects a period that coincides with the optimal performance of the anomaly detection algorithms. Second, we assess the generalizability of the framework through an analysis with a less clearly defined organization, in the form of the social network inferred from the DBLP co-authorship dataset. The results with this data further illustrate that our framework can discover the optimal time period in the context of a more loosely organized group.
引用
收藏
页码:46 / 55
页数:10
相关论文
共 50 条
  • [11] A basic city simulation model for evaluating social phenomena
    Ichikawa, Manabu
    Koyama, Yuhsuke
    Deguchi, Hiroshi
    AGENT-BASED APPROACHES IN ECONOMIC AND SOCIAL COMPLEX SYSTEMS IV, 2007, 3 : 71 - +
  • [12] Coco Model: A Predictive Model to Explain Interaction Phenomena in Social Networks
    Liu, Chen
    Deng, Qianni
    2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 282 - 286
  • [13] OBSERVATIONAL EVIDENCE OF ANOMALISTIC PHENOMENA
    BAKER, RML
    JOURNAL OF THE ASTRONAUTICAL SCIENCES, 1968, 15 (01): : 31 - &
  • [14] Mining frequent itemsets in data streams using the weighted sliding window model
    Tsai, Pauray S. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) : 11617 - 11625
  • [15] A fuzzy approach for the model of sliding window. An application to behaviour patterns mining
    Ros, Maria
    Delgado, Miguel
    Vila, Amparo M.
    PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 1211 - 1216
  • [17] THE WORLDS BEST DESIGNS IN WINDOW DISPLAY
    WOHLWEND, F
    GRAPHIS, 1984, 40 (231): : 80 - &
  • [18] Capabilities of the Window Observational Research Facility
    Eppler, D
    Scott, KP
    Conover, S
    Turner, R
    SPACE TECHNOLOGY AND APPLICATIONS INTERNATIONAL FORUM - 1999, PTS ONE AND TWO, 1999, 458 : 145 - 150
  • [19] Impact of an observational time window on coupled data assimilation: simulation with a simple climate model
    Zhao, Yuxin
    Deng, Xiong
    Zhang, Shaoqing
    Liu, Zhengyu
    Liu, Chang
    Vecchi, Gabriel
    Han, Guijun
    Wu, Xinrong
    NONLINEAR PROCESSES IN GEOPHYSICS, 2017, 24 (04) : 681 - 694
  • [20] A Recruitment System Based on Data Mining: Finding the Best Candidate from Social Media
    Pei, Caixia
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2025,