On Cost-Efficient Learning of Data Dependency

被引:0
|
作者
Jang, Hyeryung [1 ]
Song, Hyungseok [2 ]
Yi, Yung [2 ]
机构
[1] Dongguk Univ, Dept Artificial Intelligence, Seoul 04620, South Korea
[2] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
基金
新加坡国家研究基金会;
关键词
Costs; Distributed databases; Inference algorithms; Graphical models; Task analysis; Data models; Tree graphs; Graph structure learning; distributed inference; sample complexity; large deviation principle; belief propagation; BELIEF-PROPAGATION; DISTRIBUTED INFERENCE; TRACKING; PRODUCT;
D O I
10.1109/TNET.2022.3141128
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider the problem of learning a tree graph structure that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform a probabilistic inference task. Inference in the data graph includes marginal inference and maximum a posteriori (MAP) estimation, and belief propagation (BP) is a commonly used algorithm to compute the marginal distribution of nodes via message-passing, incurring non-negligible amount of communication cost. We inevitably have the trade-off between the inference accuracy and the message-passing cost because the learned structure of data dependency and physical connectivity graph are often highly different. In this paper, we formalize this trade-off in an optimization problem which outputs the data dependency graph that jointly considers learning accuracy and message-passing costs. We focus on two popular implementations of BP, ASYNC-BP and SYNC-BP, which have different message-passing mechanisms and cost structures. In ASYNC-BP, we propose a polynomial-time learning algorithm that is optimal, motivated by finding a maximum weight spanning tree of a complete graph. In SYNC-BP, we prove the NP-hardness of the problem and propose a greedy heuristic. For both BP implementations, we quantify how the error probability that the learned cost-efficient data graph differs from the ideal one decays as the number of data samples grows, using the large deviation principle, which provides a guideline on how many samples are necessary to obtain a certain trade-off. We validate our theoretical findings through extensive simulations, which confirms that it has a good match.
引用
收藏
页码:1382 / 1394
页数:13
相关论文
共 50 条
  • [1] Cost-Efficient Data Redundancy in the Cloud
    Waibel, Philipp
    Hochreiner, Christoph
    Schulte, Stefan
    [J]. 2016 IEEE 9TH INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA), 2016, : 1 - 9
  • [2] Cost-Efficient Partitioning of Spatial Data on Cloud
    Akdogan, Afsin
    Indrakanti, Saratchandra
    Demiryurek, Ugur
    Shahabi, Cyrus
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 501 - 506
  • [3] Dynamic Cost-Efficient Replication in Data Clouds
    Bonvin, Nicolas
    Papaioannou, Thanasis G.
    Aberer, Karl
    [J]. FIRST WORKSHOP ON AUTOMATED CONTROL FOR DATACENTERS AND CLOUDS (ACDC '09), 2009, : 49 - 56
  • [4] Learning Data Dependency with Communication Cost
    Jang, Hyeryung
    Song, HyungSeok
    Yi, Yung
    [J]. PROCEEDINGS OF THE 2018 THE NINETEENTH INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING (MOBIHOC '18), 2018, : 171 - 180
  • [5] Cost-efficient and Differentiated Data Availability Guarantees in Data Clouds
    Bonvin, Nicolas
    Papaioannou, Thanasis G.
    Aberer, Karl
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 980 - 983
  • [6] Machine Learning and Data-Driven Solutions for Cost-efficient Network Automation (Invited)
    Rahman, Sabidur
    [J]. 2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [7] Systematic Literature Review on Cost-Efficient Deep Learning
    Klemetti, Antti
    Raatikainen, Mikko
    Myllyaho, Lalli
    Mikkonen, Tommi
    Nurminen, Jukka K.
    [J]. IEEE ACCESS, 2023, 11 : 90158 - 90180
  • [8] Cost-efficient designing
    Schmuck, M
    Gäbelein, J
    [J]. KUNSTSTOFFE-PLAST EUROPE, 2005, 95 (05): : 83 - 85
  • [9] IS EMPATHY COST-EFFICIENT
    BOOK, HE
    [J]. AMERICAN JOURNAL OF PSYCHOTHERAPY, 1991, 45 (01) : 21 - 30
  • [10] Cost-efficient and individual
    Niederhagen, R
    [J]. KUNSTSTOFFE-PLAST EUROPE, 1997, 87 (03): : 268 - &