Measurement, selection, and visualization of association rules: A compositional data perspective A Compositional Data perspective on Association Rules

被引:2
|
作者
Vives-Mestres, Marina [1 ,2 ]
Kenett, Ron S. [3 ,4 ]
Thio-Henestrosa, Santiago [1 ]
Martin-Fernandez, Josep Antoni [1 ]
机构
[1] Univ Girona, Dept Comp Sci Appl Math & Stat, POLITECNICA 4,Campus Montilivi, Girona 17003, Spain
[2] Curelator Inc, Clin Stat, 210 Broadway 201, Cambridge, MA 02139 USA
[3] KPA Grp, Raanana, Israel
[4] Samuel Neaman Inst, Raanana, Israel
关键词
Aitchison geometry; association rule; independence test; measure of interestingness; odds ratio test; simplex representation;
D O I
10.1002/qre.2910
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Association rule mining is a powerful data analytic technique used for extracting information from transaction databases with a collection of itemsets. The aim is to indicate what item goes with what item (ie, an association rule) in a set of collected transactions. It is extensively used in text analytics of text records or social media. Here we use Compositional Data analysis (CoDa) techniques to generate new visualizations and insights from association rule mining. These CoDa methods show the relationship between itemsets, their strength, and direction of dependency. Moreover, after expressing each association rule as a contingency table, we discuss two statistical tests to guide identification of the relevant rules by analyzing the relative importance of the elements of the table. As an example, we use these visualizations and statistical tests for investigating the association of negative mood emotions to various types of headache/migraine events. Data for those analysis comes from N1-Headache(TM), a digital platform where individual users record attacks and symptoms as well as their daily exposure to a list of potential factors.
引用
收藏
页码:1327 / 1339
页数:13
相关论文
共 50 条
  • [41] Exploration of Ordinal Data Using Association Rules
    Büchter, Oliver
    Wirth, Rüdiger
    Knowledge and Information Systems, 1999, 1 (04): : 393 - 414
  • [42] Scalable parallel data mining for association rules
    Han, EH
    Karypis, G
    Kumar, V
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2000, 12 (03) : 337 - 352
  • [43] Intelligent Structuring of Association Rules in Data Structure
    Wang, Yan
    Lei, Yuxia
    Cao, Baoxiang
    PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN, VOL 2, 2008, : 211 - 214
  • [44] A data cleaning method based on association rules
    Wei, Weijie
    Zhang, Mingwei
    Zhang, Bin
    Tang, Xiaochun
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [45] Mining association rules in big data with NGEP
    Chen, Yunliang
    Li, Fangyuan
    Fan, Junqing
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 577 - 585
  • [46] Mining Association Rules in Seasonal Transaction Data
    Ayu, Sabrina Kusuma
    Surjandari, Isti
    Zulkarnain
    2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 321 - 325
  • [47] Exploration of Ordinal Data Using Association Rules
    Oliver Büchter
    Rüdiger Wirth
    Knowledge and Information Systems, 1999, 1 (4) : 393 - 414
  • [48] MINING ASSOCIATION RULES FOR TRAJECTORIES OF SPATIOTEMPORAL DATA
    Hong, Hao
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 247 - 253
  • [49] Improvements in data mining association rules algorithm
    Li, Dai
    International Journal of Database Theory and Application, 2015, 8 (02): : 1 - 10
  • [50] Application of Association Rules in the Hepatocellular Carcinoma Data
    Ye Xiaofei
    Wu Jianhu
    He Jia
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 1057 - 1064