Brief Overview of Symbolic Data and Analytic Issues

Cited by: 20
Author
Billard L. [1]
Affiliation
[1] Department of Statistics, University of Georgia, Athens
Source
Keywords
Aggregated data; Complex data; Histogram data; Internal variation; Interval data; Large datasets; Multi-modal data; Rules; Symbolic data analysis
DOI
10.1002/sam.10115
Chinese Library Classification
O1 [Mathematics]
Subject classification codes
0701; 070101
Abstract
With the advent of contemporary computers, datasets can be massive, too large for direct analysis. One of the many approaches to this problem of size is to aggregate the data according to some appropriate scientific question of interest, with the resulting dataset perforce being one with symbolic-valued observations such as lists, intervals, histograms, and the like. Other datasets, small or large, are naturally symbolic. One aim here is to provide a brief nontechnical overview of symbolic data and discuss how they arise. We also provide brief insights into some of the issues that arise in their analysis. These include the need to take into account the internal variations inherent in symbolic data but not present in classical data. Another issue is that, by the nature of the aggregation, resulting datasets can contain "holes" or regions that are not possible; thus, accommodation for these needs to be made when, e.g., seemingly interval data are actually some other form of symbolic data (such as histogram data). Also, we show how other forms of complex data differ from symbolic data; e.g., fuzzy data belong to a different domain from that of symbolic data. Finally, we look at further research needs for the subject. A more technical introduction to symbolic data and available analytic methodology is given by Noirhomme and Brito. Copyright © 2011 Wiley Periodicals, Inc., A Wiley Company.
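As a rough illustration of the aggregation step the abstract describes, the sketch below collapses classical (single-valued) records into one interval-valued and one histogram-valued observation per group. It is not taken from the paper: the records, the group labels, the bin edges, and the function name `aggregate` are all hypothetical, and the code assumes every value falls inside the range covered by the bin edges.

```python
from collections import defaultdict

# Hypothetical classical records: (group label, single measurement).
records = [
    ("A", 2.0), ("A", 4.0), ("A", 9.0),
    ("B", 1.0), ("B", 1.5),
]


def aggregate(records, edges):
    """Aggregate classical values per group into symbolic values.

    Returns, for each group, an interval (min, max) and a histogram of
    relative frequencies over the half-open bins [edges[i], edges[i+1]).
    Assumption: all values lie in [edges[0], edges[-1]).
    """
    by_group = defaultdict(list)
    for group, value in records:
        by_group[group].append(value)

    symbolic = {}
    for group, values in by_group.items():
        counts = [0] * (len(edges) - 1)
        for v in values:
            for i in range(len(edges) - 1):
                if edges[i] <= v < edges[i + 1]:
                    counts[i] += 1
                    break
        symbolic[group] = {
            # Interval-valued observation: only the extremes survive.
            "interval": (min(values), max(values)),
            # Histogram-valued observation: keeps some internal variation.
            "histogram": [c / len(values) for c in counts],
        }
    return symbolic


result = aggregate(records, edges=[0.0, 5.0, 10.0])
for group, value in result.items():
    print(group, value)
```

In this toy case the interval for group A runs from 2.0 to 9.0 even though no observation lies between 4.0 and 9.0, whereas the histogram records that two of the three values fall below 5.0. That is one simple way to see why internal variation, and regions an interval covers but the data do not, matter for analysis.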
Pages: 149-156
Number of pages: 7