Analysis of data consistency identifies measurement abnormality in Howells' craniometric test data set

被引:1
|
作者
Pang, Jinyong [1 ,2 ]
Dong, Yibo [1 ,2 ,4 ]
Turner, Christopher [3 ]
Li, Chang [1 ,2 ]
Liu, Xiaoming [1 ,2 ]
机构
[1] Univ S Florida, USF Genom, 3720 Spectrum Blvd,Suite 304, Tampa, FL 33612 USA
[2] Univ S Florida, Coll Publ Hlth, 3720 Spectrum Blvd,Suite 304, Tampa, FL 33612 USA
[3] Univ S Florida, Coll Arts & Sci, Dept Anthropol, Tampa, FL 33612 USA
[4] Bur Publ Hlth Labs, 1217 N Pearl St, Jacksonville, FL USA
来源
关键词
data contency; SIS; Howells' craniometric data; simotic chord; simotic subtense; sis; WNB;
D O I
10.1002/ajpa.24631
中图分类号
Q98 [人类学];
学科分类号
030303 ;
摘要
Howells' craniometric data set is the largest publicly available craniometric data set on the internet and has been widely used in craniometric methods development. The data consists of a main data set of 2524 human crania from 28 populations and an additional "test" data set of 524 crania. Up to 82 measurements were recorded from those crania. We studied the data consistency between the main and test data sets for potential combined usage of the two. We found that the two data sets can be separated clearly via Uniform Manifold Approximation and Projection, suggesting some data inconsistency between the two. To further investigate the cause, we split the two data sets into six continental groups (African, Austro-Melanesian, East Asian, European, Native American, and Polynesian) and tested the distribution difference between the two data sets for each of the groups. We found that the measures of simotic chord (WNB) and simotic subtense (SIS) are significantly and abnormally larger in the test data set than in the main data set. After removing the two measures, the two data sets are broadly comparable. We further showed the evidence that missing decimal points likely caused the abnormality.
引用
收藏
页码:687 / 692
页数:6
相关论文
共 50 条
  • [1] Howells' craniometric data on the Internet
    Howells, WW
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 1996, 101 (03) : 441 - 442
  • [2] Consistency of Ames Test Results: An Analysis of NTP Data
    Witt, Kristine L.
    Mitchell, Constance A.
    Embry, Michelle R.
    Zeiger, Errol
    ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2023, 64 : 69 - 70
  • [3] Test Data Measurement Uncertainty Analysis
    Buck, David T.
    SENSORS AND INSTRUMENTATION, AIRCRAFT/AEROSPACE AND DYNAMIC ENVIRONMENTS TESTING, VOL 7, 2023, : 1 - 4
  • [4] Factor Consistency of Neuropsychological Test Battery Versions in the NACC Uniform Data Set
    Culhane, Jessica E.
    Chan, Kwun C. G.
    Teylan, Merilee A.
    Chen, Yen-Chi
    Mock, Charles
    Gauthreaux, Kathryn
    Kukull, Walter A.
    ALZHEIMER DISEASE & ASSOCIATED DISORDERS, 2020, 34 (02): : 175 - 177
  • [5] Test Program Set Data Collection and Data Mining
    Williams, Ian
    Moran, Susan
    IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2010, 13 (04) : 34 - 40
  • [6] ORDINAL DATA AHP ANALYSIS - A PROPOSED COEFFICIENT OF CONSISTENCY AND A NONPARAMETRIC TEST
    JENSEN, RE
    HICKS, TE
    MATHEMATICAL AND COMPUTER MODELLING, 1993, 17 (4-5) : 135 - 150
  • [7] ANALYSIS OF A MODIFIED DATA TEST SET BY USE OF GENERATING FUNCTIONS
    HAN, B
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1974, CO22 (10) : 1706 - 1710
  • [8] Abnormality Analysis of Streamed Log Data
    Harutyunyan, Ashot N.
    Poghosyan, Arnak V.
    Grigoryan, Naira M.
    Marvasti, Mazda A.
    2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
  • [9] CONSISTENCY TEST FOR TERNARY AZEOTROPIC DATA.
    Shiozaki, Jun'ichi
    Matsuyama, Hisayoshi
    Memoirs of the Kyushu University, Faculty of Engineering, 1981, 41 (01): : 49 - 58
  • [10] THERMODYNAMIC CONSISTENCY TEST FOR BINARY VLE DATA
    KOLLARHUNEK, K
    KEMENY, S
    HEBERGER, K
    ANGYAL, P
    THURY, E
    FLUID PHASE EQUILIBRIA, 1986, 27 : 405 - 425