Mining massive text data and developing tracking statistics

Authors
Jeske, DR [1]
Liu, RY [1]
Affiliation
[1] Univ Calif Riverside, Riverside, CA 92521 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper outlines a systematic data mining procedure for exploring large free-style text datasets to discover useful features and develop tracking statistics, generally referred to as performance measures or risk indicators. The procedure includes text mining, risk analysis, classification for error measurements, and nonparametric multivariate analysis. Two aviation safety report repositories, PTRS from the FAA and AAS from the NTSB, will be used to illustrate applications of our research to aviation risk management and general decision-support systems. Some specific text analysis methodologies and tracking statistics will be discussed, as will approaches to incorporating misclassified data or error measurements into tracking statistics.
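The abstract does not say how misclassification enters a tracking statistic. As an illustration only, the sketch below applies the standard Rogan-Gladen adjustment to the fraction of reports that a text classifier flags in one period. This is a swapped-in textbook technique, not the authors' method, and the names ClassifierErrorRates and corrected_rate, along with the sensitivity and specificity values, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ClassifierErrorRates:
    """Assumed (hypothetical) error rates of a report classifier."""
    sensitivity: float  # P(flagged | report truly describes the event)
    specificity: float  # P(not flagged | report does not describe the event)


def corrected_rate(flagged: int, total: int, err: ClassifierErrorRates) -> float:
    """Rogan-Gladen estimate of the true event rate for one period.

    observed = flagged / total is biased when the classifier makes errors;
    the adjustment is (observed + specificity - 1) / (sensitivity + specificity - 1).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    denom = err.sensitivity + err.specificity - 1.0
    if denom <= 0:
        raise ValueError("classifier must perform better than chance")
    observed = flagged / total
    estimate = (observed + err.specificity - 1.0) / denom
    # Sampling noise can push the estimate outside [0, 1]; clamp it.
    return min(1.0, max(0.0, estimate))


if __name__ == "__main__":
    rates = ClassifierErrorRates(sensitivity=0.9, specificity=0.8)
    # 30 of 100 reports flagged in a period (made-up counts).
    print(round(corrected_rate(30, 100, rates), 4))
```

Computing this value period by period would give a tracking series adjusted for classifier error, under the assumption that the error rates are known and stable over time.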
Pages: 495-510
Page count: 16