Bilevel Feature Extraction-Based Text Mining for Fault Diagnosis of Railway Systems

被引:99
|
作者
Wang, Feng [1 ]
Xu, Tianhua [1 ]
Tang, Tao [1 ]
Zhou, MengChu [2 ,3 ]
Wang, Haifeng [4 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
[2] Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
[3] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] Beijing Jiaotong Univ, Natl Engn Res Ctr Rail Transportat Operat & Contr, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Text mining; feature selection; fault diagnosis; railway systems; FUSION; STRATEGY;
D O I
10.1109/TITS.2016.2521866
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A vast amount of text data is recorded in the forms of repair verbatim in railway maintenance sectors. Efficient text mining of such maintenance data plays an important role in detecting anomalies and improving fault diagnosis efficiency. However, unstructured verbatim, high-dimensional data, and imbalanced fault class distribution pose challenges for feature selections and fault diagnosis. We propose a bilevel feature extraction-based text mining that integrates features extracted at both syntax and semantic levels with the aim to improve the fault classification performance. We first perform an improved chi(2) statistics-based feature selection at the syntax level to overcome the learning difficulty caused by an imbalanced data set. Then, we perform a prior latent Dirichlet allocation-based feature selection at the semantic level to reduce the data set into a low-dimensional topic space. Finally, we fuse fault features derived from both syntax and semantic levels via serial fusion. The proposed method uses fault features at different levels and enhances the precision of fault diagnosis for all fault classes, particularly minority ones. Its performance has been validated by using a railway maintenance data set collected from 2008 to 2014 by a railway corporation. It outperforms traditional approaches.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [1] Dynamic Feature Extraction-Based Quadratic Discriminant Analysis for Industrial Process Fault Classification and Diagnosis
    Li, Hanqi
    Jia, Mingxing
    Mao, Zhizhong
    [J]. ENTROPY, 2023, 25 (12)
  • [2] The feature extraction of text mining based on Web
    Liu, LZ
    Chen, JJ
    Song, HT
    [J]. ICEMI'2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOLS 1-3, 2003, : 547 - 550
  • [3] Text Mining Based Fault Diagnosis of Vehicle On-board Equipment for High Speed Railway
    Zhao Yang
    Xu Tian-hua
    Wang Hai-feng
    [J]. 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 900 - 905
  • [4] Fault diagnosis of rotor systems Using ICA Based Feature Extraction
    Jiao, Weidong
    Chang, Yongping
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4, 2009, : 1286 - 1291
  • [5] Failure Cause Extraction of Railway Switches Based on Text Mining
    Lin, Chunni
    Wang, Guang
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2017), 2017, : 237 - 241
  • [6] Short Text Mining for Fault Diagnosis of Railway System Based on Multi-Granularity Topic Model
    Wu, Shun
    [J]. 2018 8TH INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS), 2018,
  • [7] Bearing Fault Feature Extraction and Fault Diagnosis Method Based on Feature Fusion
    Zhu, Huibin
    He, Zhangming
    Wei, Juhui
    Wang, Jiongqi
    Zhou, Haiyin
    [J]. SENSORS, 2021, 21 (07)
  • [8] Multiscale Symbolic Lempel-Ziv: An Effective Feature Extraction Approach for Fault Diagnosis of Railway Vehicle Systems
    Li, Yongbo
    Liu, Fulong
    Wang, Shun
    Yin, Jiancheng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (01) : 199 - 208
  • [9] Oscillatory Behavior based Fault Feature Extraction for Bearing Fault Diagnosis
    Shi, Juanjuan
    Liang, Ming
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2015, : 473 - 478
  • [10] EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
    Kyoomarsi, F.
    Khosravi, H.
    Eslami, E.
    Davoudi, M.
    [J]. IRANIAN JOURNAL OF FUZZY SYSTEMS, 2010, 7 (03): : 15 - 32