Discovery of Corrosion Patterns using Symbolic Time Series Representation and N-gram Model

被引:0
|
作者
Taib, Shakirah Mohd [1 ]
Zabidi, Zahiah Akhma Mohd [1 ]
Aziz, Izzatdin Abdul [1 ]
Mousor, Farahida Hanim [1 ]
Abu Bakar, Azuraliza [2 ]
Mokhtar, Ainul Akmar [3 ]
机构
[1] Univ Teknol Petronas, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
[2] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence Technol, Ukm Bangi 43600, Selangor, Malaysia
[3] Univ Teknol Petronas, Dept Mech Engn, Seri Iskandar 32610, Perak, Malaysia
关键词
Pipelines corrosion analysis; Symbolic Aggregation Approximation (SAX) representation; corrosion patterns; corrosion factor;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There are many factors that can contribute to corrosion in the pipeline. Therefore, it is important for decision makers to analyze and identify the main factor of corrosion in order to take appropriate actions. The factor of corrosion can be analyzed using data mining based on historical datasets collected from monitoring sensors. The purpose of this study is to analyze the trends of corroding agents for pipeline corrosion based on symbolic representation of time series corrosion dataset using Symbolic Aggregation Approximation (SAX). The paper presents the analysis and evaluation of the patterns using Ngram model. Text mining using N-gram model is proposed to mine trend changes from corrosion time series dataset that are transformed as symbolic representation. N-gram was applied for the analysis in order to find significant symbolic patterns that are represented as text. Pattern analysis is performed and the results are discussed according to each environmental factor of pipeline corrosion.
引用
收藏
页码:554 / 560
页数:7
相关论文
共 50 条
  • [1] Symbolic Translation of Time Series using Piecewise N-gram Similarity Voting
    Delannoy, Siegfried
    Caillault, Emilie
    Bigand, Andre
    Rousseeuw, Kevin
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 327 - 333
  • [2] Pseudo-Conventional N-Gram Representation of the Discriminative N-Gram Model for LVCSR
    Zhou, Zhengyu
    Meng, Helen
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 943 - 952
  • [3] N-gram Events for Analysis of Financial Time Series
    Borovikov, Igor
    Sadovsky, Michael
    [J]. PROCEEDINGS OF ECCS 2014: EUROPEAN CONFERENCE ON COMPLEX SYSTEMS, 2016, : 155 - 167
  • [4] Linguistic Summarization using a Weighted N-gram Language Model based on the Similarity of Time-series Data
    Aoki, Kasumi
    Kobayashi, Ichiro
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 595 - 601
  • [5] Recasting the discriminative n-gram model as a pseudo-conventional n-gram model for LVCSR
    Zhou, Zhengyu
    Meng, Helen
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4933 - 4936
  • [6] Pipilika N-gram Viewer: An Efficient Large Scale N-gram Model for Bengali
    Ahmad, Adnan
    Talha, Mahbubur Rub
    Amin, Md. Ruhul
    Chowdhury, Farida
    [J]. 2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [7] Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model
    Farrahi, Katayoun
    Gatica-Perez, Daniel
    [J]. 2012 16TH INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (ISWC), 2012, : 1 - 8
  • [8] A symbolic representation of time series
    Wang, Q
    Megalooikonomou, V
    Li, G
    [J]. ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2005, : 655 - 658
  • [9] Supervised N-gram Topic Model
    Kawamae, Noriaki
    [J]. WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 473 - 482
  • [10] Similar N-gram Language Model
    Gillot, Christian
    Cerisara, Christophe
    Langlois, David
    Haton, Jean-Paul
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1824 - 1827