Anomaly detection in genomic catalogues using unsupervised multi-view autoencoders

被引:0
|
作者
Ferre, Quentin [1 ,2 ]
Cheneby, Jeanne [1 ]
Puthier, Denis [1 ]
Capponi, Cecile [2 ]
Ballester, Benoit [1 ]
机构
[1] Aix Marseille Univ, TAGC, INSERM, Marseille, France
[2] Aix Marseille Univ, Univ Toulon, LIS, CNRS, Marseille, France
关键词
Genomic assay; Anomaly detection; Cis regulatory element; Unsupervised curation; Convolutional autoencoder; ChIP-seq peak quality; Model interpretability; CHIP-SEQ; INTEGRATIVE ANALYSIS; REGULATORY REGIONS;
D O I
10.1186/s12859-021-04359-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Accurate identification of Transcriptional Regulator binding locations is essential for analysis of genomic regions, including Cis Regulatory Elements. The customary NGS approaches, predominantly ChIP-Seq, can be obscured by data anomalies and biases which are difficult to detect without supervision. Results Here, we develop a method to leverage the usual combinations between many experimental series to mark such atypical peaks. We use deep learning to perform a lossy compression of the genomic regions' representations with multiview convolutions. Using artificial data, we show that our method correctly identifies groups of correlating series and evaluates CRE according to group completeness. It is then applied to the ReMap database's large volume of curated ChIP-seq data. We show that peaks lacking known biological correlators are singled out and less confirmed in real data. We propose normalization approaches useful in interpreting black-box models. Conclusion Our approach detects peaks that are less corroborated than average. It can be extended to other similar problems, and can be interpreted to identify correlation groups. It is implemented in an open-source tool called atyPeak.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Semi-supervised Variational Multi-view Anomaly Detection
    Wang, Shaoshen
    Chen, Ling
    Hussain, Farookh
    Zhang, Chengqi
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 125 - 133
  • [22] A Deep Multi-View Framework for Anomaly Detection on Attributed Networks
    Peng, Zhen
    Luo, Minnan
    Li, Jundong
    Xue, Luguo
    Zheng, Qinghua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2539 - 2552
  • [23] Unsupervised anomaly detection with LSTM autoencoders using statistical data-filtering
    Maleki, Sepehr
    Maleki, Sasan
    Jennings, Nicholas R.
    APPLIED SOFT COMPUTING, 2021, 108
  • [24] Towards a Hierarchical Bayesian Model of Multi-View Anomaly Detection
    Wang, Zhen
    Lan, Chao
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2420 - 2426
  • [25] Unsupervised anomaly detection with LSTM autoencoders using statistical data-filtering
    Maleki, Sepehr
    Maleki, Sasan
    Jennings, Nicholas R.
    Applied Soft Computing, 2021, 108
  • [26] MVS2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry
    Dai, Yuchao
    Zhu, Zhidong
    Rao, Zhibo
    Li, Bo
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 1 - 8
  • [27] Unsupervised Spectrum Anomaly Detection With Distillation and Memory Enhanced Autoencoders
    Qi P.
    Jiang T.
    Xu J.
    He J.
    Zheng S.
    Li Z.
    IEEE Internet of Things Journal, 2024, 11 (24) : 1 - 1
  • [28] Graph Anomaly Detection via Multi-View Discriminative Awareness Learning
    Lian, Jie
    Wang, Xuzheng
    Lin, Xincan
    Wu, Zhihao
    Wang, Shiping
    Guo, Wenzhong
    IEEE Transactions on Network Science and Engineering, 2024, 11 (06): : 6623 - 6635
  • [29] Attention-based anomaly detection in multi-view surveillance videos
    Li, Qun
    Yang, Rui
    Xiao, Fu
    Bhanu, Bir
    Zhang, Feng
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [30] Unsupervised Anomaly Detection in RS-485 Traffic using Autoencoders with Unobtrusive Measurement
    Chirupphapa, Pawissakan
    Hossain, Md Delwar
    Esaki, Hiroshi
    Ochiai, Hideya
    2022 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, IPCCC, 2022,