Explainable YouTube Video Identification Using Sufficient Input Subsets

被引:1
|
作者
Afandi, Waleed [1 ]
Bukhari, Syed Muhammad Ammar Hassan [1 ]
Khan, Muhammad U. S. [1 ]
Maqsood, Tahir [1 ]
Fayyaz, Muhammad A. B. [2 ]
Ansari, Ali R. [3 ]
Nawaz, Raheel [4 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Abbottabad 22060, Pakistan
[2] Manchester Metropolitan Univ, OTEHM, Manchester M15 6BH, England
[3] Gulf Univ Sci & Technol, Dept Math & Nat Sci, Mubarak Al Abdullah 32093, Kuwait
[4] Staffordshire Univ, Pro Vice Chancellor Digital Transformat, Stoke On Trent ST4 2DE, England
关键词
Streaming media; Fingerprint recognition; Convolutional neural networks; Data models; Telecommunication traffic; Cryptography; Video on demand; Video identification; fingerprinting; deep learning; classification; variable bitrate;
D O I
10.1109/ACCESS.2023.3261562
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural network models are black boxes in nature. The mechanics behind these black boxes are practically unexplainable. Having the insight into patterns identified by these algorithms can help unravel important properties of the subject in query. These artificial intelligence based algorithms are used in every domain for prediction. This research focuses on patterns formed in network traffic that can be leveraged to identify videos streaming over the network. The proposed work uses a sufficient input subset (SIS) model on two separate video identification techniques to understand and explain the patterns detected by the techniques. The first technique creates the fingerprints of videos on a period-based algorithm to handle variable bitrate inconsistencies. These fingerprints are passed to a convolutional Neural Network (CNN) for pattern recognition. The second technique is based on traffic pattern plot identification that creates a graph of packet size with respect to time for each stream before passing that to a CNN as an image. For model explainability, a sufficient input subset (SIS) model is used to identify features that are sufficient to reach the same prediction under a certain threshold of confidence by the model. The generated SIS of each input sample is clustered using DBSCAN, K-Means, and cosine-based Hierarchical clustering. The clustered SIS highlight the common patterns for each class. The SIS patterns learnt by each model of three individual videos are discussed. Furthermore, these patterns are used to investigate misclassification and provide a rationale behind it to justify the working of the classifier model.
引用
收藏
页码:33178 / 33188
页数:11
相关论文
共 50 条
  • [1] Critiquing Protein Family Classification Models Using Sufficient Input Subsets
    Carter, Brandon
    Bileschi, Maxwell
    Smith, Jamie
    Sanderson, Theo
    Bryant, Drew
    Belanger, David
    Colwell, Lucy J.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (08) : 1219 - 1231
  • [2] Female urinary incontinence on TikTok and YouTube: is online video content sufficient?
    Mehmet Serkan Özkent
    Muzaffer Tansel Kılınç
    International Urogynecology Journal, 2023, 34 : 2775 - 2781
  • [3] Female urinary incontinence on TikTok and YouTube: is online video content sufficient?
    Ozkent, Mehmet Serkan
    Kilinc, Muzaffer Tansel
    INTERNATIONAL UROGYNECOLOGY JOURNAL, 2023, 34 (11) : 2775 - 2781
  • [4] Automated Identification and Reconstruction of YouTube Video Access
    Patterson, Jonathan
    Hargreaves, Christopher
    JOURNAL OF DIGITAL FORENSICS SECURITY AND LAW, 2012, 7 (02) : 43 - 59
  • [5] DASHing YouTube: An Analysis of Using DASH in YouTube Video Service
    Krishnappa, Dilip Kumar
    Bhat, Divyashri
    Zink, Michael
    PROCEEDINGS OF THE 2013 38TH ANNUAL IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2013), 2013, : 407 - 415
  • [6] A Media For Teaching Speaking Using Youtube Video
    Arianti, Arin
    Nurnaningsih
    Pratiwi, Veronika Unun
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING (ICASE 2018), 2018, 175 : 71 - 73
  • [7] Optimizing Prediction of YouTube Video Popularity Using XGBoost
    Nisa, Meher U. N.
    Mahmood, Danish
    Ahmed, Ghufran
    Khan, Suleman
    Mohammed, Mazin Abed
    ius, Robertas
    Damasevicius, Robertas
    ELECTRONICS, 2021, 10 (23)
  • [8] Traffic spills the beans: A robust video identification attack against YouTube
    Zhang, Xiyuan
    Xiong, Gang
    Li, Zhen
    Yang, Chen
    Lin, Xinjie
    Gou, Gaopeng
    Fang, Binxing
    COMPUTERS & SECURITY, 2024, 137
  • [9] What made you do this? Understanding black-box decisions with sufficient input subsets
    Carter, Brandon
    Mueller, Jonas
    Jain, Siddhartha
    Gifford, David
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 567 - 576
  • [10] Using YouTube Analytics to Investigate Instructional Video Viewing Patterns
    O'Brien, Michael
    Slattery, Darina
    Walsh, John
    PROCEEDINGS OF THE 18TH EUROPEAN CONFERENCE ON E-LEARNING (ECEL 2019), 2019, : 428 - 436