Explainable YouTube Video Identification Using Sufficient Input Subsets

Cited by: 1

Authors:

Afandi, Waleed [1]

Bukhari, Syed Muhammad Ammar Hassan [1]

Khan, Muhammad U. S. [1]

Maqsood, Tahir [1]

Fayyaz, Muhammad A. B. [2]

Ansari, Ali R. [3]

Nawaz, Raheel [4]

Affiliations:

[1] COMSATS Univ Islamabad, Dept Comp Sci, Abbottabad 22060, Pakistan

[2] Manchester Metropolitan Univ, OTEHM, Manchester M15 6BH, England

[3] Gulf Univ Sci & Technol, Dept Math & Nat Sci, Mubarak Al Abdullah 32093, Kuwait

[4] Staffordshire Univ, Pro Vice Chancellor Digital Transformat, Stoke On Trent ST4 2DE, England

Source:

IEEE ACCESS | 2023 / Vol. 11

Keywords:

Streaming media; Fingerprint recognition; Convolutional neural networks; Data models; Telecommunication traffic; Cryptography; Video on demand; Video identification; fingerprinting; deep learning; classification; variable bitrate;

DOI:

10.1109/ACCESS.2023.3261562

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Neural network models are black boxes by nature, and the mechanics behind them are practically unexplainable. Insight into the patterns these algorithms identify can help unravel important properties of the subject under study. Such artificial-intelligence-based algorithms are used for prediction across many domains. This research focuses on patterns formed in network traffic that can be leveraged to identify videos streaming over the network. The proposed work applies a sufficient input subset (SIS) model to two separate video identification techniques to understand and explain the patterns they detect. The first technique creates fingerprints of videos using a period-based algorithm to handle variable-bitrate inconsistencies; these fingerprints are passed to a convolutional neural network (CNN) for pattern recognition. The second technique is based on traffic pattern plot identification: it creates a graph of packet size over time for each stream and passes that graph to a CNN as an image. For model explainability, the SIS model identifies features that are sufficient for the model to reach the same prediction at a given confidence threshold. The SIS generated for each input sample are clustered using DBSCAN, K-Means, and cosine-based hierarchical clustering. The clustered SIS highlight the common patterns for each class. The SIS patterns learnt by each model for three individual videos are discussed. Furthermore, these patterns are used to investigate misclassifications and to provide a rationale for them, justifying the behavior of the classifier model.
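To make the idea of a sufficient input subset concrete, the following is a minimal sketch of a backward-selection SIS search in the spirit of the SIS literature. It is not the authors' implementation: the function names `back_select` and `find_sis`, the zero mask value, the toy logistic model `toy_model`, and the 0.9 threshold are all assumptions chosen for illustration, and a real study would replace the toy model with the trained CNN's class confidence.

```python
import numpy as np

def back_select(f, x, mask_value=0.0):
    """Greedily mask features one at a time, always masking the feature
    whose removal leaves the model confidence highest.
    Returns feature indices in masking order (least important first)."""
    x_masked = np.array(x, dtype=float)
    remaining = list(range(len(x_masked)))
    removal_order = []
    while remaining:
        best_idx, best_score = None, -np.inf
        for i in remaining:
            trial = x_masked.copy()
            trial[i] = mask_value
            score = f(trial)
            if score > best_score:
                best_idx, best_score = i, score
        x_masked[best_idx] = mask_value
        remaining.remove(best_idx)
        removal_order.append(best_idx)
    return removal_order

def find_sis(f, x, threshold, mask_value=0.0):
    """Restore features in reverse masking order (most important first)
    until the confidence on the partially masked input reaches the threshold.
    Returns the sorted indices of the subset, or None if the full input
    itself does not reach the threshold."""
    x = np.asarray(x, dtype=float)
    if f(x) < threshold:
        return None
    removal_order = back_select(f, x, mask_value)
    candidate = np.full_like(x, mask_value)
    sis = []
    for i in reversed(removal_order):
        candidate[i] = x[i]
        sis.append(i)
        if f(candidate) >= threshold:
            break
    return sorted(sis)

# Hypothetical stand-in for a classifier's confidence in one class:
# a logistic model with hand-picked weights (illustrative values only).
WEIGHTS = np.array([3.0, 0.1, 2.0, 0.0])
BIAS = -2.0

def toy_model(x):
    return 1.0 / (1.0 + np.exp(-(WEIGHTS @ x + BIAS)))

sample = np.ones(4)
sis_indices = find_sis(toy_model, sample, threshold=0.9)
print(sis_indices)
```

In this toy setting the two heavily weighted features are enough on their own to keep the confidence above the threshold, which is the sense in which the subset is "sufficient". Collecting such subsets across many samples of a class is what would then be passed to a clustering step such as DBSCAN or K-Means.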


Pages: 33178-33188

Page count: 11

