Video Scene Segmentation Using Tensor-Train Faster-RCNN for Multimedia IoT Systems

被引：19

作者：

Dai, Cheng ^{[1
,2
]}

Liu, Xingang ^{[1
]}

Yang, Laurence T. ^{[3
]}

Ni, Minghao ^{[1
]}

Ma, Zhenchao ^{[3
]}

Zhang, Qingchen ^{[3
]}

Deen, M. Jamal ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

[2] McMaster Univ, Dept Elect Engn & Comp Sci, Hamilton, ON L8S 4K1, Canada

[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada

来源：

IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Tensile stress; Machine learning; Computational modeling; Training; Feature extraction; Image segmentation; Internet of Things; Deep learning; multimedia Internet-of-Things (IoT) system; tensor train; video scene segmentation;

D O I：

10.1109/JIOT.2020.3022353

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video surveillance techniques like scene segmentation are playing an increasingly important role in multimedia Internet-of-Things (IoT) systems. However, existing deep learning-based methods face challenges in both accuracy and memory when deployed on edge computing devices with limited computing resources. To address these challenges, a tensor-train video scene segmentation scheme that compares the local background information in regional scene boundary boxes in adjacent frames is proposed. Compared to the existing methods, the proposed scheme can achieve competitive performance in both segmentation accuracy and parameter compression rate. In detail, first, an improved faster region convolutional neural network (faster-RCNN) model is proposed to recognize and generate a large number of region boxes with foreground and background to achieve boundary boxes. Then, the foreground boxes with sparse objects are removed and the rest are considered as optional background boxes used to measure the similarity between two adjacent frames. Second, to accelerate the training efficiency and reduce memory size, a general and efficient training way using tensor-train decomposition to factor the input-to-hidden weight matrix is proposed. Finally, experiments are conducted to evaluate the performance of the proposed scheme in terms of accuracy and model compression. Our results demonstrate that the proposed model can improve the training efficiency and save the memory space for the deep computation model with good accuracy. This work opens the potential for the use of artificial intelligence methods in edge computing devices for multimedia IoT systems.

引用

页码：9697 / 9705

页数：9

共 6 条

[1] Bullet Hole Detection Using Series Faster-RCNN and Video Analysis
Du, Fengtong
Zhou, Yanzhuo
Chen, Wenjie
Yang, Lei
ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
[2] Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model
Jiang, Du
Li, Gongfa
Tan, Chong
Huang, Li
Sun, Ying
Kong, Jianyi
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 94 - 104
[3] RTF-RCNN: An Architecture for Real-Time Tomato Plant Leaf Diseases Detection in Video Streaming Using Faster-RCNN
Alruwaili, Madallah
Siddiqi, Muhammad Hameed
Khan, Asfandyar
Azad, Mohammad
Khan, Abdullah
Alanazi, Saad
BIOENGINEERING-BASEL, 2022, 9 (10):
[4] Cascaded 3-Stage Nuclei Segmentation using U-net, Faster-RCNN and SegNet for Higher Precision
Shihavuddin, A. S. M.
Kiron, Mohammad Kamrozzaman
Islam, Md Imamul
Maruf, Md Hasan
Ashique, Ratil H.
Kabir, Shahriar Mahmud
2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
[5] A Collaborative Region Detection and Grading Framework for Forest Fire Smoke Using Weakly Supervised Fine Segmentation and Lightweight Faster-RCNN
Pan, Jin
Ou, Xiaoming
Xu, Liang
FORESTS, 2021, 12 (06):
[6] Automated Synthesis of Low-rank Control Systems from sc-LTL Specifications using Tensor-Train Decompositions
Alora, John Irvin
Gorodetsky, Alex
Karaman, Sertac
Marzouk, Youssef
Lowry, Nathan
2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1131 - 1138

← 1 →