Video Scene Segmentation Using Tensor-Train Faster-RCNN for Multimedia IoT Systems

被引:19
|
作者
Dai, Cheng [1 ,2 ]
Liu, Xingang [1 ]
Yang, Laurence T. [3 ]
Ni, Minghao [1 ]
Ma, Zhenchao [3 ]
Zhang, Qingchen [3 ]
Deen, M. Jamal [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[2] McMaster Univ, Dept Elect Engn & Comp Sci, Hamilton, ON L8S 4K1, Canada
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada
基金
中国国家自然科学基金;
关键词
Tensile stress; Machine learning; Computational modeling; Training; Feature extraction; Image segmentation; Internet of Things; Deep learning; multimedia Internet-of-Things (IoT) system; tensor train; video scene segmentation;
D O I
10.1109/JIOT.2020.3022353
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video surveillance techniques like scene segmentation are playing an increasingly important role in multimedia Internet-of-Things (IoT) systems. However, existing deep learning-based methods face challenges in both accuracy and memory when deployed on edge computing devices with limited computing resources. To address these challenges, a tensor-train video scene segmentation scheme that compares the local background information in regional scene boundary boxes in adjacent frames is proposed. Compared to the existing methods, the proposed scheme can achieve competitive performance in both segmentation accuracy and parameter compression rate. In detail, first, an improved faster region convolutional neural network (faster-RCNN) model is proposed to recognize and generate a large number of region boxes with foreground and background to achieve boundary boxes. Then, the foreground boxes with sparse objects are removed and the rest are considered as optional background boxes used to measure the similarity between two adjacent frames. Second, to accelerate the training efficiency and reduce memory size, a general and efficient training way using tensor-train decomposition to factor the input-to-hidden weight matrix is proposed. Finally, experiments are conducted to evaluate the performance of the proposed scheme in terms of accuracy and model compression. Our results demonstrate that the proposed model can improve the training efficiency and save the memory space for the deep computation model with good accuracy. This work opens the potential for the use of artificial intelligence methods in edge computing devices for multimedia IoT systems.
引用
收藏
页码:9697 / 9705
页数:9
相关论文
共 6 条
  • [1] Bullet Hole Detection Using Series Faster-RCNN and Video Analysis
    Du, Fengtong
    Zhou, Yanzhuo
    Chen, Wenjie
    Yang, Lei
    ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [2] Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model
    Jiang, Du
    Li, Gongfa
    Tan, Chong
    Huang, Li
    Sun, Ying
    Kong, Jianyi
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 94 - 104
  • [3] RTF-RCNN: An Architecture for Real-Time Tomato Plant Leaf Diseases Detection in Video Streaming Using Faster-RCNN
    Alruwaili, Madallah
    Siddiqi, Muhammad Hameed
    Khan, Asfandyar
    Azad, Mohammad
    Khan, Abdullah
    Alanazi, Saad
    BIOENGINEERING-BASEL, 2022, 9 (10):
  • [4] Cascaded 3-Stage Nuclei Segmentation using U-net, Faster-RCNN and SegNet for Higher Precision
    Shihavuddin, A. S. M.
    Kiron, Mohammad Kamrozzaman
    Islam, Md Imamul
    Maruf, Md Hasan
    Ashique, Ratil H.
    Kabir, Shahriar Mahmud
    2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
  • [5] A Collaborative Region Detection and Grading Framework for Forest Fire Smoke Using Weakly Supervised Fine Segmentation and Lightweight Faster-RCNN
    Pan, Jin
    Ou, Xiaoming
    Xu, Liang
    FORESTS, 2021, 12 (06):
  • [6] Automated Synthesis of Low-rank Control Systems from sc-LTL Specifications using Tensor-Train Decompositions
    Alora, John Irvin
    Gorodetsky, Alex
    Karaman, Sertac
    Marzouk, Youssef
    Lowry, Nathan
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1131 - 1138