FSS: algorithm and neural network accelerator for style transfer

被引:0
|
作者
Yi LING [1 ]
Yujie HUANG [1 ,2 ]
Yujie CAI [1 ,2 ]
Zhaojie LI [1 ,2 ]
Mingyu WANG [1 ]
Wenhong LI [1 ]
Xiaoyang ZENG [1 ]
机构
[1] State Key Laboratory of ASIC & System,Fudan University
[2] Shanghai Explore X Technology Co.,Ltd.
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP183 [人工神经网络与计算]; TP391.41 [];
学科分类号
080203 ;
摘要
Neural networks(NNs), owing to their impressive performance, have gradually begun to dominate multimedia processing. For resource-constrained and energy-sensitive mobile devices, an efficient NN accelerator is necessary. Style transfer is an important multimedia application. However, existing arbitrary style transfer networks are complex and not well supported by current NN accelerators, limiting their application on mobile devices. Moreover, the quality of style transfer needs improvement. Thus, we design the Fast Style system(FSS), where a novel algorithm and an NN accelerator are proposed for style transfer. In FSS, we first propose a novel arbitrary style transfer algorithm, Fast Style. We propose a light network that contributes to high quality and low computational complexity and a prior mechanism to avoid retraining when the style changes. Then, we redesign an NN accelerator for Fast Style by applying two improvements to the basic NVIDIA deep learning accelerator(NVDLA) architecture. First, a flexible dat FSM and wt FSM are redesigned to enable the original data path to perform other operations(including the GRAM operation)by software programming. Moreover, statistics and judgment logic are designed to utilize the continuity of a video stream and remove the data dependency in the instance normalization, which improves the accelerator performance by 18.6%. The experimental results demonstrate that the proposed Fast Style can achieve higher quality with a lower computational cost, making it more suitable for mobile devices. The proposed NN accelerator is implemented on the Xilinx VCU118 FPGA under a 180-MHz clock. Experimental results show that the accelerator can stylize 512×512-pixel video with 20 FPS, and the measured performance reaches up to 306.07 GOPS. The ASIC implementation in TSMC 28 nm achieves about 22 FPS in the case of a 720-p video.
引用
收藏
页码:253 / 266
页数:14
相关论文
共 50 条
  • [21] Stereoscopic Neural Style Transfer
    Chen, Dongdong
    Yuan, Lu
    Liao, Jing
    Yu, Nenghai
    Hua, Gang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6654 - 6663
  • [22] Demystifying Neural Style Transfer
    Li, Yanghao
    Wang, Naiyan
    Liu, Jiaying
    Hou, Xiaodi
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2230 - 2236
  • [23] Neural Policy Style Transfer
    Fernandez-Fernandez, Raul
    Victores, Juan G.
    Gago, Jennifer J.
    Estevez, David
    Balaguer, Carlos
    COGNITIVE SYSTEMS RESEARCH, 2022, 72 : 23 - 32
  • [24] A Layered Algorithm of Style Transfer
    Lin, Qi
    Zhu, Qing
    Li, Weiran
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 162 - 166
  • [25] NEURAL ACCELERATOR FOR PARALLELIZATION OF BACKPROPAGATION ALGORITHM
    FRANZI, E
    MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 689 - 696
  • [26] Controlling Stroke Size in Fast Style Transfer with Recurrent Convolutional Neural Network
    Yang, Lingchen
    Yang, Lumin
    Zhao, Mingbo
    Zheng, Youyi
    COMPUTER GRAPHICS FORUM, 2018, 37 (07) : 97 - 107
  • [27] CAPTCHA Image Generation Using Style Transfer Learning in Deep Neural Network
    Kwon, Hyun
    Yoon, Hyunsoo
    Park, Ki-Woong
    INFORMATION SECURITY APPLICATIONS, WISA 2019, 2020, 11897 : 234 - 246
  • [28] A New Accelerator for Convolution Neural Network
    Wu, Fan
    Song, Jie
    Zhuang, Haoran
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7982 - 7985
  • [29] A 3.2 GFLOPS neural network accelerator
    Komori, S
    Arima, Y
    Kondo, Y
    Tsubota, H
    Tanaka, K
    Kyuma, K
    IEICE TRANSACTIONS ON ELECTRONICS, 1997, E80C (07) : 859 - 867
  • [30] Neural network accelerator for quantum control
    Xu, David
    Ozguler, A. Baris
    Di Guglielmo, Giuseppe
    2022 IEEE/ACM THIRD INTERNATIONAL WORKSHOP ON QUANTUM COMPUTING SOFTWARE (QCS), 2022, : 43 - 49