Real-time rate distortion-optimized image compression with region of interest on the ARM architecture for underwater robotics applications

被引:0
|
作者
Eduardo M. Rubino
Alberto J. Álvares
Raúl Marín
Pedro J. Sanz
机构
[1] University Jaume-I,Computer Science and Engineering Department
[2] University of Brasilia,Department of Mechanical Engineering
来源
关键词
Progressive image compression; Region of interest (ROI); Parallel wavelet transforms; Arm architecture; Underwater robotics applications;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes the use of a real-time progressive image compression and region of interest algorithm for the ARM processor architecture. This algorithm is used for the design of an underwater image sensor for an autonomous underwater vehicle for intervention, under a highly constrained available bandwidth scenario, allowing for a more agile data exchange between the vehicle and a human operator supervising the underwater intervention. For high compression ratios (smaller output size), execution time is dominated by the transformation algorithm, which plays a progressively smaller role as the compression ratio gets smaller (larger output size). A novel progressive rate distortion-optimized image compression algorithm based on the discrete wavelet transform (DWT) is presented, with special emphasis on a novel minimal time parallel DWT algorithm, which allows full memory bandwidth saturation using only a few cores of a modern multicore embedded processor. The paper focuses in a novel efficient inplace, multithreaded, and cache-friendly parallel 2-D wavelet transform algorithm, based on the lifting transform using the ARM Architecture. In order to maximize the cache utilization and consequently minimize the memory bus bandwidth use, the threads compete to work on a small memory area, maximizing the chances of finding the data in the cache. Their synchronization is done with very low overhead, without the use of any locks and relying solely on the basic compare-and-swap atomic primitive. An implementation in C programming language with and without the use of vector instructions (single instruction multiple data) is provided for both, single (serial) and multi-(parallel) threaded single-loop DWT implementations, as well as serial and parallel naive implementations using linear (row order) and strided (column order) memory access patterns for comparison. Results show a significant improvement over the single-threaded optimized implementation and a much greater improvement over both, the single- and multi-threaded naive implementations, reaching minimal running time depending on the memory access pattern, the number of processor cores, and the available memory bus bandwidth, i.e., it becomes memory bound using the minimum number of memory accesses. Due to memory saturation, the inplace 2-D DWT transform can be executed in the same time as a 1-D DWT transform or as an inplace memory block copy.
引用
收藏
页码:193 / 225
页数:32
相关论文
共 42 条
  • [31] An automated design flow for optimized implementation of real-time image processing applications onto FPGA
    Kaouane, L
    Akil, M
    Sorel, Y
    IEEE REGION 8 EUROCON 2003, VOL A, PROCEEDINGS: COMPUTER AS A TOOL, 2003, : 71 - 75
  • [32] Real-time vector quantization-based image compression on the SIMPil low memory SIMD architecture
    Gentile, A
    Cat, H
    Kossentini, F
    Sorbello, F
    Wills, DS
    1977 IEEE INTERNATIONAL PERFORMANCE, COMPUTING AND COMMUNICATIONS CONFERENCE, 1997, : 10 - 16
  • [33] FUNCTIONAL PROGRAMMING ON A DATA-FLOW ARCHITECTURE - APPLICATIONS IN REAL-TIME IMAGE-PROCESSING
    SEROT, J
    QUENOT, G
    ZAVIDOVIQUE, B
    MACHINE VISION AND APPLICATIONS, 1993, 7 (01) : 44 - 56
  • [34] Parallel Rate Distortion Optimized Quantization for 4K Real-time GPU-based HEVC Encoder
    Igarashi, Hiroaki
    Takano, Fumiyo
    Takenaka, Takashi
    Inoue, Hiroaki
    Moriyoshi, Tatsuji
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [35] PSO optimized 1-D CNN-SVM architecture for real-time detection and classification applications
    Navaneeth, Bhaskar
    Suchetha, M.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 108 : 85 - 92
  • [36] A YOLO v3-tiny FPGA Architecture using a Reconfigurable Hardware Accelerator for Real-time Region of Interest Detection
    Herrmann, Viktor
    Knapheide, Justin
    Steinert, Fritjof
    Stabernack, Benno
    2022 25TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2022, : 84 - 92
  • [37] Beyond pure quality: Progressive modes, region of interest coding, and real time video decoding for PDE-based image compression
    Peter, Pascal
    Schmaltz, Christian
    Mach, Nicolas
    Mainberger, Markus
    Weickert, Joachim
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 31 : 253 - 265
  • [38] Real-Time Adaptive Correction for Varying Source-To-Image-Distance (SID) for a Novel Region of Interest (ROI) Fluoroscopy Dose Reduction Technique Involving Spatially Different Temporal Filtering
    Nagesh, S. Setlur
    Panse, A.
    Jain, A.
    Sharma, P.
    Ionita, C.
    Titus, A.
    Cartwright, A.
    Bednarek, D.
    Rudin, S.
    MEDICAL PHYSICS, 2012, 39 (06) : 3608 - 3609
  • [39] Real-time strain rate imaging (RT-SRI) versus tissue velocity image-derived SRI (TVI-SRI): Compression and expansion rate analysis using a tissue mimicking phantom
    Belohlavek, M
    Bartleson, VB
    Zobitz, ME
    Kinnick, RR
    Greenleaf, JF
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2000, 35 (02) : 491A - 491A
  • [40] Field Programmable Gate Array and System-on-Chip Based Implementation of Discrete Fast Walsh-Hadamard Transform Domain Image Watermarking Architecture for Real-Time Applications
    Ghosh, Sudip
    Biswas, Arijit
    Maity, Santi Prasad
    Rahaman, Hafizur
    JOURNAL OF LOW POWER ELECTRONICS, 2015, 11 (03) : 375 - 386