Architecture optimization for multimedia application exploiting data and thread-level parallelism

被引:3
|
作者
Limousin, C [1 ]
Sebot, J [1 ]
Vartanian, A [1 ]
Drach, N [1 ]
机构
[1] Univ Paris 11, LRI, F-91405 Orsay, France
关键词
SIMD; SMT; superscalar processor; memory hierarchy; multimedia;
D O I
10.1016/j.sysarc.2004.06.002
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The characteristics of multimedia applications when executed on general-purpose processors are not well understood. Such knowledge is extremely important in guiding the development of multimedia applications and the design of future processors. In this paper, we characterize and optimize the performance of multimedia applications on superscalar processor exploiting data-level parallelism and thread-level parallelism with SIMD (Single Instruction Multiple Data) and SMT (Simultaneous MultiThreading) capacities. We show that SMT and SIMD superscalar processor is suitable for 3D geometry application and we characterize the execution in term of memory hierarchy, which is the main bottleneck. The results show that the latency is not fully recovered by SMT; the use of second-level data prefetching does not succeed in increasing the performance. With detailed analysis, we show that this problem comes from a pollution of the instruction window by the threads experiencing second-level cache misses, thus reducing the window available for the other threads. We thus propose a hardware mechanism (an architecture optimization) to predict second-level misses and control this pollution. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:15 / 27
页数:13
相关论文
共 50 条
  • [1] Exploiting speculative thread-level parallelism in data compression applications
    Wang, Shengyue
    Zhai, Antonia
    Yew, Pen-Chung
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2007, 4382 : 126 - +
  • [2] Exploiting data- and thread-level parallelism for image correlation
    Kadidlo, Juergen
    Strey, Alfred
    PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 407 - +
  • [3] Exploiting speculative thread-level parallelism on a SMT processor
    Marcuello, P
    González, A
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 1999, 1593 : 754 - 763
  • [4] Thread partitioning and value prediction for exploiting speculative thread-level parallelism
    Marcuello, P
    González, A
    Tubella, J
    IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (02) : 114 - 125
  • [5] Balancing thread partition for efficiently exploiting speculative thread-level parallelism
    Wang, Yaobin
    An, Hong
    Liang, Bo
    Wang, Li
    Cong, Ming
    Ren, Yongqing
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2007, 4847 : 40 - 49
  • [6] Exploiting thread-level speculative parallelism with software value prediction
    Li, XF
    Yang, C
    Du, ZH
    Ngai, TF
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 367 - 388
  • [7] Exploiting the thread-level parallelism for BGP on Multi-core
    Gao Lei
    Lai Mingche
    Gong Zhenghu
    CNSR 2008: PROCEEDINGS OF THE 6TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2008, : 510 - 516
  • [8] Exploiting Thread-level Parallelism Based on Banlancing Load for Speculative Multithreading
    Li Yuancheng
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 8 - 11
  • [9] Exploiting thread-level parallelism in the iterative solution of sparse linear systems
    Aliaga, Jose I.
    Bollhoefer, Matthias
    Martin, Alberto F.
    Quintana-Orti, Enrique S.
    PARALLEL COMPUTING, 2011, 37 (03) : 183 - 202
  • [10] Exploiting Thread-Level Parallelism on HEVC by Employing a Reference Dependency Graph
    Kim, Minwoo
    Kim, Deokho
    Kim, Kyungah
    Ro, Won Woo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 736 - 749