Architecture optimization for multimedia application exploiting data and thread-level parallelism

被引：3

作者：

Limousin, C ^{[1
]}

Sebot, J ^{[1
]}

Vartanian, A ^{[1
]}

Drach, N ^{[1
]}

机构：

[1] Univ Paris 11, LRI, F-91405 Orsay, France

来源：

JOURNAL OF SYSTEMS ARCHITECTURE | 2005年 / 51卷 / 01期

关键词：

SIMD; SMT; superscalar processor; memory hierarchy; multimedia;

D O I：

10.1016/j.sysarc.2004.06.002

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The characteristics of multimedia applications when executed on general-purpose processors are not well understood. Such knowledge is extremely important in guiding the development of multimedia applications and the design of future processors. In this paper, we characterize and optimize the performance of multimedia applications on superscalar processor exploiting data-level parallelism and thread-level parallelism with SIMD (Single Instruction Multiple Data) and SMT (Simultaneous MultiThreading) capacities. We show that SMT and SIMD superscalar processor is suitable for 3D geometry application and we characterize the execution in term of memory hierarchy, which is the main bottleneck. The results show that the latency is not fully recovered by SMT; the use of second-level data prefetching does not succeed in increasing the performance. With detailed analysis, we show that this problem comes from a pollution of the instruction window by the threads experiencing second-level cache misses, thus reducing the window available for the other threads. We thus propose a hardware mechanism (an architecture optimization) to predict second-level misses and control this pollution. (C) 2004 Elsevier B.V. All rights reserved.

引用

页码：15 / 27

页数：13

共 50 条

[1] Exploiting speculative thread-level parallelism in data compression applications
Wang, Shengyue
Zhai, Antonia
Yew, Pen-Chung
LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2007, 4382 : 126 - +
[2] Exploiting data- and thread-level parallelism for image correlation
Kadidlo, Juergen
Strey, Alfred
PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 407 - +
[3] Exploiting speculative thread-level parallelism on a SMT processor
Marcuello, P
González, A
HIGH-PERFORMANCE COMPUTING AND NETWORKING, PROCEEDINGS, 1999, 1593 : 754 - 763
[4] Thread partitioning and value prediction for exploiting speculative thread-level parallelism
Marcuello, P
González, A
Tubella, J
IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (02) : 114 - 125
[5] Balancing thread partition for efficiently exploiting speculative thread-level parallelism
Wang, Yaobin
An, Hong
Liang, Bo
Wang, Li
Cong, Ming
Ren, Yongqing
ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2007, 4847 : 40 - 49
[6] Exploiting thread-level speculative parallelism with software value prediction
Li, XF
Yang, C
Du, ZH
Ngai, TF
ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 367 - 388
[7] Exploiting the thread-level parallelism for BGP on Multi-core
Gao Lei
Lai Mingche
Gong Zhenghu
CNSR 2008: PROCEEDINGS OF THE 6TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2008, : 510 - 516
[8] Exploiting Thread-level Parallelism Based on Banlancing Load for Speculative Multithreading
Li Yuancheng
ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 8 - 11
[9] Exploiting thread-level parallelism in the iterative solution of sparse linear systems
Aliaga, Jose I.
Bollhoefer, Matthias
Martin, Alberto F.
Quintana-Orti, Enrique S.
PARALLEL COMPUTING, 2011, 37 (03) : 183 - 202
[10] Exploiting Thread-Level Parallelism on HEVC by Employing a Reference Dependency Graph
Kim, Minwoo
Kim, Deokho
Kim, Kyungah
Ro, Won Woo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 736 - 749

← 1 2 3 4 5 →