共 50 条
Parallel Programming and Optimization Based on TMS320C6678
被引:1
|作者:
Mou, Xin-gang
[1
]
Wei, Guo-hua
[1
]
Zhou, Xiao
[1
]
机构:
[1] Wuhan Univ Technol, Sch Mech & Elect Engn, Wuhan 430070, Peoples R China
来源:
关键词:
parallel processing;
OpenMP;
TMS320C6678;
image convolution;
optimization;
compiler intrinsics;
DMA;
D O I:
10.4028/www.scientific.net/AMM.615.259
中图分类号:
TH [机械、仪表工业];
学科分类号:
0802 ;
摘要:
The development of multi-core processors has provided a good solution to applications that require real-time processing and a large number of calculations. However, simply exploiting parallelism in software is hard to make full use of the hardware performance. This paper studies the parallel programming and optimization techniques on TMS320C6678 multicore digital signal processors. We firstly illustrate an implementation of a selected parallel image convolution algorithm by OpenMP. Then several optimization techniques such as compiler intrinsics, cache, DMA are used to further enhance the application performance and achieve a good execution time according to the test results.
引用
收藏
页码:259 / 264
页数:6
相关论文