An efficient parallel-processing method for transposing large matrices in place

被引:11
|
作者
Portnoff, MR [1 ]
机构
[1] Univ Calif Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
关键词
corner turn; fast algorithms; image formation; image processing; matrix transpose; parallel processing; seismic migration; synthetic aperture radar;
D O I
10.1109/83.784438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have developed an efficient algorithm for transposing large matrices in place. The algorithm is efficient because data are accessed either sequentially in blocks or randomly within blocks small enough to fit in cache, and because the same indexing calculations are shared among identical procedures operating on independent subsets of the data. This inherent parallelism makes the method well suited for a multiprocessor computing environment. The algorithm is easy to implement because the same two procedures are applied to the data in various groupings to carry out the complete transpose operation. Using only a single processor, we have demonstrated nearly an order of magnitude increase in speed over the previously published algorithm by Gate and Twigg for transposing a large rectangular matrix in place. With multiple processors operating in parallel, the processing speed increases almost linearly with the number of processors. A simplified version of the algorithm for square matrices is presented as well as an extension for matrices large enough to require virtual memory.
引用
收藏
页码:1265 / 1275
页数:11
相关论文
共 50 条