next up previous contents
Next: 1.7 Basic Linear Algebra Up: 1.6 Implementation of Basic Previous: 1.6.1 Matrix-matrix multiplication

1.6.2 Attaining better performance

    The above approach will not in general yield high performance, since all local operations are performed using matrix-vector operations. Better (near peak) performance can be attained by replacing matrix-vector multiplication by   matrix-panel-of-vectors multiplication, and rank-1 updates by   rank-k updates. The algorithms outlined above can be easily altered to accommodate this, as will be explained in detail in Chapter gif.


next up previous contents
Next: 1.7 Basic Linear Algebra Up: 1.6 Implementation of Basic Previous: 1.6.1 Matrix-matrix multiplication

rvdg@cs.utexas.edu