next up previous contents
Next: L1 matmul macro definitions Up: Speeding up the Level Previous: Building the General Matrix   Contents

The L1 matmul

So, there are actually three L1 matmul kernels (corresponding to different $\beta$ values), and they are: $C \leftarrow A^T B$, $C \leftarrow A^T B + C$, $C \leftarrow A^T B + \beta C$. All input arrays ($A, B, C$) are column-major (they are still used as performance kernels for row-major BLAS as well, so don't worry). Additionally, $A^T$ and $B$ are in block-major format, such that $lda = ldb = M = N = K = N_B$.


R. Clint Whaley 2001-08-04