Recursive TRMM
• Recur down to L1 cache block size
• Need kernel at bottom of recursion
  ◦ Use gemm-based kernel for portability
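
The recursion described above can be sketched as follows. This is a minimal illustration, not the implementation the slide refers to: it assumes the lower-triangular, left-side case B := L·B, and the names `rtrmm`, `trmm_kernel`, `gemm_acc` and the block size `NB` are hypothetical. The bottom-level kernel is written here as one possible reading of "gemm-based": it copies the small triangular block into a dense block with explicit zeros and then calls the general multiply.

```python
NB = 2  # hypothetical stand-in for the L1 cache block size


def gemm_acc(A, B, C, rows, inner, ncols):
    """C[i][j] += sum over p in inner of A[i][p] * B[p][j], for i in rows."""
    for i in rows:
        for j in range(ncols):
            C[i][j] += sum(A[i][p] * B[p][j] for p in inner)


def trmm_kernel(L, B, lo, hi, ncols):
    """Bottom of the recursion: B[lo:hi] := L[lo:hi, lo:hi] * B[lo:hi].

    Assumed gemm-based form: densify the triangle (zeros above the
    diagonal), then reuse the general matrix multiply.
    """
    n = hi - lo
    T = [[L[lo + i][lo + j] if j <= i else 0.0 for j in range(n)]
         for i in range(n)]
    X = [B[lo + i][:ncols] for i in range(n)]   # copy of the old rows of B
    out = [[0.0] * ncols for _ in range(n)]
    gemm_acc(T, X, out, range(n), range(n), ncols)
    for i in range(n):
        B[lo + i][:ncols] = out[i]


def rtrmm(L, B, lo, hi, ncols, nb=NB):
    """In place: B[lo:hi] := L[lo:hi, lo:hi] * B[lo:hi], L lower triangular."""
    n = hi - lo
    if n <= nb:                      # block fits the (assumed) cache block size
        trmm_kernel(L, B, lo, hi, ncols)
        return
    mid = lo + n // 2
    rtrmm(L, B, mid, hi, ncols, nb)  # B2 := L22 * B2
    # B2 += L21 * B1, using B1 before it is overwritten below
    gemm_acc(L, B, B, range(mid, hi), range(lo, mid), ncols)
    rtrmm(L, B, lo, mid, ncols, nb)  # B1 := L11 * B1
```

Calling `rtrmm(L, B, 0, n, m)` overwrites the n-by-m matrix `B` with the product. The order of the three steps matters: the lower half of `B` is finished first so that the upper half can still be read in its original form by the general multiply.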