MUBLAS

Overview

MUBLAS-GEMV is an optimized implementation of GEMV for NVIDIA GPUs. This implementation automatically adjusts the thread-block size based on the theoretical performance model before launching kernel. This code will be integrated into ASPEN.K2 in the near future.

Download

MUBLAS version 1.5.38 source code
MUBLAS version 1.5.31 source code
MUBLAS version 1.5.24 source code
MUBLAS version 1.5.14 source code 1772525 byte
MUBLAS version 1.4.28 source code 11676498 byte
MUBLAS-GEMV version 1.3.1 source code 487640 byte
MUBLAS-GEMV version 1.3 source code 29466 byte