MUBLAS | PROJECTS | Large-scale Parallel Numerical Computing Technology Research Team

Overview

MUBLAS-GEMV is an optimized implementation of GEMV for NVIDIA GPUs. This implementation automatically adjusts the thread-block size based on the theoretical performance model before launching kernel. This code will be integrated into ASPEN.K2 in the near future.

Download

MUBLAS version 1.5.38	source code
MUBLAS version 1.5.31	source code
MUBLAS version 1.5.24	source code
MUBLAS version 1.5.14	source code	1772525 byte
MUBLAS version 1.4.28	source code	11676498 byte
MUBLAS-GEMV version 1.3.1	source code	487640 byte
MUBLAS-GEMV version 1.3	source code	29466 byte