Abstract: We propose an efficient quantum subroutine for matrix multiplication that computes a state vector encoding the entries of the product of two matrices in superposition. The subroutine ...
Abstract: The Transformer has been widely applied across various domains due to its outstanding performance, driving the development of numerous application services. However, in many real-world scenarios ...
This repository contains the CUDA kernels for general matrix-matrix multiplication (GEMM) and the corresponding performance analysis. The correctness of the CUDA kernels is guaranteed for any matrix ...