News

Abstract: Matrix multiplication computation (MMC) is a fundamental operation with various applications, including linear regression, k-nearest neighbor classification and biometric identification.
On a B200, the nvjet_tst_16x64_64x16_4x1_v_bz_TNN kernel is used, and it takes roughly 8.1 microseconds. On a H200, the nvjet_tst_64x8_64x16_4x1_v_bz_TNT kernel is ...
Numerical linear algebra for quaternions — fast, practical, and well‑tested. QuatIca was inspired by the pioneering work in quaternion linear algebra, particularly the QTFM (Quaternion Toolbox for ...
Abstract: Matrix multiplication is a crucial operation in many data-intensive workloads. Given the large size of matrices in today's workloads, it is common to split the computation into tasks ...