News
ASUS teases 'something BIG is coming soon' with a 'top-secret' GeForce RTX 50 series GPU: could be the flagship ROG MATRIX GeForce RTX 5090.
The problem of computing distributed matrix multiplication reliably has been of immense interest for several decades. Recently, it was shown that Polynomial codes achieve the theoretically minimum ...
Approximated Matrix Multiplication (AMM) based on table look-ups can significantly reduce the pressure on computing units and memory bandwidth, and has great potential in large-scale machine learning ...
tritonBLAS: A Lightweight Triton-based General Matrix Multiplication (GEMM) Library. Important: This project is intended for research purposes only. Use it at your own risk and discretion. Triton is a ...
Block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. Additionally, this repo includes code for quantizing PyTorch bf16 matmul with fp8. - GitHub - luongth ...