News
Block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. Additionally, this repo includes codes for quantizing Pytorch bf16 matmul with fp8. - GitHub - luongth ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results