News

Abstract: In this paper, we propose a scheme for matrix-matrix multiplication on a distributed-memory parallel computer. The scheme hides almost all of the communication cost with the computation and ...
Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also ...
implement by naive algorithm(without partition) and advanced algorithm(with partition) - AiningWang/Matrix-Multiplication-using-MapReduce ...