News

Platforms: rocm This test was disabled because it is failing in CI. See recent examples and the most recent trunk workflow logs. Over the past 3 hours, it has been determined flaky in 10 workflow(s ...
Secondly, we propose a distributed graph-tensor completion algorithm and unfold it into a deep neural network called GT-FISTA-Net. GT-FISTA-Net requires small communication costs for distributed model ...
As graph data becomes increasingly prevalent in mobile computing scenarios, deploying Graph Convolutional Network-based self-supervised learning (GCN-SSL) models on mobile devices provides a powerful ...
Testing the Qwen2.5 VL-3B model using TRTLLM version 0.19.0, following the PyTorch workflow example , running with the use_cuda_graph parameter resulted in only a few generated tokens. Removing the ...
We investigate a novel approach to approximate tensor-network contraction via the exact, matrix-free decomposition of full tensor-networks. We study this method as a means to eliminate the propagat ...