News

It can train models in a distributed manner while ... also use the fused optimizer provided by BMTrain for mixed-precision training. BMTrain also provides the common LRScheduler in the ...
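
As a rough illustration of what a learning-rate scheduler of this kind computes, the sketch below implements linear warmup followed by inverse-square-root decay in plain Python. It does not call BMTrain and is not taken from its source; the function name `warmup_inverse_sqrt_lr` and its parameters are hypothetical, chosen only for this example.

```python
def warmup_inverse_sqrt_lr(step, base_lr, warmup_steps):
    """Hypothetical helper: learning rate at a given 1-based training step.

    Assumption for illustration only (not BMTrain's implementation):
    the rate rises linearly to base_lr over warmup_steps, then decays
    proportionally to 1 / sqrt(step).
    """
    if step < 1 or warmup_steps < 1:
        raise ValueError("step and warmup_steps must be >= 1")
    if step < warmup_steps:
        # Linear warmup: a fraction step / warmup_steps of the base rate.
        return base_lr * step / warmup_steps
    # Inverse-square-root decay, continuous with the warmup at step == warmup_steps.
    return base_lr * (warmup_steps / step) ** 0.5


if __name__ == "__main__":
    # Print the schedule at a few steps to see the warmup and the decay.
    for s in (1, 50, 100, 400, 1600):
        print(s, warmup_inverse_sqrt_lr(s, base_lr=1e-3, warmup_steps=100))
```

In a real training loop, the value returned for each step would be written into the optimizer's learning rate before the parameter update; a library scheduler normally does this for you.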