News

End-to-end (E2E) models, including attention-based encoder-decoder (AED) models, have achieved promising performance on automatic speech recognition (ASR) tasks. However, the supervised ...
Additionally, MSEED incorporates a simple vanilla encoder-decoder model to strengthen rolling predictions. The framework has been tested on four challenging real-world datasets, focusing on two ...