News

Abstract: This paper investigates the ability of deep neural networks (DNNs) to predict a specific room impulse response (RIR) between two given room locations. We use three end-to-end deep learning ...
Images are exposed to deterioration over years due to many factors. These factors may include, but not limited to, environmental factors, chemical processing, improper storage, etc. Image inpainting ...
Learning a fine-grained and simultaneous understanding of both the visual content of images and the textual content of questions is the heart of VQA. In this paper, a novel Multimodal Encoder-Decoder ...
The auditory selection framework with attention and memory (ASAM), which has an attention mechanism, embedding generator, generated embedding array, and life-long memory, is used to deal with mixed ...
In recent years, more and more people suffer from voice-related diseases. Given the limitations of current pathological speech conversion methods, that is, a method can only convert a single kind of ...
In this letter, a model-driven deep learning (DL) decoder for irregular binary low-density parity-check (LDPC) codes is proposed via the alternating direction method of multipliers (ADMM) technique.
Sleep staging serves as a fundamental assessment for sleep quality measurement and sleep disorder diagnosis. Although current deep learning approaches have successfully integrated multimodal sleep ...
The existing deep learning based reversible data hiding (RDH) predictors typically adopt standard convolutions for extracting features, which inherently fails to capture contextual information across ...