News

This paper investigates the ability of deep neural networks (DNNs) to predict a specific room impulse response (RIR) between two given room locations. We use three end-to-end deep learning ...
Images are exposed to deterioration over the years due to many factors. These factors include, but are not limited to, environmental factors, chemical processing, and improper storage. Image inpainting ...
Learning a fine-grained and simultaneous understanding of both the visual content of images and the textual content of questions is at the heart of visual question answering (VQA). In this paper, a novel Multimodal Encoder-Decoder ...
The auditory selection framework with attention and memory (ASAM), which has an attention mechanism, embedding generator, generated embedding array, and life-long memory, is used to deal with mixed ...
In recent years, more and more people suffer from voice-related diseases. Given the limitations of current pathological speech conversion methods, that is, a method can only convert a single kind of ...
In this letter, a model-driven deep learning (DL) decoder for irregular binary low-density parity-check (LDPC) codes is proposed via the alternating direction method of multipliers (ADMM) technique.
Sleep staging serves as a fundamental assessment for sleep quality measurement and sleep disorder diagnosis. Although current deep learning approaches have successfully integrated multimodal sleep ...
Existing deep-learning-based reversible data hiding (RDH) predictors typically adopt standard convolutions for extracting features, which inherently fail to capture contextual information across ...
This letter presents our initial results in deep learning for channel estimation and signal detection in orthogonal frequency-division multiplexing (OFDM) systems. In this letter, we exploit deep ...
This paper proposes DRL-ED-TSPP, a deep reinforcement learning (DRL) model with an Encoder-Decoder architecture, to solve the Traveling Salesman Problem with Profits (TSPP) for sustainable cultural ...
Motivated by the three-blade symmetrical structure of wind turbines (WTs), we propose a new symmetry-aware pitch feature encoder-decoder network named PitchNet. A group feature encoding strategy is first designed to ...
Current security solutions face significant challenges in dealing with the ever-increasing complexity and sophistication of cyber-attacks. This is particularly true for the solutions that inherently ...