Background Noise Suppression in Audio File using LSTM Network

W. Shivani Patnaik

doi:10.22214/ijraset.2022.44109

Abstract— In the realm of speech enhancement, noise suppression is a crucial problem. It is especially important in workfrom-home situations where noise reduction may improve communication quality and reduce the cognitive effort of video conferencing. As a result of the advent of deep neural networks, several novel ways for audio processing methods based on deep models have been presented. The goal of the project is to use a stacked Dual signal Transformation LSTM Network (DTLN) to combine both

more »

... analysis and synthesis into one model. The proposed model consists of two separation cores, the first of which employs an Short Term Fourier Transformation (STFT) signal transformation and the second of which employs a learnt signal representation, This arrangement was designed to enable the second core to further improve the signal with phase information while the first core creates a strong magnitude estimation. Due to the complementarity of traditional and learnt features modifications, this combination might give good impacts while preserving a minimal computing footprint, in terms of computational complexity, the stacked network is far less than most previously suggested LSTM networks and assures real-time capabilities.

doi:10.22214/ijraset.2022.44109 fatcat:snqgeixzzraudnais2z6hz6hki

Open Access

Background Noise Suppression in Audio File using LSTM Network

Preserved Fulltext