Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement

Ban M. Alameri; Inas Jawad Kadhim; Suha Qasim Hadi; Ali F. Hassoon; Mustafa M. Abd; Prashan Premaratne

doi:10.3991/ijoe.v19i04.37577

Authors

Ban M. Alameri Department of Electrical Engineering, Faculty of Engineering, Mustansiriyah University, Baghdad, Iraq
Inas Jawad Kadhim Electrical Engineering Technical College, Middle Technical University, Baghdad, Iraq
Suha Qasim Hadi Department of Electrical Engineering, Faculty of Engineering, Mustansiriyah University, Baghdad, Iraq
Ali F. Hassoon Department of Electrical Engineering, Faculty of Engineering, Mustansiriyah University, Baghdad, Iraq
Mustafa M. Abd Department of Electrical Engineering, Faculty of Engineering, Mustansiriyah University, Baghdad, Iraq
Prashan Premaratne School of Electrical and Computer and Telecommunications Engineering, University of Wollongong, North Wollongong, NSW, 2522, Australia

DOI:

https://doi.org/10.3991/ijoe.v19i04.37577

Keywords:

Speech enhancement, deep learning, fully connected network, Convolutional network, signal-to-noise ratio (SNR).

Abstract

The speech signal that is received in real-time has background noise and reverberations, which have an impact on the quality of speech. Therefore, it is crucial to reduce or eliminate the noise and increase the intelligibility and quality of speech signals. In this study, a proposed method that is the most effective and challenging in a low SNR environment for three types of noise are removed, including washing machine, traffic noise, and electric fan noise, and clean speech is recovered. with three samples of noise which are mixed and added to the clean speech signal with a lower level of SNR value fixed at (-5, 0, 5) dBs, that noise source takes equal weights. The enhancement of the corrupted speech signal is done by applying a fully connected and convolutional neural network-based denoising algorithm and comparing their performance. The proposed network shows that a fully connected network (FCN) has less elapsed time than a convolutional network (CNN) while still achieving better performance, demonstrating its applicability for an embedded system. Also, the results obtained show that, overall, the CNN is better than the FCN regarding maximum coloration, PSNR, MES, and STOI.

Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Rankings

Other journals