Document Type
Preprint
Publication Date
12-1-2019
Journal / Book Title
2019 IEEE 5th International Conference on Computer and Communications Iccc 2019
Abstract
Speech enhance methods base on traditional digital signal processing (DSP) algorithms or adaptive filters can effectively suppress stationary noises. However, they don't provide viable solution for the variety of non-stationary noises that exist in our everyday life. Smart voice assistants such as Google Home and Alexa deteriorate their performance mostly due to non-stationary noises. In this paper we introduce CycleGAN ANF, a neural network approach that can learn to reduce both stationary and non-stationary noises, totally unsupervised. CycleGAN ANF is capable of reducing undesired interference by reading in a raw audio sample from a set X (speech mixed with noises) and transforming it so that it sound as if it belongs in set Y (clean speech). Our experiments demonstrate that without labels and when trained on unparalleled; relatively small vocabulary of speech datasets, CycleGAN ANF can achieve significant improvements without the ground assumptions of nature and form of the noise.
DOI
10.1109/ICCC47050.2019.9064433
Montclair State University Digital Commons Citation
Nguyen, Nam Son; Li, Tengpeng; Zhang, Xiaoqian; Sheng, Bo; Wang, Teng; and Wang, Jiayin, "Audio Noise Filter using Cycle Consistent Adversarial Network - CycleGAN ANF" (2019). Department of Computer Science Faculty Scholarship and Creative Works. 701.
https://digitalcommons.montclair.edu/compusci-facpubs/701