Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution

Yang, J; Liu, H; Gan, L; Jing, X

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/30136

Full metadata record

DC Field	Value	Language
dc.contributor.author	Yang, J	-
dc.contributor.author	Liu, H	-
dc.contributor.author	Gan, L	-
dc.contributor.author	Jing, X	-
dc.date.accessioned	2024-11-15T14:21:34Z	-
dc.date.available	2024-11-15T14:21:34Z	-
dc.date.issued	2024-11-12	-
dc.identifier	ORCiD: Hongqing Liu https://orcid.org/0000-0002-2069-0390	-
dc.identifier	ORCiD: Lu Gan https://orcid.org/0000-0003-1056-7660	-
dc.identifier.citation	Yang, J. et al. (2024) 'Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution', Journal of the Acoustical Society of America, 156 (5), pp. 3143 - 3157. doi: 10.1121/10.0034364.	en_US
dc.identifier.issn	0001-4966	-
dc.identifier.uri	https://bura.brunel.ac.uk/handle/2438/30136	-
dc.description	Data Availability: Data are available on request to the authors.	en_US
dc.description.abstract	Speech super-resolution aims to predict a high-resolution speech signal from its low-resolution counterpart. The previous models usually perform this task at a fixed sampling rate, reconstructing only high-frequency spectrogram components and merging them with low-frequency ones in noise-free cases. These methods achieve high accuracy, but they are less effective in real-world settings, where ambient noise and flexible sampling rates are presented. To develop a robust model that fits practical applications, in this work, we introduce Super Denoise Net (SDNet), a neural network for noise-robust super-resolution with flexible input sampling rates. To this end, SDNet's design includes gated and lattice convolution blocks for enhanced repair and temporal-spectral information capture. The frequency transform blocks are employed to model long frequency dependencies, and a multi-scale discriminator is proposed to facilitate the multi-adversarial loss training. The experiments show that SDNet outperforms current state-of-the-art noise-robust speech super-resolution models on multiple test sets, indicating its robustness and effectiveness in real-world scenarios.	en_US
dc.format.extent	3143 - 3157	-
dc.format.medium	Print-Electronic	-
dc.language.iso	en_US	en_US
dc.publisher	Acoustical Society of America	en_US
dc.rights	Copyright © 2024 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America (see: https://acousticalsociety.org/web-posting-guidelines/ ). The following article appeared in Junkang Yang, Hongqing Liu, Lu Gan, Xiaorong Jing; Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution. J. Acoust. Soc. Am. 1 November 2024; 156 (5): 3143–3157. https://doi.org/10.1121/10.0034364 and may be found at https://pubs.aip.org/asa/jasa/article/156/5/3143/3320008/Spectral-network-based-on-lattice-convolution-and .	-
dc.rights.uri	https://acousticalsociety.org/web-posting-guidelines/	-
dc.subject	speech communication	-
dc.subject	acoustic noise	-
dc.subject	speech processing systems	-
dc.subject	speech recognition	-
dc.subject	spectrograms	-
dc.subject	electronic circuits	-
dc.subject	data analysis	-
dc.subject	artificial neural networks	-
dc.subject	signal processing	-
dc.subject	Fourier analysis	-
dc.title	Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution	en_US
dc.type	Article	en_US
dc.date.dateAccepted	2024-10-22	-
dc.identifier.doi	https://doi.org/10.1121/10.0034364	-
dc.relation.isPartOf	Journal of the Acoustical Society of America	-
pubs.issue	5	-
pubs.publication-status	Published	-
pubs.volume	156	-
dc.identifier.eissn	1520-8524	-
dc.rights.holder	Acoustical Society of America	-
Appears in Collections:	Department of Electronic and Electrical Engineering Embargoed Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf	Copyright © 2024 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America (see: https://acousticalsociety.org/web-posting-guidelines/ ). The following article appeared in Junkang Yang, Hongqing Liu, Lu Gan, Xiaorong Jing; Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution. J. Acoust. Soc. Am. 1 November 2024; 156 (5): 3143–3157. https://doi.org/10.1121/10.0034364 and may be found at https://pubs.aip.org/asa/jasa/article/156/5/3143/3320008/Spectral-network-based-on-lattice-convolution-and .	5.27 MB	Adobe PDF	View/Open

Show simple item record