Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30136
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yang, J | - |
dc.contributor.author | Liu, H | - |
dc.contributor.author | Gan, L | - |
dc.contributor.author | Jing, X | - |
dc.date.accessioned | 2024-11-15T14:21:34Z | - |
dc.date.available | 2024-11-15T14:21:34Z | - |
dc.date.issued | 2024-11-12 | - |
dc.identifier | ORCiD: Hongqing Liu https://orcid.org/0000-0002-2069-0390 | - |
dc.identifier | ORCiD: Lu Gan https://orcid.org/0000-0003-1056-7660 | - |
dc.identifier.citation | Yang, J. et al. (2024) 'Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution', Journal of the Acoustical Society of America, 156 (5), pp. 3143 - 3157. doi: 10.1121/10.0034364. | en_US |
dc.identifier.issn | 0001-4966 | - |
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/30136 | - |
dc.description | Data Availability: Data are available on request to the authors. | en_US |
dc.description.abstract | Speech super-resolution aims to predict a high-resolution speech signal from its low-resolution counterpart. The previous models usually perform this task at a fixed sampling rate, reconstructing only high-frequency spectrogram components and merging them with low-frequency ones in noise-free cases. These methods achieve high accuracy, but they are less effective in real-world settings, where ambient noise and flexible sampling rates are presented. To develop a robust model that fits practical applications, in this work, we introduce Super Denoise Net (SDNet), a neural network for noise-robust super-resolution with flexible input sampling rates. To this end, SDNet's design includes gated and lattice convolution blocks for enhanced repair and temporal-spectral information capture. The frequency transform blocks are employed to model long frequency dependencies, and a multi-scale discriminator is proposed to facilitate the multi-adversarial loss training. The experiments show that SDNet outperforms current state-of-the-art noise-robust speech super-resolution models on multiple test sets, indicating its robustness and effectiveness in real-world scenarios. | en_US |
dc.format.extent | 3143 - 3157 | - |
dc.format.medium | Print-Electronic | - |
dc.language.iso | en_US | en_US |
dc.publisher | Acoustical Society of America | en_US |
dc.rights | Copyright © 2024 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America (see: https://acousticalsociety.org/web-posting-guidelines/ ). The following article appeared in Junkang Yang, Hongqing Liu, Lu Gan, Xiaorong Jing; Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution. J. Acoust. Soc. Am. 1 November 2024; 156 (5): 3143–3157. https://doi.org/10.1121/10.0034364 and may be found at https://pubs.aip.org/asa/jasa/article/156/5/3143/3320008/Spectral-network-based-on-lattice-convolution-and . | - |
dc.rights.uri | https://acousticalsociety.org/web-posting-guidelines/ | - |
dc.subject | speech communication | - |
dc.subject | acoustic noise | - |
dc.subject | speech processing systems | - |
dc.subject | speech recognition | - |
dc.subject | spectrograms | - |
dc.subject | electronic circuits | - |
dc.subject | data analysis | - |
dc.subject | artificial neural networks | - |
dc.subject | signal processing | - |
dc.subject | Fourier analysis | - |
dc.title | Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution | en_US |
dc.type | Article | en_US |
dc.date.dateAccepted | 2024-10-22 | - |
dc.identifier.doi | https://doi.org/10.1121/10.0034364 | - |
dc.relation.isPartOf | Journal of the Acoustical Society of America | - |
pubs.issue | 5 | - |
pubs.publication-status | Published | - |
pubs.volume | 156 | - |
dc.identifier.eissn | 1520-8524 | - |
dc.rights.holder | Acoustical Society of America | - |
Appears in Collections: | Dept of Electronic and Electrical Engineering Embargoed Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2024 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America (see: https://acousticalsociety.org/web-posting-guidelines/ ). The following article appeared in Junkang Yang, Hongqing Liu, Lu Gan, Xiaorong Jing; Spectral network based on lattice convolution and adversarial training for noise-robust speech super-resolution. J. Acoust. Soc. Am. 1 November 2024; 156 (5): 3143–3157. https://doi.org/10.1121/10.0034364 and may be found at https://pubs.aip.org/asa/jasa/article/156/5/3143/3320008/Spectral-network-based-on-lattice-convolution-and . | 5.27 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.