Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/25756
Full metadata record
DC Field | Value | Language
dc.contributor.author | Wang, Z | -
dc.contributor.author | Zhou, Y | -
dc.contributor.author | Gan, L | -
dc.contributor.author | Chen, R | -
dc.contributor.author | Tang, X | -
dc.contributor.author | Liu, H | -
dc.date.accessioned | 2023-01-11T15:16:16Z | -
dc.date.available | 2023-01-11T15:16:16Z | -
dc.date.issued | 2022-10-25 | -
dc.identifier | ORCID iD: Lu Gan https://orcid.org/0000-0003-1056-7660 | -
dc.identifier.citation | Wang, Z. et al. (2022) 'DE-DPCTnet: Deep Encoder Dual-path Convolutional Transformer Network for Multi-channel Speech Separation', 2022 IEEE Workshop on Signal Processing Systems (SiPS), Rennes, France, 02-04 November, pp. 1-5. doi: 10.1109/SiPS55645.2022.9919247. | en_US
dc.identifier.isbn | 978-1-6654-8524-1 (ebk) | -
dc.identifier.isbn | 978-1-6654-8525-8 (PoD) | -
dc.identifier.issn | 1520-6130 | -
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/25756 | -
dc.description.abstract | In recent years, beamforming has been extensively investigated for the multi-channel speech separation task. In this paper, we propose a deep encoder dual-path convolutional transformer network (DE-DPCTnet), which directly estimates the beamforming filters for speech separation in the time domain. To learn signal representations correctly, a nonlinear deep encoder module is proposed to replace the traditional linear one. An improved transformer is also developed, which uses convolutions to capture long speech sequences. Ablation studies demonstrate that the deep encoder and the improved transformer indeed benefit separation performance. Comparisons show that DE-DPCTnet outperforms the state-of-the-art filter-and-sum network with transform-average-concatenate module (FaSNet-TAC), even with lower computational complexity. | en_US
dc.format.extent | 1 - 5 | -
dc.format.medium | Print-Electronic | -
dc.language.iso | en_US | en_US
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US
dc.rights | Copyright © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | -
dc.rights.uri | https://www.ieee.org/publications/rights/rights-policies.html | -
dc.subject | speech separation | en_US
dc.subject | multi-channel | en_US
dc.subject | deep encoder | en_US
dc.subject | improved transformer | en_US
dc.subject | beamforming | en_US
dc.title | DE-DPCTnet: Deep Encoder Dual-path Convolutional Transformer Network for Multi-channel Speech Separation | en_US
dc.type | Conference Paper | en_US
dc.identifier.doi | https://doi.org/10.1109/SiPS55645.2022.9919247 | -
dc.relation.isPartOf | 2022 IEEE Workshop on Signal Processing Systems (SiPS) | -
pubs.publication-status | Published | -
pubs.volume | 2022 | -
dc.rights.holder | Institute of Electrical and Electronics Engineers (IEEE) | -
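
The abstract above names two architectural ingredients: a nonlinear deep encoder that replaces the traditional linear waveform encoder, and an improved transformer that uses convolutions to capture long speech sequences. The sketch below is a minimal, hypothetical PyTorch rendering of those two ideas; the module names (DeepEncoder, ConvTransformerBlock), layer sizes, activations, and layer ordering are illustrative assumptions and are not taken from the authors' DE-DPCTnet implementation.

```python
# Illustrative sketch only: a nonlinear "deep encoder" (stacked Conv1d + PReLU in
# place of a single linear Conv1d encoder) and a transformer block whose
# feed-forward path is built from 1-D convolutions. All hyperparameters and the
# layer ordering are assumptions, not the authors' DE-DPCTnet design.
import torch
import torch.nn as nn


class DeepEncoder(nn.Module):
    """Stacked Conv1d + PReLU layers: a nonlinear encoder of the raw waveform."""

    def __init__(self, in_ch=1, dim=64, kernel_size=16, stride=8, depth=3):
        super().__init__()
        layers = [nn.Conv1d(in_ch, dim, kernel_size, stride=stride), nn.PReLU()]
        for _ in range(depth - 1):
            layers += [nn.Conv1d(dim, dim, 3, padding=1), nn.PReLU()]
        self.net = nn.Sequential(*layers)

    def forward(self, wav):            # wav: (batch, channels, samples)
        return self.net(wav)           # -> (batch, dim, frames)


class ConvTransformerBlock(nn.Module):
    """Self-attention followed by a depthwise-convolutional feed-forward path."""

    def __init__(self, dim=64, heads=4, kernel_size=3):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.conv_ffn = nn.Sequential(  # convolutions in place of the linear FFN
            nn.Conv1d(dim, dim, kernel_size, padding=kernel_size // 2, groups=dim),
            nn.PReLU(),
            nn.Conv1d(dim, dim, 1),
        )

    def forward(self, x):                          # x: (batch, frames, dim)
        h = self.norm1(x)
        h, _ = self.attn(h, h, h, need_weights=False)
        x = x + h                                  # residual around attention
        h = self.norm2(x).transpose(1, 2)          # (batch, dim, frames) for Conv1d
        return x + self.conv_ffn(h).transpose(1, 2)


if __name__ == "__main__":
    wav = torch.randn(2, 1, 16000)                 # 2 utterances, 1 channel, 1 s at 16 kHz
    feats = DeepEncoder()(wav).transpose(1, 2)     # (batch, frames, dim)
    out = ConvTransformerBlock()(feats)
    print(out.shape)                               # torch.Size([2, 1999, 64])
```

Replacing the position-wise feed-forward layer with a depthwise-plus-pointwise convolution is one common way to add local temporal modelling to a transformer block; the paper's actual block design, and how such blocks are stacked along the dual paths, are described in the full text.
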
Appears in Collections: Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File | Description | Size | Format
FullText.pdf | Copyright © 2022 IEEE (rights statement as above) | 448.5 kB | Adobe PDF


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.