Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26272
Title: Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach
Authors: Cheng, H
Wang, Z
Ma, L
Wei, Z
Alsaadi, FE
Liu, X
Keywords: artificial intelligence;network pruning;neural architecture search;Gumbel-softmax sampling;attention mechanism
Issue Date: 27-Mar-2023
Publisher: Spinger Nature
Citation: Cheng, H. et al. (2023) 'Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach', Complex & Intelligent Systems, 9 (5), pp. 5611 - 5624. doi: 10.1007/s40747-023-01022-6.
Abstract: Copyright © The Author(s) 2023. Neural network pruning offers great prospects for facilitating the deployment of deep neural networks on computational resource limited devices. Neural architecture search (NAS) provides an efficient way to automatically seek appropriate neural architecture design for compressed model. It is observed that, for existing NAS-based pruning methods, there is usually a lack of layer information when searching the optimal neural architecture. In this paper, we propose a new NAS approach, namely, differentiable channel pruning method guided via attention mechanism (DCP-A), where the adopted attention mechanism is able to provide layer information to guide the optimization of the pruning policy. The training process is differentiable with Gumbel-softmax sampling, while parameters are optimized under a two-stage training procedure. The neural network block with the shortcut is dedicatedly designed, which is of help to prune the network not only on its width but also on its depth. Extensive experiments are performed to verify the applicability and superiority of the proposed method. Detailed analysis with visualization of the pruned model architecture shows that our proposed DCP-A learns explainable pruning policies.
URI: https://bura.brunel.ac.uk/handle/2438/26272
DOI: https://doi.org/10.1007/s40747-023-01022-6
ISSN: 2199-4536
Other Identifiers: ORCID iDs: Zidong Wang https://orcid.org/0000-0002-9576-7401; Xiaohui Liu https://orcid.org/0000-0003-1589-1267.
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © The Author(s) 2023. Rights and permissions: Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.1.29 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons