Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/29929
Title: A Lightweight Object Counting Network Based On Density Map Knowledge Distillation
Authors: Shen, Z
Li, G
Xia, R
Meng, H
Huang, Z
Keywords: object counting;lightweight;distillation;feature fusion;multi-scale
Issue Date: 30-Sep-2024
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Citation: Shen, Z. et al. (2024) 'A Lightweight Object Counting Network Based On Density Map Knowledge Distillation', IEEE Transactions on Circuits and Systems for Video Technology, 35 (2), pp. 1492 - 1505. doi: 10.1109/tcsvt.2024.3469933.
Abstract: Object counting aims to count the accurate number of object instances in images, and its operation efficiency is essential. However, most current CNN-based methods rely on complex network architectures, which results in them consuming a significant amount of memory, time, and other resources at runtime. This seriously limits their deployment in practical application scenarios, such as public safety and agriculture planting. Therefore, we propose a lightweight object counting method named EdgeCount to effectively balance inference speed and object counting accuracy. Specifically, we construct a network composed of a student model (EdgeCount) and a teacher model (EdgeCount-T) with the same encoder-decoder structure based on density map knowledge distillation (DMKD), allowing the EdgeCount to learn object density distribution from the EdgeCount-T. After that, we introduce spatial and channel reconstruction convolution (SCConv), composed of a spatial reconstruction unit (SRU) and a channel reconstruction unit (CRU), to decrease spatial and channel redundancy with lower computational costs. Moreover, a low parameter weighted multi-scale feature fusion module (LWMFFM) is designed to further improve the countering ability through segmenting minor structural discrepacies among multi-scale features. Extensive experiments conducted on challenging remote sensing and dense crowd object counting datasets demonstrate the effectiveness and superiority of our method. In particular, under the four NVIDIA Jetson devices, EdgeCount can accurately counter objects with only 0.12M parameters and 19.87M floating-point operations per second (FLOPs) in the size of 128, which achieves the lowest latency and fastest FPS compared with other state-of-the-art object counters.
URI: https://bura.brunel.ac.uk/handle/2438/29929
DOI: https://doi.org/10.1109/tcsvt.2024.3469933
ISSN: 1051-8215
Other Identifiers: ORCiD: Guoquan Li https://orcid.org/0000-0001-8022-743X
ORCiD: Ruiyang Xia https://orcid.org/0000-0002-2421-9512
ORCiD: Hongying Meng https://orcid.org/0000-0002-8836-1382
ORCiD: Zhengwen Huang https://orcid.org/0000-0003-2426-242X
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2024 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works (see: https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/).23.7 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.