Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/29929
Title: | A Lightweight Object Counting Network Based On Density Map Knowledge Distillation |
Authors: | Shen, Z Li, G Xia, R Meng, H Huang, Z |
Keywords: | object counting;lightweight;distillation;feature fusion;multi-scale |
Issue Date: | 30-Sep-2024 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Citation: | Shen, Z. et al. (2024) 'A Lightweight Object Counting Network Based On Density Map Knowledge Distillation', IEEE Transactions on Circuits and Systems for Video Technology, 35 (2), pp. 1492 - 1505. doi: 10.1109/tcsvt.2024.3469933. |
Abstract: | Object counting aims to count the accurate number of object instances in images, and its operation efficiency is essential. However, most current CNN-based methods rely on complex network architectures, which results in them consuming a significant amount of memory, time, and other resources at runtime. This seriously limits their deployment in practical application scenarios, such as public safety and agriculture planting. Therefore, we propose a lightweight object counting method named EdgeCount to effectively balance inference speed and object counting accuracy. Specifically, we construct a network composed of a student model (EdgeCount) and a teacher model (EdgeCount-T) with the same encoder-decoder structure based on density map knowledge distillation (DMKD), allowing the EdgeCount to learn object density distribution from the EdgeCount-T. After that, we introduce spatial and channel reconstruction convolution (SCConv), composed of a spatial reconstruction unit (SRU) and a channel reconstruction unit (CRU), to decrease spatial and channel redundancy with lower computational costs. Moreover, a low parameter weighted multi-scale feature fusion module (LWMFFM) is designed to further improve the countering ability through segmenting minor structural discrepacies among multi-scale features. Extensive experiments conducted on challenging remote sensing and dense crowd object counting datasets demonstrate the effectiveness and superiority of our method. In particular, under the four NVIDIA Jetson devices, EdgeCount can accurately counter objects with only 0.12M parameters and 19.87M floating-point operations per second (FLOPs) in the size of 128, which achieves the lowest latency and fastest FPS compared with other state-of-the-art object counters. |
URI: | https://bura.brunel.ac.uk/handle/2438/29929 |
DOI: | https://doi.org/10.1109/tcsvt.2024.3469933 |
ISSN: | 1051-8215 |
Other Identifiers: | ORCiD: Guoquan Li https://orcid.org/0000-0001-8022-743X ORCiD: Ruiyang Xia https://orcid.org/0000-0002-2421-9512 ORCiD: Hongying Meng https://orcid.org/0000-0002-8836-1382 ORCiD: Zhengwen Huang https://orcid.org/0000-0003-2426-242X |
Appears in Collections: | Dept of Electronic and Electrical Engineering Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2024 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works (see: https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/). | 23.7 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.