DAF-DETR: A dynamic adaptation feature transformer for enhanced object detection in unmanned aerial vehicles

Song, B; Zhao, S; Wang, Z; Liu, W; Liu, X

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/31532

Full metadata record

DC Field	Value	Language
dc.contributor.author	Song, B	-
dc.contributor.author	Zhao, S	-
dc.contributor.author	Wang, Z	-
dc.contributor.author	Liu, W	-
dc.contributor.author	Liu, X	-
dc.date.accessioned	2025-07-10T14:11:13Z	-
dc.date.available	2025-07-10T14:11:13Z	-
dc.date.issued	2025-05-27	-
dc.identifier	ORCiD: Baoye Song https://orcid.org/0000-0003-1631-5237	-
dc.identifier	ORCiD: Zidong Wang https://orcid.org/0000-0002-9576-7401	-
dc.identifier	ORCiD: Weibo Liu https://orcid.org/0000-0002-8169-3261	-
dc.identifier	ORCiD: Xiaohui Liu https://orcid.org/0000-0003-1589-1267	-
dc.identifier	Article number: 113760	-
dc.identifier.citation	Song, B. et al. (2025) 'DAF-DETR: A dynamic adaptation feature transformer for enhanced object detection in unmanned aerial vehicles', Knowledge Based Systems, 323, 113760, pp. 1 - 13. doi: 10.1016/j.knosys.2025.113760.	en_US
dc.identifier.issn	0950-7051	-
dc.identifier.uri	https://bura.brunel.ac.uk/handle/2438/31532	-
dc.description	Data availability: Data will be made available on request.	en_US
dc.description.abstract	Object detection in complex environments is challenged by overlapping objects, complex spatial relationships, and dynamic variations in target scales. To address these challenges, the Dynamic Adaptation Feature DEtection TRansformer (DAF-DETR) is proposed as a novel transformer-based model optimized for real-time detection in spatially complex environments. The framework introduces four key innovations. First, a learnable position encoding mechanism is employed in place of fixed positional encoding, enhancing adaptability and flexibility when processing complex spatial layouts. Second, the Resynthetic Network (ResynNet) backbone, which consists of stacked Resynthetic Blocks (ResynBlocks) integrating ResBlock and FasterBlock feature extraction strategies, is designed to optimize multi-scale feature representation and improve computational efficiency. Third, an enhanced feature fusion module is incorporated to strengthen the detection of small, densely packed objects by integrating multi-scale contextual information. Fourth, a dynamic perception module is introduced, utilizing deformable attention to capture complex spatial relationships between overlapping objects. Extensive experiments conducted on the Vision meets Drone 2019 (VisDrone2019) and Tiny Object Detection in Aerial Images (AI-TOD) datasets demonstrate the superiority of DAF-DETR, achieving state-of-the-art detection accuracy while maintaining real-time efficiency. The results confirm its robustness in handling scale variations, occlusions, and spatial complexity, establishing it as a reliable solution for real-world applications such as aerial imagery and crowded scene analysis.	en_US
dc.description.sponsorship	This work was supported in part by the Natural Science Foundation of Shandong Province of China under Grant ZR2023MF067, the Royal Society of the UK, and the Alexander von Humboldt Foundation of Germany .	en_US
dc.format.extent	1 - 13	-
dc.format.medium	Print-Electronic	-
dc.language	English	-
dc.language.iso	en_US	en_US
dc.publisher	Elsevier	en_US
dc.rights	Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International	-
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/	-
dc.subject	object detection	en_US
dc.subject	complex environments	en_US
dc.subject	tiny object detection	en_US
dc.subject	transformer	en_US
dc.subject	unmanned aerial vehicles	en_US
dc.title	DAF-DETR: A dynamic adaptation feature transformer for enhanced object detection in unmanned aerial vehicles	en_US
dc.type	Article	en_US
dc.date.dateAccepted	2025-05-11	-
dc.identifier.doi	https://doi.org/10.1016/j.knosys.2025.113760	-
dc.relation.isPartOf	Knowledge Based Systems	-
pubs.publication-status	Published	-
pubs.volume	323	-
dc.identifier.eissn	1872-7409	-
dc.rights.license	https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.en	-
dcterms.dateAccepted	2025-05-11	-
dc.rights.holder	Elsevier B.V.	-
Appears in Collections:	Dept of Computer Science Embargoed Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf	Embargoed until 27 May 2026. Copyright © 2025 Elsevier B.V. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/ (see: https://www.elsevier.com/about/policies/sharing).	9.31 MB	Adobe PDF	View/Open

Show simple item record

This item is licensed under a Creative Commons License