Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/31532
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Song, B | - |
dc.contributor.author | Zhao, S | - |
dc.contributor.author | Wang, Z | - |
dc.contributor.author | Liu, W | - |
dc.contributor.author | Liu, X | - |
dc.date.accessioned | 2025-07-10T14:11:13Z | - |
dc.date.available | 2025-07-10T14:11:13Z | - |
dc.date.issued | 2025-05-27 | - |
dc.identifier | ORCiD: Baoye Song https://orcid.org/0000-0003-1631-5237 | - |
dc.identifier | ORCiD: Zidong Wang https://orcid.org/0000-0002-9576-7401 | - |
dc.identifier | ORCiD: Weibo Liu https://orcid.org/0000-0002-8169-3261 | - |
dc.identifier | ORCiD: Xiaohui Liu https://orcid.org/0000-0003-1589-1267 | - |
dc.identifier | Article number: 113760 | - |
dc.identifier.citation | Song, B. et al. (2025) 'DAF-DETR: A dynamic adaptation feature transformer for enhanced object detection in unmanned aerial vehicles', Knowledge Based Systems, 323, 113760, pp. 1 - 13. doi: 10.1016/j.knosys.2025.113760. | en_US |
dc.identifier.issn | 0950-7051 | - |
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/31532 | - |
dc.description | Data availability: Data will be made available on request. | en_US |
dc.description.abstract | Object detection in complex environments is challenged by overlapping objects, complex spatial relationships, and dynamic variations in target scales. To address these challenges, the Dynamic Adaptation Feature DEtection TRansformer (DAF-DETR) is proposed as a novel transformer-based model optimized for real-time detection in spatially complex environments. The framework introduces four key innovations. First, a learnable position encoding mechanism is employed in place of fixed positional encoding, enhancing adaptability and flexibility when processing complex spatial layouts. Second, the Resynthetic Network (ResynNet) backbone, which consists of stacked Resynthetic Blocks (ResynBlocks) integrating ResBlock and FasterBlock feature extraction strategies, is designed to optimize multi-scale feature representation and improve computational efficiency. Third, an enhanced feature fusion module is incorporated to strengthen the detection of small, densely packed objects by integrating multi-scale contextual information. Fourth, a dynamic perception module is introduced, utilizing deformable attention to capture complex spatial relationships between overlapping objects. Extensive experiments conducted on the Vision meets Drone 2019 (VisDrone2019) and Tiny Object Detection in Aerial Images (AI-TOD) datasets demonstrate the superiority of DAF-DETR, achieving state-of-the-art detection accuracy while maintaining real-time efficiency. The results confirm its robustness in handling scale variations, occlusions, and spatial complexity, establishing it as a reliable solution for real-world applications such as aerial imagery and crowded scene analysis. | en_US |
dc.description.sponsorship | This work was supported in part by the Natural Science Foundation of Shandong Province of China under Grant ZR2023MF067, the Royal Society of the UK, and the Alexander von Humboldt Foundation of Germany . | en_US |
dc.format.extent | 1 - 13 | - |
dc.format.medium | Print-Electronic | - |
dc.language | English | - |
dc.language.iso | en_US | en_US |
dc.publisher | Elsevier | en_US |
dc.rights | Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International | - |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | - |
dc.subject | object detection | en_US |
dc.subject | complex environments | en_US |
dc.subject | tiny object detection | en_US |
dc.subject | transformer | en_US |
dc.subject | unmanned aerial vehicles | en_US |
dc.title | DAF-DETR: A dynamic adaptation feature transformer for enhanced object detection in unmanned aerial vehicles | en_US |
dc.type | Article | en_US |
dc.date.dateAccepted | 2025-05-11 | - |
dc.identifier.doi | https://doi.org/10.1016/j.knosys.2025.113760 | - |
dc.relation.isPartOf | Knowledge Based Systems | - |
pubs.publication-status | Published | - |
pubs.volume | 323 | - |
dc.identifier.eissn | 1872-7409 | - |
dc.rights.license | https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.en | - |
dcterms.dateAccepted | 2025-05-11 | - |
dc.rights.holder | Elsevier B.V. | - |
Appears in Collections: | Dept of Computer Science Embargoed Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Embargoed until 27 May 2026. Copyright © 2025 Elsevier B.V. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/ (see: https://www.elsevier.com/about/policies/sharing). | 9.31 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License