Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/30612
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhao, R-
dc.contributor.authorFan, Y-
dc.contributor.authorLi, Y-
dc.contributor.authorZhang, D-
dc.contributor.authorGao, F-
dc.contributor.authorGao, Z-
dc.contributor.authorYang, Z-
dc.date.accessioned2025-01-30T09:23:54Z-
dc.date.available2025-01-30T09:23:54Z-
dc.date.issued2025-01-01-
dc.identifierORCiD: Rui Zhao https://orcid.org/0000-0003-1597-1961-
dc.identifierORCiD: Yuze Fan https://orcid.org/0009-0003-8309-7396-
dc.identifierORCiD: Yun Li https://orcid.org/0009-0002-7824-7751-
dc.identifierORCiD: Dong Zhang https://orcid.org/0000-0002-4974-4671-
dc.identifierORCiD: Fei Gao https://orcid.org/0000-0003-4195-5033-
dc.identifierORCiD: Zhenhai Gao https://orcid.org/0000-0002-4623-3956-
dc.identifierORCiD: Zhengcai Yang https://orcid.org/0000-0002-6793-5666-
dc.identifierArticle number 191-
dc.identifier.citationZhao, R. et al. (2025) 'Knowledge Distillation-Enhanced Behavior Transformer for Decision-Making of Autonomous Driving', Sensors, 25 (1), 191, pp. 1 - 26. doi: 10.3390/s25010191.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/30612-
dc.descriptionData Availability Statement: The data presented in this study are available upon request from the corresponding author.en_US
dc.description.abstractAutonomous driving has demonstrated impressive driving capabilities, with behavior decision-making playing a crucial role as a bridge between perception and control. Imitation Learning (IL) and Reinforcement Learning (RL) have introduced innovative approaches to behavior decision-making in autonomous driving, but challenges remain. On one hand, RL’s policy networks often lack sufficient reasoning ability to make optimal decisions in highly complex and stochastic environments. On the other hand, the complexity of these environments leads to low sample efficiency in RL, making it difficult to efficiently learn driving policies. To address these challenges, we propose an innovative Knowledge Distillation-Enhanced Behavior Transformer (KD-BeT) framework. Building on the successful application of Transformers in large language models, we introduce the Behavior Transformer as the policy network in RL, using observation–action history as input for sequential decision-making, thereby leveraging the Transformer’s contextual reasoning capabilities. Using a teacher–student paradigm, we first train a small-capacity teacher model quickly and accurately through IL, then apply knowledge distillation to accelerate RL’s training efficiency and performance. Simulation results demonstrate that KD-BeT maintains fast convergence and high asymptotic performance during training. In the CARLA NoCrash benchmark tests, KD-BeT outperforms other state-of-the-art methods in terms of traffic efficiency and driving safety, offering a novel solution for addressing real-world autonomous driving tasks.en_US
dc.description.sponsorshipThis work was supported by the National Science Foundation of China under grants 52202495, 52202494, and 52394261; the Science and Technology Development Project of Jilin Province (No. 202302013); and the Open Fund Project of the Key Laboratory of Automotive Power Train and Electronics (Hubei University of Automotive Technology) (ZDK1202402).en_US
dc.format.extent1 - 26-
dc.format.mediumElectronic-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.publisherMDPIen_US
dc.rightsAttribution 4.0 International-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectimitation learningen_US
dc.subjectreinforcement learningen_US
dc.subjectbehavior transformeren_US
dc.subjectautonomous drivingen_US
dc.subjectknowledge distillationen_US
dc.subjectdecision-makingen_US
dc.titleKnowledge Distillation-Enhanced Behavior Transformer for Decision-Making of Autonomous Drivingen_US
dc.typeArticleen_US
dc.date.dateAccepted2024-12-31-
dc.identifier.doihttps://doi.org/10.3390/s25010191-
dc.relation.isPartOfSensors-
pubs.issue1-
pubs.publication-statusPublished-
pubs.volume25-
dc.identifier.eissn1424-8220-
dc.rights.licensehttps://creativecommons.org/licenses/by/4.0/legalcode.en-
dcterms.dateAccepted2024-12-31-
dc.rights.holderThe authors-
Appears in Collections:Dept of Mechanical and Aerospace Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).39.55 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons