Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/24394
Title: Deterministic policy optimization with clipped value expansion and long-horizon planning
Authors: Gao, S
Shi, H
Wang, F
Wang, Z
Zhang, S
Li, Y
Sun, Y
Keywords: model-based reinforcement learning;policy gradient;sample efficiency;planning;imitation learning
Issue Date: 16-Feb-2022
Publisher: Elsevier
Citation: Gao, S. et al. (2022) ‘Deterministic policy optimization with clipped value expansion and long-horizon planning’, Neurocomputing, 483, pp. 299 - 310. doi:10.1016/j.neucom.2022.02.022.
URI: https://bura.brunel.ac.uk/handle/2438/24394
DOI: https://doi.org/10.1016/j.neucom.2022.02.022
ISSN: 0925-2312
Appears in Collections:Dept of Computer Science Embargoed Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfEmbargoed until 16 Feb 20242.84 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.