Hierarchical Intermittent Motor Control with Deterministic Policy Gradient

Shi, H; Sun, Y; Li, G; Wang, F; Wang, D; Li, J

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/18020

Title:	Hierarchical Intermittent Motor Control with Deterministic Policy Gradient
Authors:	Shi, H; Sun, Y; Li, G; Wang, F; Wang, D; Li, J
Keywords:	hierarchical reinforcement learning;intermittent control;deterministic policy gradient;continuous action control;motor control
Issue Date:	19-Mar-2019
Publisher:	IEEE
Citation:	Shi, H., Sun, Y., Li, G., Wang, F., Wang, D. and Li, J. (2019) 'Hierarchical Intermittent Motor Control With Deterministic Policy Gradient,' IEEE Access, 7, pp. 41799-41810. doi: 10.1109/ACCESS.2019.2904910.
Abstract:	There is evidence that neural motor control exploits hierarchical and intermittent representations. In this paper, we propose a hierarchical deep reinforcement learning (DRL) method that learns a continuous control policy across multiple levels by incorporating the minimum transition hypothesis, a principle from neuroscience. The control policies at the two levels of the hierarchy operate at different time scales. The high-level controller produces intermittent actions that set a sequence of goals for the low-level controller, which in turn executes basic skills modulated by those goals. Goal planning and the basic motor skills are trained jointly with the proposed algorithm, hierarchical intermittent deep deterministic policy gradient (HI-DDPG). The method is validated on two continuous control problems. The results show that it learns to temporally decompose compound tasks into sequences of basic motions with sparse transitions, and that it outperforms previous DRL methods that lack a hierarchical continuous representation.
Description:	© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI:	https://bura.brunel.ac.uk/handle/2438/18020
DOI:	https://doi.org/10.1109/ACCESS.2019.2904910
Appears in Collections:	Dept of Computer Science Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf		4.91 MB	Adobe PDF
