Multiagent deep reinforcement learning-based cooperative optimal operation with strong scalability for residential microgrid clusters

Wang, C; Wang, M; Wang, A; Zhang, X; Zhang, J; Ma, H; Yang, N; Zhao, Z; Lai, CS; Lai, LL

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/30647

Full metadata record

DC Field	Value	Language
dc.contributor.author	Wang, C	-
dc.contributor.author	Wang, M	-
dc.contributor.author	Wang, A	-
dc.contributor.author	Zhang, X	-
dc.contributor.author	Zhang, J	-
dc.contributor.author	Ma, H	-
dc.contributor.author	Yang, N	-
dc.contributor.author	Zhao, Z	-
dc.contributor.author	Lai, CS	-
dc.contributor.author	Lai, LL	-
dc.date.accessioned	2025-02-03T11:19:48Z	-
dc.date.available	2025-02-03T11:19:48Z	-
dc.date.issued	2024-12-11	-
dc.identifier	ORCiD: Can Wang https://orcid.org/0000-0002-5892-253X	-
dc.identifier	ORCiD: Xiaojia Zhang https://orcid.org/0009-0007-5024-5363	-
dc.identifier	ORCiD: Zhuoli Zhao https://orcid.org/0000-0003-2531-0614	-
dc.identifier	ORCiD: Chun Sing Lai https://orcid.org/0000-0002-4169-4438	-
dc.identifier	ORCiD: Loi Lei Lai https://orcid.org/0000-0003-4786-7931	-
dc.identifier	134165	-
dc.identifier.citation	Wang, C. et al. (2024) 'Multiagent deep reinforcement learning-based cooperative optimal operation with strong scalability for residential microgrid clusters', Energy, 314, 134165, pp. 1 - 14. doi: 10.1016/j.energy.2024.134165.	en_US
dc.identifier.issn	0360-5442	-
dc.identifier.uri	https://bura.brunel.ac.uk/handle/2438/30647	-
dc.description	Data availability: The authors do not have permission to share data.	en_US
dc.description.abstract	With the rapid development of smart home technology, residential microgrid (RM) clusters have become an important way to utilize the demand-side resources of large-scale housing. However, existing RM cluster optimization methods have some key problems, such as difficulty in adapting to a locally observable environment, poor privacy, and poor scalability. Therefore, this paper proposes a multi-agent deep reinforcement learning (MADRL)-based RM cluster optimization operation method. First, with the aim of minimizing the energy cost of each residence while satisfying the comfort level of residents and avoiding transformer overload, the optimization scheduling problem of an RM cluster is described as a Markov game with an unknown state transition probability function. Then, a novel MADRL method is proposed to determine the optimal operation strategy of multiple RMs in this game paradigm. Each agent in the proposed method contains a collective strategy model and an independent learner. The collective strategy model can simulate the energy consumption of the other RMs in the system and reflect their operating behavior. In addition, an independent learner based on a soft actor-critic (SAC) framework is used to learn the optimal scheduling strategy interactively with the environment. The proposed method has a completely decentralized and scalable structure, which can deal with continuous high-dimensional state and action spaces and requires only local observations and approximations during training. Finally, a numerical example is given to verify that the proposed method can not only learn a stable cooperative energy management strategy but can also be extended to large-scale RM cluster problems. This gives the method strong scalability and high potential for practical application.	en_US
dc.description.sponsorship	This work was supported in part by the National Natural Science Foundation of China under Grant 52107108.	en_US
dc.format.extent	1 - 14	-
dc.format.medium	Print-Electronic	-
dc.language	English	-
dc.language.iso	en_US	en_US
dc.publisher	Elsevier	en_US
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	-
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/	-
dc.subject	optimal operation	en_US
dc.subject	residential microgrid (RM)	en_US
dc.subject	deep reinforcement learning (DRL)	en_US
dc.subject	multi-agent systems	en_US
dc.title	Multiagent deep reinforcement learning-based cooperative optimal operation with strong scalability for residential microgrid clusters	en_US
dc.type	Article	en_US
dc.identifier.doi	https://doi.org/10.1016/j.energy.2024.134165	-
dc.relation.isPartOf	Energy	-
pubs.publication-status	Published	-
pubs.volume	314	-
dc.identifier.eissn	1873-6785	-
dc.rights.license	https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.en	-
dcterms.dateAccepted	2024-12-07	-
dc.rights.holder	Elsevier Ltd.	-
Appears in Collections:	Dept of Electronic and Electrical Engineering Embargoed Research Papers

Files in This Item:

File	Description	Size	Format
FullText.pdf	Embargoed until 11 December 2025. Copyright © 2024 Elsevier Ltd. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/ (see: https://www.elsevier.com/about/policies/sharing).	2.05 MB	Adobe PDF

This item is licensed under a Creative Commons License