Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30647
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wang, C | - |
dc.contributor.author | Wang, M | - |
dc.contributor.author | Wang, A | - |
dc.contributor.author | Zhang, X | - |
dc.contributor.author | Zhang, J | - |
dc.contributor.author | Ma, H | - |
dc.contributor.author | Yang, N | - |
dc.contributor.author | Zhao, Z | - |
dc.contributor.author | Lai, CS | - |
dc.contributor.author | Lai, LL | - |
dc.date.accessioned | 2025-02-03T11:19:48Z | - |
dc.date.available | 2025-02-03T11:19:48Z | - |
dc.date.issued | 2024-12-11 | - |
dc.identifier | ORCiD: Can Wang https://orcid.org/0000-0002-5892-253X | - |
dc.identifier | ORCiD: Xiaojia Zhang https://orcid.org/0009-0007-5024-5363 | - |
dc.identifier | ORCiD: Zhuoli Zhao https://orcid.org/0000-0003-2531-0614 | - |
dc.identifier | ORCiD: Chun Sing Lai https://orcid.org/0000-0002-4169-4438 | - |
dc.identifier | ORCiD: Loi Lei Lai https://orcid.org/0000-0003-4786-7931 | - |
dc.identifier | 134165 | - |
dc.identifier.citation | Wang, C. et al. (2024) 'Multiagent deep reinforcement learning-based cooperative optimal operation with strong scalability for residential microgrid clusters', Energy, 314, 134165, pp. 1 - 14. doi: 10.1016/j.energy.2024.134165. | en_US |
dc.identifier.issn | 0360-5442 | - |
dc.identifier.uri | https://bura.brunel.ac.uk/handle/2438/30647 | - |
dc.description | Data availability: The authors do not have permission to share data. | en_US |
dc.description.abstract | With the rapid development of smart home technology, residential microgrid (RM) clusters have become an important way to utilize the demand-side resources of large-scale housing. However, there are some key problems in existing RM cluster optimization methods, such as difficult in adapting to the local observable environment and with poor privacy and scalability. Therefore, this paper proposes a multi-agent deep reinforcement learning (MADRL)-based RM cluster optimization operation method. First, with the aim of minimizing the energy cost of each residence while satisfying the comfort level of residents and avoiding transformer overload, the optimization scheduling problem of an RM cluster is described as a Markov game with an unknown state transition probability function. Then, a novel MADRL method is proposed to determine the optimal operation strategy of multiple RMs in this game paradigm. Each agent in the proposed method contains a collective strategy model and an independent learner. The collective strategy model can simulate the energy consumption of other RMs in the system and reflect its operating behavior. In addition, an independent learner based on a soft actor-critic (SAC) framework is used to learn the optimal scheduling strategy interactively with the environment. The proposed method has a completely decentralized and scalable structure, which can deal with continuous high-dimensional state and action spaces only requires local observations and approximations during training. Finally, a numerical example is given to verify that the proposed method can not only learn a stable cooperative energy management strategy but can also be extended to large-scale RM cluster problems. This gives the strong scalability and a high potential for practical application. | en_US |
dc.description.sponsorship | This work was supported in part by the National Natural Science Foundation of China under Grant 52107108. | en_US |
dc.format.extent | 1 - 14 | - |
dc.format.medium | Print-Electronic | - |
dc.language | English | - |
dc.language.iso | en_US | en_US |
dc.publisher | Elsevier | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | - |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | - |
dc.subject | optimal operation | en_US |
dc.subject | residential microgrid (RM) | en_US |
dc.subject | deep reinforcement learning (DRL) | en_US |
dc.subject | multi-agent systems | en_US |
dc.title | Multiagent deep reinforcement learning-based cooperative optimal operation with strong scalability for residential microgrid clusters | en_US |
dc.type | Article | en_US |
dc.identifier.doi | https://doi.org/10.1016/j.energy.2024.134165 | - |
dc.relation.isPartOf | Energy | - |
pubs.publication-status | Published | - |
pubs.volume | 314 | - |
dc.identifier.eissn | 1873-6785 | - |
dc.rights.license | https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.en | - |
dcterms.dateAccepted | 2024-12-07 | - |
dc.rights.holder | Elsevier Ltd. | - |
Appears in Collections: | Dept of Electronic and Electrical Engineering Embargoed Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Embargoed until 11 December 2025. Copyright © 2024 Elsevier Ltd. All rights reserved. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/ (see: https://www.elsevier.com/about/policies/sharing). | 2.05 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License