1华南理工大学电力学院,广东省广州市 510641;2南方电网能源发展研究院有限责任公司,广东省广州市 510663
在新型电力系统中,新能源可快速调节功率,具有参与线路过载紧急控制的潜力。然而,引入该措施后,基于深度强化学习的紧急控制策略生成方法面临决策空间过大、求解复杂度高的挑战。为此,提出一种决策空间与策略模型动态迭代的紧急控制混合学习方法。首先,构建包含控制地点网络和控制量网络的双网络模型,设计针对两个网络的迭代学习框架;其次,提出控制地点网络及学习要点,设计基于灵敏度的样本生成方法,学习控制地点网络;然后,提出控制量网络深度强化学习方法,设计分段探索策略,高效学习控制量网络;接着,提出控制量网络和控制地点网络的动态迭代实施流程;最后,在IEEE 39节点、IEEE 300节点系统以及中国某省级电网中验证了所提方法的有效性。
国家自然科学基金企业创新发展联合基金集成项目(U22B6007);国家自然科学基金资助项目(52277101);中央高校基本科研业务费专项资金资助项目(2024ZYGXZR109)。
张寿志(2000—),男,博士研究生,主要研究方向:电力系统紧急控制与决策优化、电力系统小扰动稳定性分析与决策调节、强化学习在电力系统的应用。E-mail:202411083518 @mail.scut.edu.cn
陈戈(1995—),男,博士,主要研究方向:电力系统紧急控制与调度决策、连锁故障紧急控制、强化学习在电力系统的应用、云计算与任务调度。E-mail:epchenge@163.com
张俊勃(1986—),男,通信作者,博士,教授,博士生导师,主要研究方向:新型电力系统稳定性、数字电网智能化应用、大型电力软件系统、人工智能在大型软件工程的应用。E-mail:epjbzhang@scut.edu.cn
1School of Electric Power Engineering, South China University of Technology, Guangzhou 510641, China;2Energy Development Research Institute, China Southern Power Grid, Guangzhou 510663, China
Renewable energy can rapidly regulate the power in the new power system, demonstrating the potential of participating in overload emergency control for lines. However, when it is adopted, generation methods for the emergency control strategy based on deep reinforcement learning face the challenges of excessively large decision space and high solution complexity. To address this issue, a hybrid learning method for emergency control with dynamic iteration of decision space and strategy model is proposed. First, a dual-network model comprising a control location network and a control value network is constructed, and an iterative learning framework for both networks is designed. Second, the control location network and its learning objectives are introduced, and a sensitivity-based sample generation method is designed to learn the control location network. Then, a deep reinforcement learning method for the control value network is proposed, and a segmented exploration strategy is designed for efficient learning of the control value network. Next, a dynamic iteration implementation process between the control value network and control location network is designed. Finally, the effectiveness of the proposed method is validated in the IEEE 39-bus system, IEEE 300-bus system, and a provincial power grid of China.
| [1] | 张寿志,陈戈,张俊勃,等.采用决策空间与策略模型动态迭代的线路过载紧急控制混合学习[J].电力系统自动化,2026,50(10):59-72. ZHANG Shouzhi, CHEN Ge, ZHANG Junbo, et al. Hybrid Learning for Line Overload Emergency Control with Dynamic Iteration of Decision Space and Strategy Model[J]. Automation of Electric Power Systems, 2026, 50(10):59-72. |