Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin
1
Issued Date
2025-04-01
Resource Type
ISSN
23636203
eISSN
23636211
Scopus ID
2-s2.0-85218255950
Journal Title
Modeling Earth Systems and Environment
Volume
11
Issue
2
Rights Holder(s)
SCOPUS
Bibliographic Citation
Modeling Earth Systems and Environment Vol.11 No.2 (2025)
Suggested Citation
Phankamolsil Y., Rittima A., Sawangphol W., Kraisangka J., Tabucanon A.S., Talaluxmana Y., Vudhivanich V. Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin. Modeling Earth Systems and Environment Vol.11 No.2 (2025). doi:10.1007/s40808-024-02265-z Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/105466
Title
Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin
Corresponding Author(s)
Other Contributor(s)
Abstract
This study demonstrates application of Deep Deterministic Policy Gradient (DDPG)-based algorithm to provide comprehensive and flexible plans for reservoir operation planning of the multiple reservoir system in the Chao Phraya River Basin (CPYRB), Thailand aiming to mitigate flood and drought risks in the region. The multi-agent-based Deep Reinforcement Learning (DRL) model is accordingly constructed considering 7-D predicted inflow, reservoir water released from adjacent reservoir, downstream flow condition, and changes in reservoir water storage, as state variables. The desired goal is to increase water storage levels in all reservoirs by 10–15% to ensure higher potential in supplying water for crop cultivation over the dry seasons and preventing flood occurrences during wet season. Simulation results from 2009 to 2022 indicate that DRL–DDPG-based algorithm can perform well in solving sequential decision problems for optimal operation of multiple reservoir system to achieve the desired water storage goal. It can offer realistic simulation results of seasonal and annual release schemes and reservoir release ratios among reservoirs in the system compared to actual operation and Fmincon and ANFIS optimizations. Importantly, DRL model demonstrates a significant advantage in view of increasing the long-term water storage levels in all reservoirs as targeted in the modelling process while maintaining the similar and consistent release schemes in the reservoir system. For the multipurpose multiple reservoir system operation, adjusting the dynamic desired goals within multi-agent-based RL model is advisable to attain the specific desired outcomes and address various water scenarios.
