Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin

Phankamolsil Y.; Rittima A.; Sawangphol W.; Kraisangka J.; Tabucanon A.S.; Talaluxmana Y.; Vudhivanich V.

Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin

1

Issued Date

2025-04-01

Resource Type

Article

ISSN

23636203

eISSN

23636211

DOI

10.1007/s40808-024-02265-z

Scopus ID

2-s2.0-85218255950

Journal Title

Modeling Earth Systems and Environment

Volume

11

Issue

2

Rights Holder(s)

SCOPUS

Bibliographic Citation

Modeling Earth Systems and Environment Vol.11 No.2 (2025)

Suggested Citation

Phankamolsil Y., Rittima A., Sawangphol W., Kraisangka J., Tabucanon A.S., Talaluxmana Y., Vudhivanich V. Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin. Modeling Earth Systems and Environment Vol.11 No.2 (2025). doi:10.1007/s40808-024-02265-z Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/105466

Title

Deep reinforcement learning for multiple reservoir operation planning in the Chao Phraya River Basin

Author(s)

Phankamolsil Y.
Rittima A.
Sawangphol W.
Kraisangka J.
Tabucanon A.S.
Talaluxmana Y.
Vudhivanich V.

Author's Affiliation

Faculty of Environment and Resource Studies, Mahidol University
Kasetsart University
Mahidol University

Corresponding Author(s)

Phankamolsil Y.

Other Contributor(s)

Mahidol University

Abstract

This study demonstrates application of Deep Deterministic Policy Gradient (DDPG)-based algorithm to provide comprehensive and flexible plans for reservoir operation planning of the multiple reservoir system in the Chao Phraya River Basin (CPYRB), Thailand aiming to mitigate flood and drought risks in the region. The multi-agent-based Deep Reinforcement Learning (DRL) model is accordingly constructed considering 7-D predicted inflow, reservoir water released from adjacent reservoir, downstream flow condition, and changes in reservoir water storage, as state variables. The desired goal is to increase water storage levels in all reservoirs by 10–15% to ensure higher potential in supplying water for crop cultivation over the dry seasons and preventing flood occurrences during wet season. Simulation results from 2009 to 2022 indicate that DRL–DDPG-based algorithm can perform well in solving sequential decision problems for optimal operation of multiple reservoir system to achieve the desired water storage goal. It can offer realistic simulation results of seasonal and annual release schemes and reservoir release ratios among reservoirs in the system compared to actual operation and Fmincon and ANFIS optimizations. Importantly, DRL model demonstrates a significant advantage in view of increasing the long-term water storage levels in all reservoirs as targeted in the modelling process while maintaining the similar and consistent release schemes in the reservoir system. For the multipurpose multiple reservoir system operation, adjusting the dynamic desired goals within multi-agent-based RL model is advisable to attain the specific desired outcomes and address various water scenarios.

Keyword(s)

Earth and Planetary Sciences
Environmental Science
Agricultural and Biological Sciences
Decision Sciences

URI

https://repository.li.mahidol.ac.th/handle/123456789/105466

Collections

Scopus 2025

Full item page

Send Feedback

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th