How to train RL-DQN agent with varying environment?

Question

Praveen Kumar Nambisan T M on 23 Jun 2021

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/863085-how-to-train-rl-dqn-agent-with-varying-environment

Edited: Jillian Eunice Oliveros on 25 Oct 2021

Accepted Answer: Emmanouil Tzorakoleftherakis

RL HEV.pdf

The question is related to reinforcement learning based energy management in hybrid electic vehicle (HEV). I am considering DQN-RL for this work. The actions are the control variable for the energy management system which controls the fuel-rate.

In this case, my environment is an HEV with particular driving profile (UDDS). The objective is to train the agent for the energy management system to achieve the final fuel target (desired fuel) at the end of the drivecycle. However, I want to train a single agent for multiple drive profile to achieve the same target in all the cases.

The problem formulation is similar to the paper: Reference paper

I could train the agent for one driving profile, how to train the same agent for multiple profiles?

Note: The reference paper could help to clarify the exact problem. They have trained the agent for 5 driving profile to achieve same desired SOC.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Emmanouil Tzorakoleftherakis on 24 Jun 2021

2
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/863085-how-to-train-rl-dqn-agent-with-varying-environment#answer_732465

What you are describing is actually pretty standard process to create robust policies. To change the driving profiles, you can use the reset function in your MATLAB/Simulink environment definition.

A simple example is here (take a look at the Reset function at the bottom).