Custom environment in Deep reinforcement learning

6 views (last 30 days)
I am currently trying to buid to a custom environment for the implementation of deep reinforcement learning. My considered environment has 4 states low, med, high, severe represented by 1,2,3,4 respectively and the actions to be taken are 1,2,3 and rewards are decided on the basis of context like temperature, pressure,humidity which varies with time. So how i can define my reward that changes with time in mystepfunction?

Answers (1)

Ari Biswas
Ari Biswas on 20 Apr 2020
One way to solve this is by introducing a property to keep track of elapsed time in your custom MATLAB environment. You can use this property to compute rewards and increment this as needed in the step function.
  1 Comment
SULAKSHNA DEVI
SULAKSHNA DEVI on 13 May 2020
The property here refers to function. Can you please provide explanation on this

Sign in to comment.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!