Reinforcement Learning experience buffer length and parallelisation toolbox

Question

Tech Logg Ding on 2 Dec 2020

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox

Edited: Emmanouil Tzorakoleftherakis on 3 Dec 2020

Accepted Answer: Emmanouil Tzorakoleftherakis

When parallelisation is used when training a DDPG agent with the following settings:

trainOpts.UseParallel = true;
trainOpts.ParallelizationOptions.Mode = 'async';
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = -1;
trainOpts.ParallelizationOptions.DataToSendFromWorkers = 'Experiences';

Does the the parallel simulations have their own experience buffer? This could take up more memory hence I am hoping that only one experience buffer is stored to update the critic network.

From the documentations, it seems like there will only be one experience buffer as the experiences are sent back to the host.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Emmanouil Tzorakoleftherakis on 3 Dec 2020

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox#answer_564503

Edited: Emmanouil Tzorakoleftherakis on 3 Dec 2020

Hello,

There is one big experience buffer on the host, the size of which you determine as usual in your agent options. Each worker has a much smaller buffer to collect experiences until you reach "StepsUntilDataIsSent".

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Reinforcement Learning experience buffer length and parallelisation toolbox

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

0 Comments
Show -2 older commentsHide -2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Reinforcement Learning experience buffer length and parallelisation toolbox

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

0 Comments Show -2 older commentsHide -2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments