How to train model-based reinforcement learning agents
Create and train model-based policy optimization (MBPO) agents. An MBPO agent uses neural networks to internally approximate the environment. This reusable internal model allows for a greater sample efficiency compared to a typical model-free agent.
You can also select a web site from the following list
How to Get Best Site Performance
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.