Abstract: In this article, we propose a novel variant of path integral policy improvement with covariance matrix adaptation (PI2-CMA), which is a reinforcement learning (RL) algorithm that aims to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results