PPO reinforcement learning agent trained to walk in a simulated environment. One of my first RL projects — watching a policy learn to coordinate limbs from random noise.
PPO RL Agent trying to walk
PPO reinforcement learning agent trained to walk in a simulated environment. One of my first RL projects — watching a policy learn to coordinate limbs from random noise.