on-policy

On-policy RL (VPG, PPO) algorithms from scratch.

Commands

make install - Install dependencies.
make [train-ppo|train-vpg] - Train model.
make clean - Clean project artifacts.
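
For readers unfamiliar with the two algorithms named above, the following is a minimal sketch of the surrogate losses that VPG and PPO typically minimize. It is not taken from this repository: the function names `vpg_loss` and `ppo_clip_loss`, their signatures, and the `clip_eps=0.2` default are assumptions chosen for illustration, and the code uses only the Python standard library rather than whatever framework the project actually depends on.

```python
import math


def vpg_loss(log_probs, advantages):
    """Vanilla policy gradient surrogate: -mean(log pi(a|s) * A).

    Hypothetical helper for illustration; inputs are plain lists of floats.
    """
    total = sum(lp * adv for lp, adv in zip(log_probs, advantages))
    return -total / len(advantages)


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """PPO clipped surrogate: -mean(min(r * A, clip(r, 1 - eps, 1 + eps) * A)).

    r is the probability ratio pi_new(a|s) / pi_old(a|s), computed from
    log-probabilities. clip_eps=0.2 is an assumed default, not a value
    confirmed by this project.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps] for the clipped term.
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped objectives.
        total += min(ratio * adv, clipped * adv)
    return -total / len(advantages)
```

When the new and old policies agree, the ratio is 1 and the PPO loss reduces to the negative mean advantage. The clipping only takes effect once the ratio leaves the `[1 - clip_eps, 1 + clip_eps]` band in the direction that would otherwise increase the objective.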