Fix atari wrapper (affecting a2c perf) and pposgd mujoco performance
- removed vf clipping in pposgd: that was severely degrading performance on mujoco because it didn’t account for scale of returns
- switched adam epsilon in pposgd_simple
- brought back no-ops in atari wrapper (oops)
- added readmes
- revamped run_X_benchmark scripts to have standard form
- cleaned up DDPG a little, removed deprecated SimpleMonitor and non-idiomatic usage of logger
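
To illustrate why value-function (vf) clipping is sensitive to the scale of returns, here is a minimal NumPy sketch. It assumes the common PPO-style formulation in which the new value prediction is clipped to within `clip_range` of the old prediction and the loss takes the larger of the clipped and unclipped squared errors. That formulation, along with the function names `clipped_value_loss` and `unclipped_value_loss` and the numbers used, is an assumption for illustration and may not match the code removed in this commit.

```python
import numpy as np


def clipped_value_loss(vpred, old_vpred, returns, clip_range):
    """Assumed PPO-style clipped value loss (illustrative, not the removed code)."""
    # Keep the new prediction within clip_range of the old prediction.
    vpred_clipped = old_vpred + np.clip(vpred - old_vpred, -clip_range, clip_range)
    loss_unclipped = np.square(vpred - returns)
    loss_clipped = np.square(vpred_clipped - returns)
    # Pessimistic choice: take the larger of the two errors per sample.
    return 0.5 * np.mean(np.maximum(loss_unclipped, loss_clipped))


def unclipped_value_loss(vpred, returns):
    """Plain squared-error value loss, with no clipping."""
    return 0.5 * np.mean(np.square(vpred - returns))


# Hypothetical numbers: returns on the order of 1000, a clip range of 0.2.
old_vpred = np.zeros(4)
returns = np.full(4, 1000.0)
clip_range = 0.2

small_step = np.full(4, 0.2)    # moves exactly to the clip boundary
large_step = np.full(4, 500.0)  # moves halfway to the target

# The clipped loss does not distinguish the two predictions.
print(clipped_value_loss(small_step, old_vpred, returns, clip_range))
print(clipped_value_loss(large_step, old_vpred, returns, clip_range))

# The unclipped loss rewards the prediction that is closer to the target.
print(unclipped_value_loss(small_step, returns))
print(unclipped_value_loss(large_step, returns))
```

Under these assumptions, once a prediction has moved more than `clip_range` toward the target, the clipped term dominates and no longer depends on the prediction, so the value function can improve by at most about `clip_range` per round of updates. A fixed clip range of this size is negligible when returns are in the hundreds or thousands, which is consistent with the degradation described above.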
baselines/a2c/README.md
0 → 100644
baselines/acktr/README.md
0 → 100644
baselines/ddpg/README.md
0 → 100644
baselines/ppo1/README.md
0 → 100644
baselines/trpo_mpi/README.md
0 → 100644