1. 28 Aug, 2017 1 commit
    • John Schulman's avatar
      Fix atari wrapper (affecting a2c perf) and pposgd mujoco performance · d9f194f7
      John Schulman authored
      - removed vf clipping in pposgd - that was severely degrading performance on mujoco because it didn’t account for scale of returns
      - switched adam epsilon in pposgd_simple
      - brought back no-ops in atari wrapper (oops)
      - added readmes
      - revamped run_X_benchmark scripts to have standard form
      - cleaned up DDPG a little, removed deprecated SimpleMonitor and non-idiomatic usage of logger
      d9f194f7
  2. 19 Aug, 2017 1 commit
  3. 28 Jul, 2017 1 commit
  4. 24 Jul, 2017 1 commit
  5. 21 Jul, 2017 3 commits
  6. 13 Jul, 2017 4 commits
  7. 11 Jul, 2017 1 commit
    • Fernando Arbeiza's avatar
      Effectively apply weights from the replay buffer · d76cd129
      Fernando Arbeiza authored
      It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there any reason for that or am I missing something?
      
      In any case, I have added a parameter in order for them to be used; just in case it is useful.
      d76cd129
  8. 28 Jun, 2017 1 commit
  9. 24 Jun, 2017 3 commits
  10. 16 Jun, 2017 1 commit
  11. 09 Jun, 2017 1 commit
  12. 07 Jun, 2017 1 commit
  13. 04 Jun, 2017 4 commits
  14. 02 Jun, 2017 1 commit
  15. 31 May, 2017 1 commit
  16. 30 May, 2017 2 commits
  17. 29 May, 2017 2 commits
  18. 27 May, 2017 1 commit
  19. 26 May, 2017 2 commits
  20. 25 May, 2017 2 commits
  21. 24 May, 2017 1 commit