- 28 Aug, 2017 1 commit
-
-
John Schulman authored
- removed vf clipping in pposgd - that was severely degrading performance on mujoco because it didn’t account for scale of returns
- switched adam epsilon in pposgd_simple
- brought back no-ops in atari wrapper (oops)
- added readmes
- revamped run_X_benchmark scripts to have standard form
- cleaned up DDPG a little, removed deprecated SimpleMonitor and non-idiomatic usage of logger
-
- 19 Aug, 2017 1 commit
-
-
John Schulman authored
-
- 28 Jul, 2017 1 commit
-
-
Matthias Plappert authored
* Export param noise
* Update documentation
* Final finishing touches
-
- 24 Jul, 2017 1 commit
-
-
Peter Welinder authored
-
- 21 Jul, 2017 3 commits
-
-
Jonas Schneider authored
-
John Schulman authored
ppo and trpo
-
John Schulman authored
-
- 13 Jul, 2017 4 commits
-
-
Szymon Sidor authored
-
Szymon Sidor authored
-
Szymon Sidor authored
Effectively apply weights from the replay buffer
-
Szymon Sidor authored
Fix TF graph variables deprecation
-
- 11 Jul, 2017 1 commit
-
-
Fernando Arbeiza authored
It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there a reason for that, or am I missing something? In any case, I have added a parameter so that they can be used, in case it is useful.
-
- 28 Jun, 2017 1 commit
-
-
MironencoMircea authored
-
- 24 Jun, 2017 3 commits
-
-
Szymon Sidor authored
remove unnecessary initialization of variable resized_screen
-
Szymon Sidor authored
Update setup.py
-
Szymon Sidor authored
Fix README since BreakOut pretrained model doesn't match the correct …
-
- 16 Jun, 2017 1 commit
-
-
cxx authored
Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead.
-
- 09 Jun, 2017 1 commit
-
-
Zach Dwiel authored
-
- 07 Jun, 2017 1 commit
-
-
Zichuan Lin authored
-
- 04 Jun, 2017 4 commits
-
-
Szymon Sidor authored
docstring and comment fixes
-
Szymon Sidor authored
Update README.md
-
Szymon Sidor authored
Consistent initial type (float) for episode_rewards
-
Szymon Sidor authored
Update setup.py
-
- 02 Jun, 2017 1 commit
-
-
ngc92 authored
-
- 31 May, 2017 1 commit
-
-
Tiago Carvalho authored
-
- 30 May, 2017 2 commits
-
-
Quan Vuong authored
-
Yuxin Wu authored
`PongNoFrameskip-v4` seems to require `gym>=0.9.1`
-
- 29 May, 2017 2 commits
-
-
Szymon Sidor authored
Fix typos
-
YenChenLin authored
-
- 27 May, 2017 1 commit
-
-
Szymon Sidor authored
tf_util: spelling / docstring updates
-
- 26 May, 2017 2 commits
-
-
Aaron Morris authored
-
Szymon Sidor authored
-
- 25 May, 2017 2 commits
-
-
Szymon Sidor authored
Correct path to script "download_model"
-
Olivier Moindrot authored
`python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model`
-
- 24 May, 2017 1 commit
-
-
Szymon Sidor authored
-