You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@sahiliitm Hello, in the paper, it is stated " We used the same preprocessing and feedforward CNN architecture used by (Mnih et al., 2016)". So it should be the traditional two layers FF.
In the code here we have just z-normalization for the inputs, no virtual batch norm.
Also no hyperparameters for Atari. OpenAI, please be more reproducible! :-)
If I understand the code correctly, it uses virtual batch normalization only for the inputs and not for the intermediate layers.
Was this done in the Atari context for getting the results stated in the paper?
Also, what was the network architecture used for the Atari domain?
The text was updated successfully, but these errors were encountered: