-
Notifications
You must be signed in to change notification settings - Fork 88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
error in adaptive instance normalization in residual blocks of generator (decoder) ? #12
Comments
I may not be able to understand your question because I am not good at English. To answer your question, adain_params is gamma and beta values obtained from MLP. (The author code is mean, std, but the article is written as gamma, beta.) Use adain_params for the residual block value using AdaptiveInstanceNorm2d in weight, bias At this time, gamma and beta are not different for each layer. If you do not understand, you can ask me again, I'm fine. |
I think original author's code has not different gamma, beta.. SAME ! Because |
Thanks for your reply! Perhaps I am missing something...here is how I understood it: In the original author code, the num_adain_params returned by get_num_adain_params function is 4096. That is, there are total 4096 parameters for adaIN operation. These 4096 parameters in the original author code represent the total number of gamma and beta values of each of the 256 feature maps of 4 residual blocks of the decoder. In your current code, the number of adaIN parameters are 256 for gamma + 256 for beta (total 512 adain_params). So I was a bit confused by this difference (512 vs 4096 adaIN parameters). Perhaps 512 is the intended choice of your implementation. |
Hi, thanks for this Tensorflow implementation!
I was wondering if there is an error in the way adaptive instance normalization is performed in the generator (line 135 in MUNIT.py)?
In the original code by the authors, it seems that they compute mu and sigma for each of the layers of residual blocks of decoder (see assign_adain_params - line 130 in networks.py ). But in your code, it seems like mu and sigma are the same across all of the residual blocks. Could you please clarify?
The text was updated successfully, but these errors were encountered: