You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for sharing the code! I read the paper and also checked the code. I’m currently trying to adopt Diffmask to another dataset and have some questions regarding the code:
What is alpha (defined in sentiment_classification_sst_diffmask.pyBertSentimentClassificationSSTDiffMask), is that Lagrangian multiplier mentioned in Eq(3) in the paper?
In SentimentClassificationSSTDiffMask, What is the expected_L0 in loss_g, why expected_L0 is negative? The negative value of expected_L0 makes loss_g negative. Is that correct?
I also don’t understand log_expected_L0() function in distributions.py. Can I find an explanation for this in the paper?
During the training step, you also calculate l0 (l0 = (expected_L0.exp() * mask).sum(-1) / mask.sum(-1)), what is this for, is this used for training?
Again, thanks for the wonderful work. Look forward to your reply!
The text was updated successfully, but these errors were encountered:
Thanks for sharing the code! I read the paper and also checked the code. I’m currently trying to adopt Diffmask to another dataset and have some questions regarding the code:
What is
alpha
(defined insentiment_classification_sst_diffmask.py
BertSentimentClassificationSSTDiffMask
), is that Lagrangian multiplier mentioned in Eq(3) in the paper?In
SentimentClassificationSSTDiffMask
, What is theexpected_L0
inloss_g
, whyexpected_L0
is negative? The negative value ofexpected_L0
makesloss_g
negative. Is that correct?I also don’t understand
log_expected_L0()
function indistributions.py
. Can I find an explanation for this in the paper?During the training step, you also calculate
l0 (l0 = (expected_L0.exp() * mask).sum(-1) / mask.sum(-1))
, what is this for, is this used for training?Again, thanks for the wonderful work. Look forward to your reply!
The text was updated successfully, but these errors were encountered: