You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for your excellent work, I can't achieve the effect in your paper in the process of reproducing the compression of the swin-transformer model, in detail, I use the swin model you defined to train the teacher model on my own dataset, but the accuracy has not been up, in addition, I also use my own teacher model to distill directly, the accuracy can not go up, what is going on? Thank you very much!
The text was updated successfully, but these errors were encountered:
Do you use your own teacher model for distillation? or do you have to use the swin-transformer base model generated by your code to train on the dataset and then distill it as a teacher model?Thank you very much!
Thank you for your excellent work, I can't achieve the effect in your paper in the process of reproducing the compression of the swin-transformer model, in detail, I use the swin model you defined to train the teacher model on my own dataset, but the accuracy has not been up, in addition, I also use my own teacher model to distill directly, the accuracy can not go up, what is going on? Thank you very much!
The text was updated successfully, but these errors were encountered: