You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 10, 2023. It is now read-only.
I am running your code several time on the DD dataset and obtain different results than the one you present on Table 3 of your paper.
In particular I run 20 times this experiment, estimate the average and std but find out that the training is very random, with the std rising up to +-10%. Note that I used the same hyperparameters you provide in your paper and your google sheet (see issue #2).
I also tried with the ReduceLROnPlateau scheduler for the LR, but still have an std up to 5%.
How did you select your seed and how come there is such variation?