Replies: 1 comment 1 reply
-
@shunmian you can't take hparams that are tuned for imagenet and expect them to work on a task that's closer to mnist. These RMSProp settings are unlikely to work on a smaller dataset, AdamW as an optimizer will be more forgiving. Also, disable model-ema until you get some result and then enable with a much shorter time-const (like 0.99 - 0.999). |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the bug
I have been trying to train hand writing digits with
efficientnet_b2
. However, the training is not running correctly.To Reproduce
The ./HWD/ folder has following structure:
When trainning finished, the log says "Best metric: 10.0 (epoch 0)"
The full training log is as following:
When I do inference with
It produce unexpected result:
What would be the possible cause of that?
The training data is here.
Beta Was this translation helpful? Give feedback.
All reactions