I see.
No, I don’t know how to balance it well. Maybe need to try SGDRestart on few different models with different max LR to see if any rule can be derived.
I see.
No, I don’t know how to balance it well. Maybe need to try SGDRestart on few different models with different max LR to see if any rule can be derived.