Answer:
Increase the learning rate after each mini-batch by multiplying it by a constant factor slightly greater than 1, so that the rate grows exponentially from one mini-batch to the next.
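As a rough illustration of the multiplicative update, here is a minimal sketch in plain Python. The function name `lr_schedule` and the parameters `start_lr`, `end_lr`, and `num_batches` are assumptions made for this example and do not come from the answer. The sketch assumes you want to sweep from a small starting rate to a larger final rate over a fixed number of mini-batches.

```python
def lr_schedule(start_lr, end_lr, num_batches):
    """Return the per-batch multiplier and the learning rate used for each mini-batch.

    Hypothetical helper: the names and the choice of end points are
    illustrative assumptions, not part of the original answer.
    """
    # Constant factor (> 1 when end_lr > start_lr) that takes start_lr
    # to end_lr in num_batches - 1 multiplications.
    factor = (end_lr / start_lr) ** (1.0 / (num_batches - 1))

    rates = []
    lr = start_lr
    for _ in range(num_batches):
        rates.append(lr)  # rate used for the current mini-batch
        lr *= factor      # multiply by the constant after each mini-batch
    return factor, rates


factor, rates = lr_schedule(start_lr=1e-5, end_lr=1.0, num_batches=100)
```

In a real training loop you would apply `rates[i]` to the optimizer before processing mini-batch `i`, and typically record the loss alongside it.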