
Training param tuning


Training parameters in the prototxt

There are two kinds of data augmentation method, suited to different applications:

https://github.com/eric612/MobileNet-YOLO/issues/29

For example, I would choose adaptive aspect ratio for fisheye video, which introduces pixel-level geometric distortion.

Keep aspect ratio

  • Set the preprocessing resize mode to "FIT_LARGE_SIZE_AND_PAD" (see the sketch after this list)
  • Remove all expand_param blocks
  • At inference time, also use the "FIT_LARGE_SIZE_AND_PAD" resize mode
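
A minimal sketch of the matching transform_param block, assuming the SSD-style resize_param fields this fork inherits; the 416×416 input size and pad value are illustrative assumptions, not values from this page:

```
transform_param {
  resize_param {
    prob: 1.0                            # always apply this resize
    resize_mode: FIT_LARGE_SIZE_AND_PAD  # keep aspect ratio, pad the remainder
    height: 416                          # illustrative network input size
    width: 416
    pad_mode: CONSTANT
    pad_value: 127.5                     # illustrative gray padding
    interp_mode: LINEAR
  }
}
```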

Adaptive aspect ratio

This mode may break the k-means anchor assignments; it cost about 1% accuracy in my tests.

  • Set the preprocessing resize mode to "WARP"

  • Set the expand parameter per dataset, e.g. {VOC: 4.0, COCO: 1.5, ...} (see the sketch after this list)

  • At inference time, also use the "WARP" resize mode

  • For advanced tuning, modify the jitter code: uncomment

    caffe_rng_uniform(1, 1.0f - jitter, 1.0f, &img_h);

    and comment out

    img_h = img_w;
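
A minimal sketch of the matching transform_param for the VOC setting, assuming the SSD-style expand_param and resize_param fields; the expansion probability and input size are illustrative assumptions:

```
transform_param {
  expand_param {
    prob: 0.5              # illustrative: expand half of the training samples
    max_expand_ratio: 4.0  # VOC value from the list above (COCO: 1.5)
  }
  resize_param {
    prob: 1.0
    resize_mode: WARP      # stretch to the network input, ignoring aspect ratio
    height: 416            # illustrative network input size
    width: 416
    interp_mode: LINEAR
  }
}
```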

Warm-up training

If the solver type is set to "SGD", you may need to set the learning rate policy like this:
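
The original page illustrated the policy at this point; below is a minimal sketch assuming the "multifixed" learning rate policy found in this fork's solver files (a fixed rate per stage, switching at the listed iterations). All values are illustrative: the point is the small first-stage rate that warms the network up before the normal rate kicks in.

```
# solver.prototxt (sketch; values are illustrative, not from this page)
type: "SGD"
lr_policy: "multifixed"  # assumed fork-specific policy: fixed lr per stage
stagelr: 0.0001          # warm-up: small rate for the first iterations
stagelr: 0.001           # normal training rate
stagelr: 0.0001          # first decay
stagelr: 0.00001         # final decay
stageiter: 1000
stageiter: 40000
stageiter: 55000
stageiter: 65000
momentum: 0.9
weight_decay: 0.0005
max_iter: 65000
```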

Pre-trained weights and batch size

total_batch_size = iter_size * batch_size

Depending on which pre-trained weights you start from:

  • Classification model (e.g. ImageNet)

    set total_batch_size to at least 64

  • Detection model (e.g. MS-COCO)

    set total_batch_size to at least 16 for PASCAL-VOC

    set total_batch_size to at least 32, and preferably 64, for MS-COCO (one way to reach these totals on a single GPU is sketched below)
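
A minimal sketch of how the two factors combine, assuming the standard Caffe iter_size solver field and the data layer's batch_size; the values are illustrative:

```
# In the training data layer (train.prototxt):
data_param {
  batch_size: 8  # images per forward/backward pass, bounded by GPU memory
}

# In the solver (solver.prototxt):
iter_size: 8     # accumulate gradients over 8 passes before each update
# total_batch_size = iter_size * batch_size = 8 * 8 = 64
```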
