- Lower your learning rate; it may be too high.
- Use a regularization technique (e.g. L2 weight decay or dropout).
- Make sure each set (training, validation, and test) has sufficient samples, e.g. a 60/20/20 or 70/15/15 split for the training, validation, and test sets respectively.
- Perform k-fold cross-validation.
- Randomly shuffle the data before doing the split; this helps make sure the data distribution is nearly the same across the sets. If your data is in a datastore you can use the 'shuffle' function, otherwise you can use the 'randperm' function.
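The shuffling, splitting, and regularization suggestions above can be sketched in MATLAB as follows. This is a minimal sketch: the `data` table and the specific hyperparameter values are illustrative assumptions, not values taken from this thread.

```matlab
% Assumes an in-memory table "data" of samples (name and contents are illustrative)
rng(0);                                  % fix the seed for a reproducible shuffle
n      = height(data);
idx    = randperm(n);                    % random permutation of row indices
nTrain = round(0.60*n);
nVal   = round(0.20*n);
trainData = data(idx(1:nTrain), :);                  % 60% training
valData   = data(idx(nTrain+1:nTrain+nVal), :);      % 20% validation
testData  = data(idx(nTrain+nVal+1:end), :);         % 20% test

% If the data lives in a datastore, shuffle it directly instead:
% ds = shuffle(ds);

% Alternatively, k-fold cross-validation indices via cvpartition:
% c     = cvpartition(n, 'KFold', 5);
% trIdx = training(c, 1);   teIdx = test(c, 1);

% Lower the learning rate and add L2 regularization in the training options
% (example values only; tune for your own data)
options = trainingOptions('sgdm', ...
    'InitialLearnRate', 1e-4, ...        % lower rate to stabilize fine-tuning
    'L2Regularization', 1e-3, ...        % weight-decay regularization
    'MaxEpochs', 10, ...
    'ValidationData', valData, ...
    'Shuffle', 'every-epoch');
```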
Overinflated mini-batch Accuracy and Validation Accuracy when training Faster-RCNN
Hi. Can anyone offer any reasoning as to why, when I train a Faster R-CNN using transfer learning (i.e. ResNet-18/ResNet-50), after the first 50 iterations the mini-batch accuracy and validation accuracy immediately jump to ~99%, yet when I review the results the network's performance doesn't reflect the mini-batch or validation accuracy (it's always much worse)? I've tried: 1) reducing the training set percentage, 2) increasing the mini-batch size, 3) increasing the validation frequency, 4) changing the max epochs, 5) experimenting with different anchor box sizes. No matter what I try, the training progress, which looks good, doesn't match the final results.
The top figure is what I always see, the middle is the precision-recall curve, and the last figure is an example of the options I was using, but I've changed a lot of these parameters and don't see a difference.
Prince Kumar on 7 Apr 2022
This generally happens when your model is memorizing the data instead of learning the underlying pattern. This scenario is called 'overfitting'.
The following few things can be tried:
Hope this helps!