Why does the SVM-based fitcecoc function make unexplainable misclassifications when 'FitPosterior' is true?

Question

Omar Elnaggar on 20 Dec 2021


Direct link to this question

https://au.mathworks.com/matlabcentral/answers/1614480-why-svm-based-fitcecoc-function-makes-unexplainable-misclassifications-when-fitposterior-label-is

Answered: Sahil Jain on 22 Dec 2021

I am using the fitcecoc function with an SVM template (RBF kernel) and the 'onevsone' coding design. Each input observation is a 16-dimensional vector of floating-point numbers, and the output should be one of 12 different class labels.

I know from my training and testing datasets that a few classes overlap, so I expect some degree of misclassification.

I noticed something interesting: when the 'FitPosterior' option is false, the overall ECOC model (~70% accurate) makes misclassifications that can be explained by the few overlapping classes. I verified this by removing one overlapping class and retraining the whole ECOC model, and the performance improved.

However, when I enabled the 'FitPosterior' option to get probabilities (not just hard output labels), the overall performance of the ECOC model improved (~84% accurate), but some misclassifications persisted. The difference this time is that these misclassifications no longer involve the overlapping classes. Instead, the model assigns incoming test instances to very different classes (with little to no overlap).
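The comparison described above can be sketched roughly as follows. This is a minimal sketch, not the asker's actual code: the synthetic data, the hold-out split, and the 'KernelScale' and 'Standardize' settings are all assumptions made so the example is self-contained. It trains the same one-vs-one ECOC model with and without 'FitPosterior' and collects the confusion matrices so the two sets of errors can be inspected side by side. It also displays the BinaryLoss property of each model, since label decoding depends on it and it may be worth checking whether the two models use the same one.

```matlab
% Hypothetical synthetic data: 12 classes, 16 features (not the asker's dataset)
rng(0);
numClasses = 12; numFeatures = 16; nPerClass = 60;
centers = 3 * randn(numClasses, numFeatures);
X = repelem(centers, nPerClass, 1) + randn(numClasses * nPerClass, numFeatures);
Y = repelem((1:numClasses)', nPerClass, 1);

% Stratified hold-out split (assumed 70/30)
cv = cvpartition(Y, 'HoldOut', 0.3);
Xtrain = X(training(cv), :);  Ytrain = Y(training(cv));
Xtest  = X(test(cv), :);      Ytest  = Y(test(cv));

% RBF-kernel SVM template; KernelScale and Standardize are assumptions
t = templateSVM('KernelFunction', 'rbf', 'KernelScale', 'auto', 'Standardize', true);

% Same ECOC design, without and with posterior fitting
mdlScores = fitcecoc(Xtrain, Ytrain, 'Learners', t, 'Coding', 'onevsone');
mdlPost   = fitcecoc(Xtrain, Ytrain, 'Learners', t, 'Coding', 'onevsone', ...
                     'FitPosterior', true);

predScores = predict(mdlScores, Xtest);
predPost   = predict(mdlPost, Xtest);

accScores = mean(predScores == Ytest);
accPost   = mean(predPost == Ytest);

% Confusion matrices (rows: true class, columns: predicted class)
cmScores = confusionmat(Ytest, predScores);
cmPost   = confusionmat(Ytest, predPost);

% Test observations on which the two models disagree
disagree = find(predScores ~= predPost);

% Binary loss used when decoding labels in each model
disp(mdlScores.BinaryLoss);
disp(mdlPost.BinaryLoss);
```

Comparing cmScores and cmPost, and looking at the observations indexed by disagree, shows which class pairs are confused only when posterior fitting is enabled.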

To wrap up, I find it difficult trying to understand:

(1) Why did the performance improve with 'FitPosterior' enabled compared to with it disabled? And why was this improved performance associated with reduced explainability and bizarre misclassifications (between classes that do not overlap)?

(2) How does 'FitPosterior' work as an algorithm? Is there any way to control how this "Posterior Probability Estimation" gets trained?


Answer 1

Sahil Jain on 22 Dec 2021


Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/1614480-why-svm-based-fitcecoc-function-makes-unexplainable-misclassifications-when-fitposterior-label-is#answer_859985

Hi Omar. By default, the software estimates class posterior probabilities by minimizing the Kullback-Leibler divergence. Alternatively, quadratic programming can be used (requires Optimization Toolbox). To learn more about the algorithm, please refer to the Algorithms section of the documentation for the "predict" function. To understand the behaviour of the algorithm, I'd suggest going through the references linked in that section.
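As a rough illustration of the two estimation methods, the sketch below uses the 'PosteriorMethod' name-value argument of predict for ECOC models ('kl' or 'qp'). The fisheriris example dataset and the template settings are assumptions chosen only to keep the example short. It also checks how often the predicted label differs from the class with the largest posterior, since predict returns the label with the largest negated average binary loss (the second output), which need not coincide with the posterior maximum.

```matlab
% Example data shipped with Statistics and Machine Learning Toolbox (assumption)
load fisheriris

% RBF-kernel SVM template; KernelScale and Standardize are assumptions
t = templateSVM('KernelFunction', 'rbf', 'KernelScale', 'auto', 'Standardize', true);
mdl = fitcecoc(meas, species, 'Learners', t, 'Coding', 'onevsone', ...
               'FitPosterior', true);

% Posterior estimation by minimizing the Kullback-Leibler divergence
[labelKL, negLoss, pbScore, postKL] = predict(mdl, meas, 'PosteriorMethod', 'kl');

% Posterior estimation by quadratic programming, only if the toolbox is licensed
if license('test', 'Optimization_Toolbox')
    [~, ~, ~, postQP] = predict(mdl, meas, 'PosteriorMethod', 'qp');
    maxDiff = max(abs(postKL(:) - postQP(:)));   % largest gap between the two estimates
    fprintf('Max |KL - QP| posterior difference: %g\n', maxDiff);
end

% Compare the returned label with the class of largest posterior probability
[~, idxPost] = max(postKL, [], 2);
labelFromPost = mdl.ClassNames(idxPost);
numMismatch = sum(~strcmp(labelKL, labelFromPost));
fprintf('Labels differing from the posterior argmax: %d\n', numMismatch);
```

For some additional control over the KL-based estimate, predict also accepts 'NumKLInitializations', which sets the number of random initial values tried during the fit.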
