Kruskal-Wallis multcompare comparison intervals
Show older comments
Hello,
currently I am analysing results of my study and I found that a good multcomparison for it would be using the kruskalwallis with dunn-sidak.
My question is now, how to interpret the comparison intervals, which are not confidence intervals. I know, that if they do not overlap, the data comes from different distributions. But how are these comparison intervals generated?
Through researching and playing with the data and multcompare graphs I found that the width of the comparison intervals equals to 4 x the standard error (obtained through [m] = multcompare(stats)). Why is that the case?
However, I cannot find a reference of how these standard errors are calculated. And I cannot find anything on the MATLAB documentation.
Can you help me?
Another question is how do I label the x-axis of the multcompare graph for a publication? In principal, 'ranks' should be correct, right?
Thank you and kind regards
Michael
7 Comments
Scott MacKenzie
on 9 Jul 2021
It might help if you post your data and any MATLAB code you've written so far. Using MATLAB's kruskalwallis function is fairly striaghtforward. The documentation includes examples and also demonstrates doing a post-hoc multiple comparisons test using multcompare.
Michael Werthmann
on 9 Jul 2021
Scott MacKenzie
on 9 Jul 2021
Well, I can't recite the calculcations, but if you want to know how to do the caclulations, just look them up in a stats book. Many moons back, I wrote Java apps to do the Kruskal-Wallis and numerous other tests. The book I used and recommend is David Sheskin's Handbook of Parametric and Non-Parametrics Statistical Procedures from CRC Press.
Michael Werthmann
on 11 Jul 2021
Scott MacKenzie
on 11 Jul 2021
The conversion to ranks is something that occurs behind the scenes in the Kruskal-Wallis and other non-parametric tests. The responses tested are usually just answers to questionnaire items, qualitative assessments, or something like that. I'm not sure what kind of chart you are contemplating or what the data represent, so I can't suggest an x-axis label.
Michael Werthmann
on 11 Jul 2021
Edited: Michael Werthmann
on 11 Jul 2021
Scott MacKenzie
on 11 Jul 2021
I'm not sure about the 2nd plot, but for the box plot, let's start first with the y-axis.
I doubt "Jaccard" is the correct label. The label should be whatever the values represent. For the first box, the value is about 0.77. What is that? Perhaps "Coefficient Increase" or "Segmentation Overlap", or whatever the measure represents.
For the x-axis label, an appropriate label is probably something like "Image Registration Technique". For the x-axis tick labels, the first one should probably be "Jaccard". The other four tick labels should be names associated with the algorithms. Probably, "Algorithm 1", "Algorithm 2", etc., is a bit bland. I'm sure you have better names for these. Hope this helps. Good luck.
Answers (0)
Categories
Find more on Analysis of Variance and Covariance in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!
