Given a null distribution, how can I calculate a p-value for my test statistic?

Question

Prabhjot Dhami on 3 Feb 2022

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/1642505-given-a-null-distribution-how-can-i-calculate-a-p-value-for-my-test-statistic

Commented: Jeff Miller on 4 Feb 2022

Greetings,

For example, let's say I have two groups and want to see if their means are significantly different. However, I want to do so in a shuffling/permutation framework.

Accordingly, I shuffle the group labels across data points, calculate the difference between means, and do so 5000 times to create a null distribution.

I have my original unshuffled mean difference, and see that it is in the top 2.5 percentile of the null distribution. I can thus conclude that the difference is significant at the two-tailed level.

However, in this context, how can I compute the exact p-value of my original mean difference value with the null distribution? I am lost when it comes to finding the right function to do so.

Thank you.

P.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Jeff Miller on 3 Feb 2022

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/1642505-given-a-null-distribution-how-can-i-calculate-a-p-value-for-my-test-statistic#answer_888585

The one-tailed p value is just the tail probability of your original unshuffled difference relative to the null distribution that you created by shuffling. In your example where the unshuffled mean is at the edge of the top 2.5% of the null distribution, p=.025.

For two-tailed testing, the p would be double this tail probability (e.g., 2*.025=.05)

2 Comments
Show NoneHide None

Prabhjot Dhami on 4 Feb 2022

Hi Jeff,

How can I calculate the exact p-value though in my case of the true mean difference in the distribution of the shuffled data?

Jeff Miller on 4 Feb 2022

Open in MATLAB Online

If Obs is the observed mean difference and SN is a vector of differences from the shuffled data, you could compute for example

pLess = mean(SN<Obs);
pGreater = mean(SN>Obs);

to get the exact probability of null values less than or greater than your observed value. The mean of the many 0's and 1's will be the proportion that you are interested in.

Sign in to comment.

Given a null distribution, how can I calculate a p-value for my test statistic?

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

2 Comments
Show NoneHide None

See Also

Categories

Tags

Community Treasure Hunt

Given a null distribution, how can I calculate a p-value for my test statistic?

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

2 Comments Show NoneHide None

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

2 Comments
Show NoneHide None