Hello,

This is my first post as a new member, so hope I'm posting this in the right place...

I need a general formula to calculate the minimum sample size N required to determine whether k binomial variables are somehow correlated (either positively or negatively) at significance level p<= alpha, where:

N = sample size

k = no. of binomial variables, from i= 1 to i=k

p(i) = probability that variable i is "positive" (otherwise negative)

alpha = maximum p-Value

For example, how many times must I toss a set of k biased coins, in order to determine whether any of them somehow talk to each other at significance level alpha?

In this case the Null Hypothesis is that the k coins act independently according to their respective probabilities p(i=1 to k), so I think we're looking at a 2-tailed distribution where a correlation could be either positive or negative. But I'm not sure.

Is there a general formula for the actual p-value in terms of N, k and p(i), which I could then solve/rearrange for N in terms of k, p(i) and alpha?

I'm really struggling with this, so any help would be appreciated, thanks!

Kelvin

This is my first post as a new member, so hope I'm posting this in the right place...

I need a general formula to calculate the minimum sample size N required to determine whether k binomial variables are somehow correlated (either positively or negatively) at significance level p<= alpha, where:

N = sample size

k = no. of binomial variables, from i= 1 to i=k

p(i) = probability that variable i is "positive" (otherwise negative)

alpha = maximum p-Value

For example, how many times must I toss a set of k biased coins, in order to determine whether any of them somehow talk to each other at significance level alpha?

In this case the Null Hypothesis is that the k coins act independently according to their respective probabilities p(i=1 to k), so I think we're looking at a 2-tailed distribution where a correlation could be either positive or negative. But I'm not sure.

Is there a general formula for the actual p-value in terms of N, k and p(i), which I could then solve/rearrange for N in terms of k, p(i) and alpha?

I'm really struggling with this, so any help would be appreciated, thanks!

Kelvin

Last edited: