I have searched and read a lot about this but I'm still unsure which test is suitable for the following situation.
Data come from a few hundred participants who saw, in each of several trials, a pair of stimuli and made a binary judgment (indicating which stimulus in the pair has a certain quality; there is an objectively correct answer). I am not interested in participant-level variation. I want to make inferences about the stimulus pairs. I want to compare these several pairs against each other (e.g., Pair 1 vs. Pair 2; Pair 1 vs. Pair 3, etc.) to conclude if certain pairs afford more accurate judgment. The stimuli in the pairs overlap, by the way (e.g., Pair 1 contrasts Stim 1 vs. Stim 2; Pair 2 contrasts Stim 1 vs. Stim 3, etc.).
In the end, I only want to compare one figure--the % of correct responses--in one pair against the same figure in another pair; and do this for multiple pairs.
The definitions of McNemar and Cochran's Q seem closest to this. Can someone verify that either of these would be a sensible choice here? I guess I cannot simply use the percentages, I would need to create 2*2 tables (e.g., how many participants gave correct response in Pair 1 AND incorrect in Pair 2, etc..
I can work with R or Excel (also SPSS but not preferred, if that matters. Thank you in advance!
Data come from a few hundred participants who saw, in each of several trials, a pair of stimuli and made a binary judgment (indicating which stimulus in the pair has a certain quality; there is an objectively correct answer). I am not interested in participant-level variation. I want to make inferences about the stimulus pairs. I want to compare these several pairs against each other (e.g., Pair 1 vs. Pair 2; Pair 1 vs. Pair 3, etc.) to conclude if certain pairs afford more accurate judgment. The stimuli in the pairs overlap, by the way (e.g., Pair 1 contrasts Stim 1 vs. Stim 2; Pair 2 contrasts Stim 1 vs. Stim 3, etc.).
In the end, I only want to compare one figure--the % of correct responses--in one pair against the same figure in another pair; and do this for multiple pairs.
The definitions of McNemar and Cochran's Q seem closest to this. Can someone verify that either of these would be a sensible choice here? I guess I cannot simply use the percentages, I would need to create 2*2 tables (e.g., how many participants gave correct response in Pair 1 AND incorrect in Pair 2, etc..
I can work with R or Excel (also SPSS but not preferred, if that matters. Thank you in advance!
Last edited: