Hey all. I'm having issues trying to figure out how to record and analyze some data I've collected. Here's the short story:

8 kids took before and after yes/no questionnaires and answers are scored with 1 or 2 points. Questions are phrased so that sometimes Yes is worth 2 points and sometimes No is worth 2 points (to control for response set). The questionnaires do not have the same questions and do not have the same NUMBER of questions.

I've considered a simple t-test for two correlated samples. This would certainly be the easiest to do by transposing the qualitative data to quantitative by "grading" as points earned out of total possible points. Does the fact that participants were children brought by their parents to a presentation at a zoo change that? Is this considered true random sampling? Is the "grading" method considered an equal-interval scale? If so...

I've considered the Wilcoxon signed rank test, but how would I enter this? Could I "grade" the questionnaires and enter them as before/after values? Does the nature of the questionnaire have any bearing on this? If so...

I've considered the McNamar test, but how would I score this? Would I have to do it for each individual participant with 0 for responses earning point and 1 for responses earning 2 points? How would I combine the data to look at the whole group?

And, on a more general note, I'm not sure I totally understand the difference between what the Wilcoxon signed rank test and the McNamar test tells us. Would someone be willing to explain it in a "Stats for Dummies" way? It's not completely foreign, but stats are not my forte.

Thanks for any and all help.