Pearson r and Data

I am trying to determine the relationship between teachers' scores on a test and their students' scores on it. I have 40 classes, for a total of 40 teachers and 335 students. For example, Teacher (Bob) scored 70 out of 100 and his 24 students scored an average of 60; Teacher (Susie) scored 83 and her 27 students scored an average of 78; and so forth. I basically have 40 sets of scores.

Here is my question:
Can I use Pearson r to determine the relationship between the teachers' scores and their students' averaged scores? If not, what would you recommend?




Can't make spagetti
Yours is a clear example of what is called the "unit of analysis problem" in the social/behavioural/health sciences. I would advise against the averaging procedure because you'd risk what's called an ecological fallacy: a correlation computed on class averages describes classes, not individual students. It's a mathematical fact that the variance of group means is less than the variance of the individual scores, which typically inflates the correlation coefficient relative to the student-level one.
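To make the variance point concrete, here is a minimal simulation sketch using only the Python standard library. Everything in it is made up for illustration: the equal class size of 8, the score distributions, and the 0.5 slope are assumptions, not your data. It compares the variance of individual scores with the variance of class means, and the Pearson r computed on class means with the one computed on individual students.

```python
import random
import statistics


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


rng = random.Random(0)          # fixed seed so the run is repeatable
n_classes, class_size = 40, 8   # equal class sizes: a simplifying assumption

teacher_scores = []       # one entry per class
class_means = []          # one entry per class
student_scores = []       # one entry per student
teacher_per_student = []  # teacher score repeated for each of their students

for _ in range(n_classes):
    t = rng.gauss(75, 10)  # hypothetical teacher score
    # Hypothetical student scores: weakly tied to the teacher score, plus noise.
    scores = [30 + 0.5 * t + rng.gauss(0, 15) for _ in range(class_size)]
    teacher_scores.append(t)
    class_means.append(statistics.fmean(scores))
    student_scores.extend(scores)
    teacher_per_student.extend([t] * class_size)

var_individuals = statistics.pvariance(student_scores)
var_means = statistics.pvariance(class_means)
r_means = pearson_r(teacher_scores, class_means)
r_individual = pearson_r(teacher_per_student, student_scores)

print(f"variance of individual scores: {var_individuals:.1f}")
print(f"variance of class means:       {var_means:.1f}")
print(f"r using class means:           {r_means:.3f}")
print(f"r using individual students:   {r_individual:.3f}")
```

With equal class sizes the covariance with the teacher score is the same at both levels, so the smaller variance of the means is exactly what pushes the class-mean r above the student-level r. With unequal class sizes like yours the comparison is less tidy, but the same tendency applies.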

The most appropriate way of dealing with this kind of situation, where your data are nested, is a specific extension of the general linear model called hierarchical linear models or multilevel modeling. If you read the "Level" section of the article you'll see the same situation you're dealing with here: pupils nested within classes (or teachers, in your case). So you'd have a 2-level model where the students' scores are your level-one outcome and the teacher's score is a level-two predictor. And even if you have unequal class sizes or missing data, multilevel models can handle that quite nicely...
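For reference, one common way to write such a 2-level model is the random-intercept form below. The symbols are my own labels, not anything from your data: Y_ij is the score of student i in class j, T_j is the score of teacher j, and the two error terms are assumed to be independent and normally distributed.

```latex
% Level 1 (students within a class)
Y_{ij} = \beta_{0j} + e_{ij}, \qquad e_{ij} \sim N(0, \sigma^2)

% Level 2 (classes / teachers)
\beta_{0j} = \gamma_{00} + \gamma_{01} T_j + u_{0j}, \qquad u_{0j} \sim N(0, \tau^2)

% Combined model
Y_{ij} = \gamma_{00} + \gamma_{01} T_j + u_{0j} + e_{ij}
```

Here \gamma_{01} is the quantity you care about (how student scores change with the teacher's score), while \tau^2 and \sigma^2 separate the between-class variation from the within-class variation that averaging would have thrown away.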

Hope it helps!