
Originally Posted by kowalsks
A little bit more background:
I have two years of data. Year 1 is data from the first year we ever used peer assessment. In Year 2, some new methods were used. I'm trying to see whether the new methods have made a difference in the quality of the grades given by the peer assessors. (That's why I was trying to get a reliability score, so I could compare Year 1 with Year 2.) One of the concerns with the Year 1 data was that the students were just giving everyone the same score.
Also, the grades given by the students are categorical because of the grading scheme we use (Fail, Poor, Pass, Excellent).
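
A rough sketch of how the "same score for everyone" concern could be checked directly, before worrying about a reliability coefficient: for each year, count the fraction of assessors who only ever used one grade. The record layout (year, assessor ID, grade), the function name, and the sample rows are all assumptions made up for illustration; they are not the real data.

```python
from collections import defaultdict

# Hypothetical records: (year, assessor_id, grade). Invented for illustration only.
ratings = [
    (1, "a1", "Pass"), (1, "a1", "Pass"), (1, "a1", "Pass"),
    (1, "a2", "Pass"), (1, "a2", "Excellent"), (1, "a2", "Pass"),
    (2, "b1", "Fail"), (2, "b1", "Pass"), (2, "b1", "Excellent"),
    (2, "b2", "Poor"), (2, "b2", "Poor"), (2, "b2", "Poor"),
    (2, "b3", "Pass"), (2, "b3", "Poor"), (2, "b3", "Pass"),
]


def share_of_uniform_assessors(records):
    """Per year, the fraction of assessors who gave every student the same grade."""
    # Collect the distinct grades each assessor used within a year.
    grades_by_assessor = defaultdict(set)
    for year, assessor, grade in records:
        grades_by_assessor[(year, assessor)].add(grade)

    uniform = defaultdict(int)  # assessors who used exactly one grade
    total = defaultdict(int)    # all assessors seen in that year
    for (year, _assessor), grades in grades_by_assessor.items():
        total[year] += 1
        if len(grades) == 1:
            uniform[year] += 1

    return {year: uniform[year] / total[year] for year in total}


result = share_of_uniform_assessors(ratings)
for year in sorted(result):
    print(f"Year {year}: {result[year]:.2f} of assessors used a single grade")
```

This treats the grades purely as labels, so it works with the categorical scheme as-is. It only describes rater behaviour and is not a reliability score; a drop in this fraction from Year 1 to Year 2 would suggest less uniform grading, not necessarily better grading.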