Are we talking about Cohen's kappa here? That's used mainly for qualitative (i.e. nominal) ratings. OP's data is at least ordinal, with 50 possible final scores. Kappa will count any difference between scores as a disagreement. E.g., if someone gets a score of 21 at time 1, and then 22 at time 2, this is recorded as a disagreement. Same for a score of 50 at time 1 and 0 at time 2. Ideally you'd prefer something here that takes into account the ordering of ratings.
You could do a weighted kappa, but probably the most conventional way to deal with this problem is to use a correlation (Spearman's or Pearson's, depending if you want to treat the data as ordinal or interval).