Statistical significance in a reproducibility study

Thanks for taking the time to read this thread and help me out!

I recently carried out a reproducibility study in which 15 observers from each of 4 groups (60 observers in total) each measured a continuous variable whose possible values range from 0 to 1. I calculated the limit of reproducibility for each group as 1.96 × √2 × SD. I now have the limit of reproducibility for each of the 4 groups and am wondering what test I can use to determine whether group 1's measurements are statistically more reproducible than group 2's, and so on for the other pairs of groups.
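To make the calculation concrete, here is a minimal Python sketch of the formula above. It assumes SD is the sample standard deviation (n - 1 denominator) of one group's measurements. The function name `reproducibility_limit` and the data values are made up purely for illustration and are not my real data.

```python
import math
import statistics


def reproducibility_limit(measurements):
    """Limit of reproducibility: 1.96 * sqrt(2) * SD.

    Assumption: SD is the sample standard deviation (n - 1 denominator).
    """
    sd = statistics.stdev(measurements)
    return 1.96 * math.sqrt(2) * sd


# Hypothetical measurements on the 0-to-1 scale (illustrative only).
groups = {
    "group 1": [0.42, 0.45, 0.40, 0.47, 0.43],
    "group 2": [0.30, 0.55, 0.48, 0.35, 0.60],
}

limits = {name: reproducibility_limit(values) for name, values in groups.items()}

for name, limit in limits.items():
    print(f"{name}: limit of reproducibility = {limit:.3f}")
```

Since the limit is just the SD multiplied by a constant, a smaller limit for one group simply reflects a smaller SD for that group.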

Any help is greatly appreciated!