    Quantifying how estimates compare to a gold standard

    This may be too basic a question, but I would greatly appreciate any guidance. Say you have a dataset in which several people have visually estimated the number of dots in each of a series of samples (say, a series of boxes). It turns out that almost everyone overestimates. And you count the actual...