1. ## How do I choose between sensitivity/specificity and PPV/NPV?

I have just finished a validation study in which two specialists used the available information to classify patients by disease status as true cases or non-cases. I have confirmed diagnoses for a sample of those patients, and I now want to see how successfully the specialists evaluated the disease status of those patients.

I tried to work it out myself, but I still don't fully understand the difference between sensitivity and PPV, or between specificity and NPV. Which of those values are relevant for my analysis? Is it necessary to present all four values, or is it enough to present just the PPV and NPV?

Thank you for your help! This is probably a really basic question but I did not really find an answer to it myself.

People typically present all four, plus accuracy, i.e., (TP + TN) / (TP + TN + FN + FP). If you are just creating a classification table for each specialist, then you could present those tables, or just the counts for TP, FP, FN, and TN, and the reviewer can always calculate the metrics on their own.
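To make the arithmetic concrete, here is a minimal Python sketch of how the five numbers fall out of one 2x2 table. The function name `diagnostic_metrics` and the counts are made up for illustration, not taken from your study.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Compute the usual summary metrics from the four cells of a 2x2 table."""
    total = tp + fp + fn + tn
    return {
        # Of those WITH the condition, the fraction classified as cases.
        "sensitivity": tp / (tp + fn),
        # Of those WITHOUT the condition, the fraction classified as non-cases.
        "specificity": tn / (tn + fp),
        # Of those CLASSIFIED as cases, the fraction that truly have the condition.
        "ppv": tp / (tp + fp),
        # Of those CLASSIFIED as non-cases, the fraction that truly do not.
        "npv": tn / (tn + fn),
        # Fraction of all classifications that were correct.
        "accuracy": (tp + tn) / total,
    }


# Hypothetical counts for one specialist (illustrative only).
metrics = diagnostic_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

You would run this once per specialist. Note that the sketch does not guard against empty rows or columns (a zero denominator), which you would need to handle in real data.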

Yes, all four metrics seem the same to most people until you have worked with them enough. The difference is in what each one conditions on: sensitivity is "of those with the condition...", specificity is "of those without the condition...", PPV is "of those testing positive...", and NPV is "of those testing negative...".

A side note: sensitivity and PPV share a numerator (TP), as do specificity and NPV (TN); each pair differs only in its denominator, which is why they are easy to confuse. However, the prevalence of the condition in your sample changes PPV and NPV, whereas sensitivity and specificity in principle do not depend on it, a nuance many don't realize.
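A quick way to see the prevalence effect is to hold sensitivity and specificity fixed and vary the prevalence. The sketch below does that in Python; the function name `predictive_values` and the 90%/90% figures are assumptions chosen for illustration.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) implied by sensitivity, specificity, and prevalence."""
    # Expected proportion of the population in each cell of the 2x2 table.
    tp = sensitivity * prevalence
    fn = (1 - sensitivity) * prevalence
    tn = specificity * (1 - prevalence)
    fp = (1 - specificity) * (1 - prevalence)
    return tp / (tp + fp), tn / (tn + fn)


# Same hypothetical classifier (90% sensitive, 90% specific), two prevalences.
for prevalence in (0.05, 0.50):
    ppv, npv = predictive_values(0.90, 0.90, prevalence)
    print(f"prevalence={prevalence:.2f}  PPV={ppv:.3f}  NPV={npv:.3f}")
```

With these illustrative numbers, the PPV is roughly 0.32 at 5% prevalence and 0.90 at 50% prevalence, even though sensitivity and specificity never changed.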

Thank you very much! Just one more question...

Does the fact that my sample consists only of patients with a putative diagnosis of the disease in question influence the interpretation of the PPV and NPV? Will they still be relevant (just with higher values), or are they only relevant when the sample reflects the true prevalence of the disease?

By "putative", do you mean based on clinical judgment/gestalt?

If yes, you can only generalize the findings to comparable scenarios and, as you mentioned, you should be cautious about conclusions drawn from the PPV and NPV.

By putative diagnoses I meant the ones I am testing for validity. I have some gold-standard diagnoses that I want to compare the putative diagnoses against.

