I'm not sure if this question is Applied Statistics, my apologies if it isn't.
I have different implementations of different classification models (Discriminant Analsis, SVM, Neural Networks, Decision trees, etc) a total of 40 implementations and I need to compare them in two different sets...