james
02-22-2009, 09:03 PM
I was reading an article on logistic regression tonight, and I noticed the following:
"Logistic regression can be used to classify observations as events or nonevents
as was done in discriminant - classification analysis. ... To use this information you would search through the classification table to find the probability cut-off point that produces the best classification performance. In our example a probability of .22 to .28 produces rules that have the highest overall successful classification rate."
Are they saying that it could possibly be appropriate to say: your probability of having cancer is 25% by our model; however, based on our cutpoint, we believe you have cancer.
I'm confused, because I was of the opinion that you should be consistent with your classifier - that is, if you change your cutpoint to like 25%, then you should rescale your probabilities around this point.
"Logistic regression can be used to classify observations as events or nonevents
as was done in discriminant - classification analysis. ... To use this information you would search through the classification table to find the probability cut-off point that produces the best classification performance. In our example a probability of .22 to .28 produces rules that have the highest overall successful classification rate."
Are they saying that it could possibly be appropriate to say: your probability of having cancer is 25% by our model; however, based on our cutpoint, we believe you have cancer.
I'm confused, because I was of the opinion that you should be consistent with your classifier - that is, if you change your cutpoint to like 25%, then you should rescale your probabilities around this point.