# Thread: when to use the median and when the mean ?

1. ## when to use the median and when the mean ?

Good morning,

I have an assignment in which I need to specify, with examples, the situations in which the mean is the best measure of central tendency and those in which the median is. (I also need to cover the mode, but I already know that part.)

I read that the median is better when there is an outlier, and that makes sense: if I look at people's salaries and happen to sample Bill Gates, I will get a huge mean. By that logic, why and when should I use the mean? Isn't it better to use the median all the time? If there are no outliers, why use the mean?

2. ## Re: when to use the median and when the mean ?

The median is a better measure of central tendency when there are outliers in the data. The mean is vulnerable to outliers; that is, it gets pulled in the direction of the outlier. For your income example, imagine a sample of 50 individuals with their yearly salaries reported. If 49 people have a yearly salary between \$50,000 and \$60,000 and Bill Gates (who happens to be sampled) has a yearly salary of \$1.1 billion, the mean is pulled far upward by that one outlying salary. The median gives a more reliable measure of central tendency here because it stays essentially unaffected by the outlier.
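To make the salary example concrete, here is a minimal sketch using Python's standard `statistics` module. The 49 ordinary salaries are made up (drawn uniformly between \$50,000 and \$60,000 with a fixed seed, which is an assumption for illustration); only the 1.1 billion figure comes from the post.

```python
import random
import statistics

random.seed(0)  # fixed seed so the illustration is repeatable

# 49 hypothetical salaries between $50,000 and $60,000 (illustrative data)
salaries = [random.uniform(50_000, 60_000) for _ in range(49)]
mean_before = statistics.mean(salaries)
median_before = statistics.median(salaries)

# Add the single outlier from the post: a yearly salary of 1.1 billion
with_outlier = salaries + [1_100_000_000]
mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print(f"without outlier: mean={mean_before:,.0f}  median={median_before:,.0f}")
print(f"with outlier:    mean={mean_after:,.0f}  median={median_after:,.0f}")
```

The mean jumps to more than 22 million because 1.1 billion divided by 50 is already 22 million, while the median stays inside the \$50,000 to \$60,000 range.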

Hope this clears things up.

3. ## Re: when to use the median and when the mean ?

Originally Posted by victorxstc
I think it is because, unless the population distribution is perfectly symmetrical or its size is reasonably large, the median cannot indicate the center point. Consider the small asymmetries in a small population as micro-outliers that can keep the median from working perfectly, unless the micro-outliers with positive and negative effects on the location of the median become numerous enough to neutralize each other, or there are none at all. Those micro-outliers don't affect the mean, however, because calculating the mean actually sums all of the values, whereas calculating the median only checks which value is in the middle.
You don't define what you mean by micro-outliers, but your whole post just feels wrong. As tokai points out, the median is better when there are outliers because the outliers don't affect it as much.

4. ## Re: when to use the median and when the mean ?

If your data are affected by non-normality (skew, outliers, etc.), the median is commonly a better measure of central tendency. But there are better options than that: winsorized means are commonly suggested, as is simply transforming your data to deal with the skew or outliers.
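For anyone unfamiliar with winsorizing, here is a rough sketch of one common variant: the lowest and highest fraction of values are clamped to the nearest remaining value before averaging. The helper name `winsorized_mean`, the 10% default, and the data are all illustrative assumptions, not a reference implementation.

```python
import statistics

def winsorized_mean(values, proportion=0.1):
    """Mean after clamping the lowest and highest `proportion` of values."""
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    ordered = sorted(values)
    n = len(ordered)
    k = int(proportion * n)  # number of values to clamp in each tail
    if k > 0:
        low, high = ordered[k], ordered[n - k - 1]
        # Pull each tail value in to the nearest value that is kept
        ordered = [min(max(v, low), high) for v in ordered]
    return statistics.mean(ordered)

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 1000]  # hypothetical data with one outlier
raw_mean = statistics.mean(data)           # dominated by the value 1000
wins_mean = winsorized_mean(data, 0.1)     # 1 -> 2 and 1000 -> 9 before averaging
med = statistics.median(data)

print(raw_mean, wins_mean, med)
```

Unlike the median, the winsorized mean still uses the magnitude of every value that is not clamped, which is part of why it gets suggested as a compromise.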

5. ## Re: when to use the median and when the mean ?

Thanks, guys.

So yes, the median is better when the data are skewed or have outliers, but when do I use the mean, then? If the data are symmetric and have no outliers, the median and the mean are almost equal, aren't they?

When do I use the mean, and why not the median?

6. ## Re: when to use the median and when the mean ?

We also use the mean because of a useful property: if you subtract it from every number in the set, square the differences, and sum them, you get the smallest sum of squares that any single reference value can give (the least sum of squares). This is crucial for the calculation of the variance and standard deviation.
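A short derivation of why the mean gives the least sum of squares, for anyone who wants to see it written out. Here $c$ is any candidate center and $\bar{x}$ is the mean of $x_1, \dots, x_n$ (the notation is mine, not from the posts above):

$$
\sum_{i=1}^{n}(x_i - c)^2
= \sum_{i=1}^{n}(x_i - \bar{x})^2 + 2(\bar{x} - c)\sum_{i=1}^{n}(x_i - \bar{x}) + n(\bar{x} - c)^2
= \sum_{i=1}^{n}(x_i - \bar{x})^2 + n(\bar{x} - c)^2
$$

The middle term vanishes because the deviations from the mean sum to zero. The remaining extra term $n(\bar{x} - c)^2$ is never negative and is zero only when $c = \bar{x}$, so no other center can give a smaller sum of squares.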

7. ## Re: when to use the median and when the mean ?

You could technically take each value's squared distance from the median and operate on that value. What meaning or use it has, maybe smarter people than myself will know! But the mean has nice properties, no doubt.
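One way to look at it is a small sketch comparing two kinds of spread around each center. The data set is made up, and the two helper functions are only for illustration. Squared deviations come out smallest around the mean, while the median is the center that minimizes the sum of absolute deviations, so pairing the median with absolute rather than squared distances may be the more natural choice.

```python
import statistics

def sum_sq_dev(values, center):
    """Sum of squared deviations of `values` from `center`."""
    return sum((v - center) ** 2 for v in values)

def sum_abs_dev(values, center):
    """Sum of absolute deviations of `values` from `center`."""
    return sum(abs(v - center) for v in values)

data = [2, 3, 5, 8, 40]  # hypothetical right-skewed data
mean = statistics.mean(data)
median = statistics.median(data)

sq_about_mean = sum_sq_dev(data, mean)
sq_about_median = sum_sq_dev(data, median)
abs_about_mean = sum_abs_dev(data, mean)
abs_about_median = sum_abs_dev(data, median)

print(f"squared:  about mean={sq_about_mean:.1f}  about median={sq_about_median:.1f}")
print(f"absolute: about mean={abs_about_mean:.1f}  about median={abs_about_median:.1f}")
```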

8. ## Re: when to use the median and when the mean ?

I thought about that too, but I'm also not sure there would be a point. I don't think I know enough at this point to really speculate; perhaps someone else could!

9. ## Re: when to use the median and when the mean ?

Hi,
just to add a link to an earlier discussion on a similar topic.

Regards
Gm

10. ## Re: when to use the median and when the mean ?

I'm not sure I agree that the mean relies on an assumption of normality. There are many cases where using the mean is better than using the median and the data isn't normal.

11. ## Re: when to use the median and when the mean ?

Dason,
I was relying on what I read in a book (author: R. Wilcox). I am here to widen my knowledge and to compare my views with those of others.
Thanks for providing fuel for further speculation.

Gm

12. ## Re: when to use the median and when the mean ?

I lean toward Dason's view. It's not obvious at all why taking a mean implies normality. Although I can't formally prove it, it seems intuitively the case that the mean should be an efficient estimator for any symmetrical distribution.
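One way to probe this intuition is a small simulation: draw many samples from a symmetric distribution and see which estimator, the sample mean or the sample median, varies less from sample to sample. The sketch below is only that, a sketch; the sample size, number of repetitions, seed, and the three distributions are my own choices. The Laplace draw uses the fact that the difference of two independent Exp(1) variates is a standard Laplace variate.

```python
import random
import statistics

def estimator_variances(draw, n=25, reps=2000, seed=0):
    """Return (variance of sample means, variance of sample medians)."""
    rng = random.Random(seed)
    means, medians = [], []
    for _ in range(reps):
        sample = [draw(rng) for _ in range(n)]
        means.append(statistics.fmean(sample))
        medians.append(statistics.median(sample))
    return statistics.pvariance(means), statistics.pvariance(medians)

# Three symmetric distributions centered at 0 (illustrative choices)
distributions = {
    "uniform(-1, 1)": lambda rng: rng.uniform(-1, 1),
    "normal(0, 1)": lambda rng: rng.gauss(0, 1),
    "laplace(0, 1)": lambda rng: rng.expovariate(1) - rng.expovariate(1),
}

results = {name: estimator_variances(draw) for name, draw in distributions.items()}
for name, (var_mean, var_median) in results.items():
    print(f"{name:15s} var(mean)={var_mean:.4f}  var(median)={var_median:.4f}")
```

In this setup the mean comes out less variable for the uniform and normal cases, which supports the point that the mean does not need normality to do well. For the heavier-tailed Laplace case the median comes out less variable, so symmetry alone does not seem to guarantee that the mean is the more efficient of the two.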

13. ## Re: when to use the median and when the mean ?

This has something to say on that subject: http://mathworld.wolfram.com/StatisticalMedian.html

14. ## Re: when to use the median and when the mean ?

Thanks everyone, the discussion is interesting.

So, as I understand it, if I take the deviations x − mean and x − median, square them, sum them, and divide by n, I will get a smaller number for the mean?

Thanks again.

15. ## Re: when to use the median and when the mean ?

No problem. And yes, using the mean will give you the lowest possible sum of squares for that set of numbers.
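A quick numeric check of exactly that calculation, as a sketch with made-up numbers. The helper `mean_sq_dev` is just for illustration. It also shows by how much the median version is larger: the gap equals the squared distance between the mean and the median.

```python
import math
import statistics

def mean_sq_dev(values, center):
    """Squared deviations from `center`, summed and divided by n."""
    return sum((v - center) ** 2 for v in values) / len(values)

data = [1, 2, 2, 3, 12]  # hypothetical data; mean is 4, median is 2
mean = statistics.mean(data)
median = statistics.median(data)

msd_mean = mean_sq_dev(data, mean)      # (9 + 4 + 4 + 1 + 64) / 5
msd_median = mean_sq_dev(data, median)  # (1 + 0 + 0 + 1 + 100) / 5

print(f"about the mean:   {msd_mean}")
print(f"about the median: {msd_median}")
print(f"gap: {msd_median - msd_mean}  vs (mean - median)^2 = {(mean - median) ** 2}")
```

The two numbers are equal only when the mean and the median coincide; otherwise the version centered on the mean is strictly smaller.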