    I am almost too embarrassed to ask this because it looks very easy, but I tried hard and could not find an answer.

    I have a sample of 5,000 observations of profit/assets (%). The median is 3%, the mean is 200%, and the standard deviation is also high. This means that a few extreme observations (which I have already identified) are distorting the sample, and I need to drop them in order to get normal data. My question is: what is the rule for choosing the cutoff beyond which an observation should be dropped from the sample?
    You do not need to answer in full; it is also fine to point me to topics I should read about.
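
To illustrate the symptom, here is a minimal sketch with made-up numbers (not my real data). It assumes 4,990 observations at 3% and 10 hypothetical extreme values, chosen only so that the mean comes out at 200% while the median stays at 3%. It does not suggest a cutoff rule; it only shows how a handful of observations can produce the statistics described above.

```python
import statistics

# Made-up data, NOT the real sample: values are profit/assets in percent.
typical = [3.0] * 4990      # most observations sit at 3%
extreme = [98_503.0] * 10   # a few huge ratios (hypothetical values)
sample = typical + extreme

mean = statistics.mean(sample)      # pulled far upward by the 10 extremes
median = statistics.median(sample)  # unaffected by the 10 extremes
std = statistics.pstdev(sample)     # population standard deviation

# Same statistic once the 10 extreme observations are left out.
mean_without_extremes = statistics.mean(typical)

print(f"mean={mean:.1f}%  median={median:.1f}%  std={std:.1f}%")
print(f"mean without extremes={mean_without_extremes:.1f}%")
```

Only 0.2% of the observations are extreme here, yet they move the mean from 3% to 200%, while the median does not change at all.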

    Thanks a lot


    What is your overall goal with this dataset?

