+ Reply to Thread
Page 1 of 2 1 2 LastLast
Results 1 to 15 of 26

Thread: tests for normal data

  1. #1
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    tests for normal data




    Hi,

    I posted some like my questions, but I didn't stay very clearly with my problem.

    I have some rows of data like
    Code: 
    1 0 2 1 3 0
    I need to apply some tests to compare if the first 3 elements have the same distribution than the other 3. I have some rows with big numbers like
    Code: 
    23 32 12 33 23 54
    My idea is to test the normality of each row and if the row has a normal data, I will apply normal tests. If the row don't have normal data, I will apply not normal tests.

    Is correct what I'm thinking to do? My question is because I only have 6 data for each row...is correct to apply a shapiro.test to test the normality?

    Thank you very much

  2. #2
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    Please! I need some help

  3. #3
    Omega Contributor
    Points: 38,432, Level: 100
    Level completed: 0%, Points required for next Level: 0
    hlsmith's Avatar
    Location
    Not Ames, IA
    Posts
    7,006
    Thanks
    398
    Thanked 1,186 Times in 1,147 Posts

    Re: tests for normal data

    Your original question is a little unclear. Can you take sometime to rephrase it, so we better know what you are doing and the purpose.

    Also it may be helpful for you to read about Kolmogorov–Smirnov test.
    Stop cowardice, ban guns!

  4. #4
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    My original question: I need to compare if the groups(3 first elements and last 3) has the same distribution. For to do this I need to apply some statistical tests, but first I need to know if I have a normal data.

    Which is the difference between the two tests (kolmogorov smirnov or shapiro.test) in this case?

  5. #5
    Points: 300, Level: 5
    Level completed: 50%, Points required for next Level: 50

    Posts
    19
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Re: tests for normal data

    What do you do if you have a non-significant Shapiro-Wilks but a significant kolmogorov smirnov ? If kurtosis and skewness are less than +/-1 . Can I use a repeated measure ancova?

  6. #6
    R purist
    Points: 35,103, Level: 100
    Level completed: 0%, Points required for next Level: 0
    TheEcologist's Avatar
    Location
    United States
    Posts
    1,921
    Thanks
    303
    Thanked 607 Times in 341 Posts

    Re: tests for normal data

    Quote Originally Posted by benjwalt View Post
    What do you do if you have a non-significant Shapiro-Wilks but a significant kolmogorov smirnov ? If kurtosis and skewness are less than +/-1 . Can I use a repeated measure ancova?
    You are testing whether the residuals of the ANCOVA are normally distributed? Cause the normality assumption rests on the residuals, NOT the data.
    The true ideals of great philosophies always seem to get lost somewhere along the road..

  7. #7
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: tests for normal data

    QQ plots are commonly used (and are probably the best way to determine) normality of data. Although I have never seen this done (interestingly) I assume this could be done with residuals just as easily as raw data.

    Its interesting that QQ plots are commonly shown with raw data not residuals even with methods like ANOVA or regression that make assumptions about the normality of the residuals not the raw data.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  8. #8
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    Quote Originally Posted by Englund View Post
    There are no tests that have reasonable size and power properties for such small samples! Please correct me if I'm wrong.


    I agree, but I have only this data, and I need to test...How can I do?

  9. #9
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    Quote Originally Posted by benjwalt View Post
    What do you do if you have a non-significant Shapiro-Wilks but a significant kolmogorov smirnov ? If kurtosis and skewness are less than +/-1 . Can I use a repeated measure ancova?
    Sorry, I don't know ancova. What is this?

  10. #10
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    Quote Originally Posted by noetsi View Post
    QQ plots are commonly used (and are probably the best way to determine) normality of data. Although I have never seen this done (interestingly) I assume this could be done with residuals just as easily as raw data.

    Its interesting that QQ plots are commonly shown with raw data not residuals even with methods like ANOVA or regression that make assumptions about the normality of the residuals not the raw data.
    Thank you.

    The better thing is to do a QQ-plot. But with this, can I have a measure? A number? Or only the graph? Because I need to apply this to a lot of rows, and its impossible to look all the graphs...

    After that (and I know that the the power of the test isn't good), can I apply the test (normal or not normal) to see the distribution?

  11. #11
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: tests for normal data

    QQ plots only plot the observed data against a theoretical distribution. They don't calculate a test statistic.

    I am not sure what you mean by a lot of rows. Regardless of how many variables you have the residuals (your observed data) will be a single set of data so you can use one QQ plot.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  12. #12
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    "A lot of rows": a lot of data to test the normality. I asked if it's possible to have a "value" for the QQ plot because I need to do test for about 20000 rows of data (independently) and it's impossible to look all graphs. If I have a number, I can filter, it's beacause I talk about shapiro.test or kolmogorov-smirnov.

    In this case I need to work with one of that(shapiro.test or ks.test). Which is the best? Or more correct to use?

    Thank you

  13. #13
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: tests for normal data

    As far as I know QQ plot's don't generate a statistic or value you can test.

    In all the discussions of various normality tests I have not seen authors come down on one as being better. The general comment is that all are weak in terms of statistical power.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  14. #14
    Points: 1,838, Level: 25
    Level completed: 38%, Points required for next Level: 62

    Posts
    135
    Thanks
    18
    Thanked 2 Times in 2 Posts

    Re: tests for normal data

    I know the power is very bad, but it's correct to do a shapiro.test or ks.test? Or there are some rules that I can't apply this?

  15. #15
    TS Contributor
    Points: 6,786, Level: 54
    Level completed: 18%, Points required for next Level: 164

    Location
    Sweden
    Posts
    524
    Thanks
    44
    Thanked 112 Times in 100 Posts

    Re: tests for normal data


    Quote Originally Posted by slavia View Post
    Or there are some rules that I can't apply this?
    For the shapiro wilk test you'll need at least 5 observations, and for jarque bera 8 or more obs is required. At least according to STATA's help section.

+ Reply to Thread
Page 1 of 2 1 2 LastLast

           




Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts






Advertise on Talk Stats