Thread: Having a hard time deciding on what test to use for data

  1. #1

    Here is the premise:

    Large events are taking place on park land.
    We are using noise-monitoring hardware to measure sound levels (in decibels) both during events and outside of them.
    We have more than 1,000 samples in each of our control and experimental groups.
    The data are not normally distributed.
    The variances are different.

    What test do I use to see if there is a difference between means?

    I keep coming back to the Mann-Whitney test, but I can't find any tutorials on how to perform this test with large sample sizes.

    Any suggestions?

  2. #2
    Karabiner (TS Contributor)

    The U-test is not a test for means.

    You can use the Welch test, which is a t-test corrected
    for unequal variances. Since both samples are about
    the same size (I suppose), the Welch test and the ordinary
    t-test won't differ much.

    Whether the data (within groups!) are normally distributed
    or not doesn't matter for the t-test with a sample as
    large as yours.

    With kind regards

    K.
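
    As a rough illustration of the Welch test suggested above, here is a minimal sketch using only the Python standard library. The data are made up, the function name `welch_test` is hypothetical, and the p-value uses a normal approximation to the t distribution, which is an assumption that is only reasonable because the samples are large. With SciPy installed, `scipy.stats.ttest_ind(a, b, equal_var=False)` performs the same test with an exact t tail.

```python
import math
import random
from statistics import NormalDist, mean, variance


def welch_test(a, b):
    """Welch's unequal-variance t statistic with a large-sample normal p-value."""
    na, nb = len(a), len(b)
    va, vb = variance(a), variance(b)  # sample variances (n - 1 denominator)
    se2 = va / na + vb / nb            # squared standard error of the mean difference
    t = (mean(a) - mean(b)) / math.sqrt(se2)
    # Welch-Satterthwaite approximation to the degrees of freedom
    df = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    # Assumption: with df in the thousands the t distribution is very close
    # to the standard normal, so a normal tail replaces the exact t tail.
    p = 2 * (1 - NormalDist().cdf(abs(t)))
    return t, df, p


# Made-up, skewed data standing in for the two sets of decibel readings.
rng = random.Random(0)
quiet = [50 + rng.expovariate(1 / 4) for _ in range(1200)]
event = [52 + rng.expovariate(1 / 8) for _ in range(1100)]

t, df, p = welch_test(event, quiet)
print(f"t = {t:.2f}, df = {df:.0f}, p = {p:.3g}")
```

    The two groups are deliberately given different sizes and spreads so that the Welch correction has something to do; with equal group sizes the statistic coincides with the ordinary t statistic.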

  3. #3

    Quote (originally posted by Karabiner):
    The U-test is not a test for means. [...]

    The U-test does show a difference between groups, correct?

    Once the values are ranked, does it show the difference in medians between the two groups?

  4. #4
    Karabiner (TS Contributor)

    Quote: Once you rank them, it shows the difference in medians between the two groups?

    Hopefully so, but not necessarily. The U-test is not a test
    for medians. It only tells us whether ranks in one group
    tend to be higher than in the other group. The description
    of the Wilcoxon rank-sum test (which gives exactly the same
    result as the U-test) may be a bit more illustrative than
    the description of the Mann-Whitney test.

    If you want to test the means, then with n > 2000 you
    can perform a t-test even if the data (within groups) are
    non-normal. By the way, I suspect that a U-test can itself
    be affected by markedly different variances.

    With kind regards

    K.
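
    To make the rank-based description above concrete, and because the original question asks how to run the test on large samples, here is a minimal standard-library sketch of the U-test with the usual large-sample normal approximation (tie-corrected, no continuity correction). The function name `mann_whitney_u` and the data are hypothetical. SciPy's `scipy.stats.mannwhitneyu` is the ready-made alternative and may give slightly different p-values because of its continuity correction.

```python
import math
import random
from statistics import NormalDist


def mann_whitney_u(a, b):
    """Two-sided U-test via the large-sample normal approximation.

    Returns (U for sample a, z, p, U / (na * nb)).
    Not handled in this sketch: the degenerate case where all values are tied.
    """
    na, nb = len(a), len(b)
    n = na + nb
    # Pool the data, remembering which group each value came from.
    pooled = sorted([(x, 0) for x in a] + [(x, 1) for x in b])
    rank_sum_a = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        # Tied values occupy ranks i+1 .. j and share their average.
        avg_rank = (i + 1 + j) / 2
        count_a = sum(1 for k in range(i, j) if pooled[k][1] == 0)
        rank_sum_a += avg_rank * count_a
        size = j - i
        tie_term += size ** 3 - size
        i = j
    u_a = rank_sum_a - na * (na + 1) / 2
    mu = na * nb / 2
    sigma = math.sqrt(na * nb / 12 * ((n + 1) - tie_term / (n * (n - 1))))
    z = (u_a - mu) / sigma
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return u_a, z, p, u_a / (na * nb)


# Made-up, skewed data standing in for the two sets of decibel readings.
rng = random.Random(0)
quiet = [50 + rng.expovariate(1 / 4) for _ in range(1200)]
event = [52 + rng.expovariate(1 / 8) for _ in range(1100)]

u, z, p, prob = mann_whitney_u(event, quiet)
print(f"U = {u:.0f}, z = {z:.2f}, p = {p:.3g}, U/(n1*n2) = {prob:.3f}")
```

    The last value, U divided by the number of pairs, estimates the probability that a randomly chosen event reading exceeds a randomly chosen non-event reading (counting ties as one half). That is the quantity the test is about, which is why it is neither a test of means nor of medians.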

  5. #5
    Miner (TS Contributor)

    The decibel scale is logarithmic, which is why the distributions are non-normal and heteroskedastic. You should be able to transform the results and then apply a standard test.

  6. #6
    Karabiner (TS Contributor)

    But since the scale is already logarithmic, and the measurements are still
    non-normal and heteroscedastic, what should usually be done then?

    With kind regards

    K.

  7. #7
    Miner (TS Contributor)

    I would try taking the antilog (inverse log) of the data, i.e. converting the decibel readings back to a linear scale.
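
    A minimal sketch of that back-transformation, assuming the readings are sound levels for which a 10 dB step corresponds to a factor of 10 in power (intensity), so the linear value is 10 ** (L / 10). The function names and the readings are hypothetical, and the thread does not settle whether the back-transformed values are any closer to normal.

```python
import math


def db_to_linear(level_db):
    """Convert a level in dB to a linear power (intensity) ratio."""
    return 10 ** (level_db / 10)


def energy_mean_db(levels_db):
    """Average on the linear scale, then express the result in dB again."""
    linear = [db_to_linear(x) for x in levels_db]
    return 10 * math.log10(sum(linear) / len(linear))


readings = [50.0, 60.0, 70.0]  # made-up decibel values

print("arithmetic mean of dB values:", sum(readings) / len(readings))
print("mean taken on the linear scale, in dB:", round(energy_mean_db(readings), 2))
```

    The two averages differ because the loudest readings dominate on the linear scale, so it matters which scale the "difference between means" is meant to refer to before choosing a test.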
