+ Reply to Thread
Results 1 to 10 of 10

Thread: What to do about violations of assumptions

  1. #1
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    What to do about violations of assumptions




    This is one example of a larger topic

    Say you are doing logistic regression and you have influential data points. You can't use robust regression - so what do you use? Non-parametrics?

    Does cook's d even work with logistic regression?
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  2. #2
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: What to do about violations of assumptions

    It is easy to test for multicolinearity, but I have found no solution with broad support if you can not gather more data and don't want to combine multiple variables into one [neither are realistic solutions in my analysis].

    So how does one deal with MC?
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  3. #3
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: What to do about violations of assumptions

    While I am at it, do the rules of thumb for Cook's D apply to predictor (covariate) values for logistic regression, or are these rules different for logistic and linear regression. So far I have found little on this [what I have found leads me to suspect the rules of thumb are the same for the two approaches].
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  4. #4
    Points: 20,006, Level: 89
    Level completed: 32%, Points required for next Level: 344

    Posts
    568
    Thanks
    50
    Thanked 20 Times in 19 Posts

    Re: What to do about violations of assumptions

    IIRC, I think Cook's D is based off the Hat Matrix, which involves only the X's (covariates). In this case, you can run a regular OLS regression with your variables and inspect them as usual.

  5. The Following User Says Thank You to jamesmartinn For This Useful Post:

    noetsi (01-25-2016)

  6. #5
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: What to do about violations of assumptions

    That makes sense although some comments on logistic regression I read suggested different rules of thumb than I saw for linear regression. This may simply be different authors with their own experiences leading them to different rules...

    This is certainly the case with Hosmer and Lemeshow who base their recommendations on their own experience with logistic regression.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  7. #6
    Omega Contributor
    Points: 38,253, Level: 100
    Level completed: 0%, Points required for next Level: 0
    hlsmith's Avatar
    Location
    Not Ames, IA
    Posts
    6,989
    Thanks
    397
    Thanked 1,185 Times in 1,146 Posts

    Re: What to do about violations of assumptions

    Multicolinearity can be addressed by doing nothing (), dropping a variable, or creating a new variable (e.g, construct based).
    Stop cowardice, ban guns!

  8. #7
    Devorador de queso
    Points: 95,540, Level: 100
    Level completed: 0%, Points required for next Level: 0
    Awards:
    Posting AwardCommunity AwardDiscussion EnderFrequent Poster
    Dason's Avatar
    Location
    Tampa, FL
    Posts
    12,930
    Thanks
    307
    Thanked 2,629 Times in 2,245 Posts

    Re: What to do about violations of assumptions

    Depending on what your actual goal is multicollinearity isn't really much of an issue...
    I don't have emotions and sometimes that makes me very sad.

  9. #8
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: What to do about violations of assumptions

    Where MC comes up is if you are trying to get the unique impact of a predictor. And commonly I am. The problem with collapsing two variables together is that tells you what the collapsed variables do. Not what the individual variables do by themselves. I was wondering if partial regression plots get at this, but I am guessing not...

    I guess you could argue if MC makes it impossible to get at the unique impact of a variable creating solutions really distorts reality.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  10. #9
    Omega Contributor
    Points: 38,253, Level: 100
    Level completed: 0%, Points required for next Level: 0
    hlsmith's Avatar
    Location
    Not Ames, IA
    Posts
    6,989
    Thanks
    397
    Thanked 1,185 Times in 1,146 Posts

    Re: What to do about violations of assumptions

    You still get at the unique impact of the variables, either through odds ratios or partial R^2, but the standard errors get wonky. You know this.
    Stop cowardice, ban guns!

  11. #10
    Fortran must die
    Points: 58,790, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,532
    Thanks
    692
    Thanked 915 Times in 874 Posts

    Re: What to do about violations of assumptions


    But you can't test if those are statistically significant and, at least in my line of work that is pretty important. People care a lot about test of statistical significance. This is true in journals as well, at least in social science ones where few pay attention to specific effect size which is hard to interpret substantively. They want to know if this a real effect, that is statistically significant.

    In rarely look at odds ratios for linear regression actually, I use slopes.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

+ Reply to Thread

           




Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts






Advertise on Talk Stats