+ Reply to Thread
Results 1 to 7 of 7

Thread: Leverage analysis for groups of outliers

  1. #1
    Fortran must die
    Points: 62,031, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,538
    Thanks
    693
    Thanked 916 Times in 875 Posts

    Leverage analysis for groups of outliers




    There are a wide range of statistics that deal with leverage of individual data points (that is their impact on the regression line such as DFBETA). In my case I normally have thousands of points so its unlikely a single point is going to move the line much. But small groups of points might.

    So my question is, how do you determine (and is there anything like DFBETA or Cook's D] for a group of points that are outliers and might influence your analysis. And is there anything like robust regression that addresses the issue of multiple as compared to individual points.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  2. #2
    Omega Contributor
    Points: 39,045, Level: 100
    Level completed: 0%, Points required for next Level: 0
    hlsmith's Avatar
    Location
    Not Ames, IA
    Posts
    7,070
    Thanks
    402
    Thanked 1,192 Times in 1,153 Posts

    Re: Leverage analysis for groups of outliers

    If you have suspect values, you can just run the model with and without them to see if they "leverage" your results.
    Stop cowardice, ban guns!

  3. The Following User Says Thank You to hlsmith For This Useful Post:

    noetsi (05-09-2017)

  4. #3
    Fortran must die
    Points: 62,031, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,538
    Thanks
    693
    Thanked 916 Times in 875 Posts

    Re: Leverage analysis for groups of outliers

    Thanks. Believe it or not no one in any book I read suggested that They all talk about statistics like Cook's d or DFBETA.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  5. #4
    TS Contributor
    Points: 1,905, Level: 26
    Level completed: 5%, Points required for next Level: 95

    Posts
    253
    Thanks
    39
    Thanked 71 Times in 62 Posts

    Re: Leverage analysis for groups of outliers

    Quote Originally Posted by noetsi View Post
    Thanks. Believe it or not no one in any book I read suggested that They all talk about statistics like Cook's d or DFBETA.
    This is essentially how the DF beta is calculated, though. The model is run (n) separate times using (n-1) observations on each run to calculate the estimates leaving out the ith observation on each calculation until all (n) observations have been separately left out. The coefficient estimates are then compared to the estimate where all observations are included and DFbeta is calculated. Unless you're referring to using it for the group, then I haven't seen that done (where the group is omitted and a group DFBeta is calculated, although it wouldn't surprise me if it has been done). I would imagine careful selection of the group is required. Be sure to fully investigate each observation and have justification for including it in a particular group to omit.
    Last edited by ondansetron; 05-09-2017 at 08:31 PM.

  6. #5
    Fortran must die
    Points: 62,031, Level: 100
    Level completed: 0%, Points required for next Level: 0
    noetsi's Avatar
    Posts
    6,538
    Thanks
    693
    Thanked 916 Times in 875 Posts

    Re: Leverage analysis for groups of outliers

    I was referring to using these for a group. I run analysis on thousands of points and in theory and practice its hard for me to imagine that one point would move the regressive line much when you run 5000 plus points. I have never seen DFBETA applied to a set of outliers as compared to one. I did not think it had that functionality.
    "Very few theories have been abandoned because they were found to be invalid on the basis of empirical evidence...." Spanos, 1995

  7. #6
    TS Contributor
    Points: 1,905, Level: 26
    Level completed: 5%, Points required for next Level: 95

    Posts
    253
    Thanks
    39
    Thanked 71 Times in 62 Posts

    Re: Leverage analysis for groups of outliers

    Quote Originally Posted by noetsi View Post
    I was referring to using these for a group. I run analysis on thousands of points and in theory and practice its hard for me to imagine that one point would move the regressive line much when you run 5000 plus points. I have never seen DFBETA applied to a set of outliers as compared to one. I did not think it had that functionality.
    I can't say I've seen it done, either. However, I wonder if there are some papers where people figured out a way to group the observations and then calculate a group leverage statistic. As you said, it might not have that functionality, though.

  8. #7
    Omega Contributor
    Points: 39,045, Level: 100
    Level completed: 0%, Points required for next Level: 0
    hlsmith's Avatar
    Location
    Not Ames, IA
    Posts
    7,070
    Thanks
    402
    Thanked 1,192 Times in 1,153 Posts

    Re: Leverage analysis for groups of outliers


    I was just thinking something like say a Windsor trim job on residual related values. Not sure if there is a standardized test, but then you could look at relative changes in estimates or maybe AICC, etc.
    Stop cowardice, ban guns!

+ Reply to Thread

           




Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts






Advertise on Talk Stats