Thread: Which analysis represents better the "worse-ness" of my linear regression?

  1. #1

    Hello all,

    If I have two sets of data points, for instance:

    1 10
    2 14
    3 17
    4 12
    5 19

    and

    1 10
    2 14
    3 17
    4 12
    100 112

    Judged by R^2, the linear regression is poor for the first set and good for the second, yet the second line is still a bad fit. I was told that the standard deviation (standard error) of the slope could help, on the reasoning that the second data set seemingly leaves more leeway, so the first 4 points are less well represented by the regression. However, this turns out not to be the case.

    Is there a parameter or a statistical analysis of the data set that can evaluate the quality of the regression?
    I can look at a plot of the residuals, but I am sure there is something more that can help me...
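    For reference, the two quantities mentioned above (R^2 and the standard error of the slope) can be computed directly from the two data sets. This is only a minimal sketch assuming ordinary least squares with an intercept; the helper name `fit_summary` is made up for illustration and is not from any library.

```python
import numpy as np

def fit_summary(x, y):
    """Ordinary least squares y = intercept + slope * x, with R^2 and slope SE."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    sxx = np.sum((x - x.mean()) ** 2)
    slope = np.sum((x - x.mean()) * (y - y.mean())) / sxx
    intercept = y.mean() - slope * x.mean()
    resid = y - (intercept + slope * x)
    sse = np.sum(resid ** 2)              # residual sum of squares
    sst = np.sum((y - y.mean()) ** 2)     # total sum of squares
    return {
        "slope": slope,
        "intercept": intercept,
        "r2": 1.0 - sse / sst,
        # standard error of the slope: sqrt(residual variance / Sxx)
        "slope_se": np.sqrt(sse / (n - 2) / sxx),
    }

# The two data sets from the post
summary_1 = fit_summary([1, 2, 3, 4, 5], [10, 14, 17, 12, 19])
summary_2 = fit_summary([1, 2, 3, 4, 100], [10, 14, 17, 12, 112])

for name, s in (("set 1", summary_1), ("set 2", summary_2)):
    print(name, {k: round(float(v), 4) for k, v in s.items()})
```

    Both numbers depend on the spread of X (through Sxx), so moving one point far out in X changes them even when the other four points stay the same.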

    Thank you very much in advance,
    Alex.

  2. #2
    hlsmith

    Mean squared error (MSE) is usually a useful measure. Your example seems to imply that there may be outliers. If so, you can look at the leverage and influence of individual observations. You can also identify values with potential issues, remove them, and re-examine the fit. It also helps to plot your data and the best-fit line if you have 2-dimensional or 3-dimensional data. Ultimately, it comes down to what your question is.
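    As a rough illustration of these suggestions, the sketch below computes the MSE, the leverage of each observation (the diagonal of the hat matrix), and Cook's distance as one common influence measure, then refits after dropping the highest-leverage point. It assumes ordinary least squares with an intercept; Cook's distance is just one choice of influence measure, and the function name `diagnostics` is invented for this example.

```python
import numpy as np

def diagnostics(x, y):
    """Return (coefficients, MSE, leverage, Cook's distance) for a straight-line fit."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    X = np.column_stack([np.ones(n), x])          # design matrix: intercept, slope
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)  # beta[0] = intercept, beta[1] = slope
    resid = y - X @ beta
    p = X.shape[1]                                # number of fitted parameters
    mse = resid @ resid / (n - p)
    hat = X @ np.linalg.inv(X.T @ X) @ X.T        # hat (projection) matrix
    leverage = np.diag(hat)
    cooks = resid ** 2 / (p * mse) * leverage / (1.0 - leverage) ** 2
    return beta, mse, leverage, cooks

x1, y1 = [1, 2, 3, 4, 5], [10, 14, 17, 12, 19]
x2, y2 = [1, 2, 3, 4, 100], [10, 14, 17, 12, 112]

beta1, mse1, lev1, cooks1 = diagnostics(x1, y1)
beta2, mse2, lev2, cooks2 = diagnostics(x2, y2)
print("set 1 leverage:", np.round(lev1, 3))
print("set 2 leverage:", np.round(lev2, 3))
print("set 2 Cook's distance:", np.round(cooks2, 3))

# Remove the highest-leverage observation from set 2 and examine the fit again
keep = np.arange(len(x2)) != np.argmax(lev2)
beta2_dropped, mse2_dropped, _, _ = diagnostics(np.array(x2)[keep], np.array(y2)[keep])
print("set 2 refit without that point (intercept, slope):", np.round(beta2_dropped, 3))
```

    A leverage close to 1 means the fitted line is forced almost exactly through that observation, so its own residual says little about how well the line describes the rest of the data.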

  3. #3

    Thanks for the reply,
    I see what you mean, but I am not talking about outliers. In my example, I deliberately widened the interval between the data points to get a "better" linear fit with a higher R^2. However, doing so reduces the weight of every point with a lower X value, so those points are represented badly. Thus, if I use the fits from the first and second data sets to predict values in the low range of X (in this case 1, 2, 3, 4...), the second fit will give a worse prediction than the first.
    I want a mathematical or statistical parameter or value that can actually show that this is the case: that the second fit is worse than the first.
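    One direct way to put a number on this is to score each fitted line only on the low-X points the two data sets share, for example with the root-mean-square error over X = 1, 2, 3, 4. The sketch below does that; the restriction to those four points and the helper name `rmse_on` are assumptions made for illustration, and the code does not presuppose which fit comes out ahead.

```python
import numpy as np

# Points shared by both data sets (the low range of X)
x_low = np.array([1.0, 2.0, 3.0, 4.0])
y_low = np.array([10.0, 14.0, 17.0, 12.0])

# Full data sets from the first post
x1, y1 = np.append(x_low, 5.0), np.append(y_low, 19.0)
x2, y2 = np.append(x_low, 100.0), np.append(y_low, 112.0)

def rmse_on(x, y, slope, intercept):
    """Root-mean-square error of the line y = intercept + slope * x on (x, y)."""
    return float(np.sqrt(np.mean((y - (intercept + slope * x)) ** 2)))

# np.polyfit with degree 1 returns the coefficients as [slope, intercept]
slope1, intercept1 = np.polyfit(x1, y1, 1)
slope2, intercept2 = np.polyfit(x2, y2, 1)

rmse_low_1 = rmse_on(x_low, y_low, slope1, intercept1)
rmse_low_2 = rmse_on(x_low, y_low, slope2, intercept2)

print("fit 1, RMSE on X = 1..4:", round(rmse_low_1, 4))
print("fit 2, RMSE on X = 1..4:", round(rmse_low_2, 4))
```

    The same comparison can be made for any other range of X by changing `x_low` and `y_low` to the points of interest.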

    Thanks again,
    Alex.
