+ Reply to Thread
Results 1 to 5 of 5

Thread: Data Transformation in Multiple regression

  1. #1
    Points: 547, Level: 10
    Level completed: 94%, Points required for next Level: 3

    Posts
    3
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Data Transformation in Multiple regression



    Hi guys!
    I am trying to run a multiple regression. Some of the independent varibles need tranformation to achieve normal distribution. Some
    have to be log - transformed and others sqrt transformed.
    I would really like to know, if it is okay to make a model with differently transformed independent variables in the SAME model!?!?

  2. #2
    FormerlyKnownAsRaptor
    Points: 24,442, Level: 95
    Level completed: 10%, Points required for next Level: 908
    Awards:
    Activity Award
    trinker's Avatar
    Location
    Buffalo, NY
    Posts
    3,174
    Thanks
    883
    Thanked 552 Times in 500 Posts

    Re: Data Transformation in Multiple regression

    Yes this is ok.

    May I ask why you have to transform the IVs. They don't have to be normal. Did the diagnostic graphs from the model without transformation suggest the need for a trasnformation?
    "If you torture the data long enough it will eventually confess."
    -Ronald Harry Coase -

  3. The Following User Says Thank You to trinker For This Useful Post:

    Lantan (11-12-2011)

  4. #3
    R purist
    Points: 13,440, Level: 75
    Level completed: 48%, Points required for next Level: 210
    TheEcologist's Avatar
    Location
    The Netherlands.
    Posts
    1,287
    Thanks
    113
    Thanked 251 Times in 127 Posts

    Re: Data Transformation in Multiple regression

    Well interpretations gets a little tricky with all those transformation (each transformation does its own thing: e.g log transforms converts absolute differences into relative [as in percentage] differences). Additionally you should look at the residuals of your model as they should be normal - not necessarily each of your variables separately.

    Another option is maybe to use a GLM analysis as these types of regression analyses relax the assumptions of normality.
    So first check if you models' residuals are normal, then move towards transformations if you must but its better to use a GLM if it is more suited. No way we can tell with the information you have given though!
    The true ideals of great philosophies always seem to get lost somewhere along the road..

  5. The Following User Says Thank You to TheEcologist For This Useful Post:

    Lantan (11-12-2011)

  6. #4
    Points: 547, Level: 10
    Level completed: 94%, Points required for next Level: 3

    Posts
    3
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Re: Data Transformation in Multiple regression

    Quote Originally Posted by trinker View Post
    Yes this is ok.

    May I ask why you have to transform the IVs. They don't have to be normal. Did the diagnostic graphs from the model without transformation suggest the need for a trasnformation?
    Thank you very much for your reply!
    The residuals look crapy with untransformed data. Thats why I decieded to tranform them, searching for the best transformation I could not remember if it was okay to do different transformations.
    Thanks again

  7. #5
    Points: 547, Level: 10
    Level completed: 94%, Points required for next Level: 3

    Posts
    3
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Re: Data Transformation in Multiple regression


    Quote Originally Posted by TheEcologist View Post
    Well interpretations gets a little tricky with all those transformation (each transformation does its own thing: e.g log transforms converts absolute differences into relative [as in percentage] differences). Additionally you should look at the residuals of your model as they should be normal - not necessarily each of your variables separately.

    Another option is maybe to use a GLM analysis as these types of regression analyses relax the assumptions of normality.
    So first check if you models' residuals are normal, then move towards transformations if you must but its better to use a GLM if it is more suited. No way we can tell with the information you have given though!
    Thank you very much too for your anwser!!!
    I tried glm as well. But the Akaike Information Criterion told me that the linear model was better. But now I ask myself, if it is possible to compare
    lm and glm with AIC? Do you happen to know that?

+ Reply to Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts








Advertise on Talk Stats