
Thread: which reference category for factor variables

  1. #1

    which reference category for factor variables




    Hello,
    I was wondering whether there is a systematic rationale for deciding which category of a factor variable (e.g. level of education: primary, secondary, tertiary) should be used as the reference category in multiple regression models.

  2. #2
    Dason

    Re: which reference category for factor variables

    Well, it doesn't *really* matter, since you can answer any question you have with any of the categories as the reference: the fitted model is the same, and only the comparisons that the coefficients report directly change. But if there is one level that you're most interested in comparing to the others, then choosing it as the reference might make things slightly easier for you. That said, it's also entirely possible to use a coding scheme other than reference/dummy coding: https://en.wikipedia.org/wiki/Catego...and_regression
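To make the point above concrete, here is a minimal sketch in plain Python with made-up data. The names `educ`, `age`, and `y`, the numbers, and the helper functions are all hypothetical and not from this thread. It fits the same model twice, once with primary and once with tertiary as the reference, then checks that the fitted values agree and that the secondary-vs-tertiary difference can be recovered from either fit.

```python
# Toy data, invented purely for illustration.
levels = ["primary", "secondary", "tertiary"]
educ = ["primary"] * 3 + ["secondary"] * 3 + ["tertiary"] * 3
age = [25.0, 40.0, 33.0, 51.0, 29.0, 46.0, 38.0, 55.0, 31.0]
y = [2.1, 3.4, 3.0, 5.2, 4.6, 6.1, 7.9, 9.3, 8.0]


def dummy_design(educ, covariate, reference, levels):
    """Rows of [1, one 0/1 dummy per non-reference level, covariate]."""
    others = [lv for lv in levels if lv != reference]
    rows = [[1.0] + [1.0 if e == lv else 0.0 for lv in others] + [x]
            for e, x in zip(educ, covariate)]
    return rows, ["intercept"] + others + ["age"]


def ols(X, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    p = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in X) for j in range(p)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def fit(reference):
    """Fit y ~ educ + age with the given reference level."""
    X, names = dummy_design(educ, age, reference, levels)
    beta = ols(X, y)
    fitted = [sum(b * v for b, v in zip(beta, row)) for row in X]
    return dict(zip(names, beta)), fitted


coef_primary, fitted_primary = fit("primary")
coef_tertiary, fitted_tertiary = fit("tertiary")

# Largest disagreement between the two sets of fitted values.
max_gap = max(abs(a - b) for a, b in zip(fitted_primary, fitted_tertiary))

# Secondary-vs-tertiary difference, read off directly and derived indirectly.
direct = coef_tertiary["secondary"]
derived = coef_primary["secondary"] - coef_primary["tertiary"]

print("max difference in fitted values:", max_gap)
print("secondary vs tertiary (direct): ", direct)
print("secondary vs tertiary (derived):", derived)
```

In practice a statistics package would do the fitting; the hand-rolled solver is only there to keep the sketch self-contained. What changes with the reference is which comparisons show up directly as coefficients (with their own standard errors and p-values), not the model itself.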

  3. The Following User Says Thank You to Dason For This Useful Post:

    hlsmith (12-30-2015)

  4. #3

    Re: which reference category for factor variables

    Thanks for your reply Dason.
    I would like to give an actual example that made me wonder about the choice of reference category.
    I ran a linear regression model that includes two factor variables as IVs. One of them measures respondents' level of education (primary, secondary, tertiary).
    In a pre-regression step, I ran correlation analyses, which indicate that primary school correlates negatively with the DV and tertiary education correlates positively with the DV. Secondary school does not correlate significantly with the DV.
    Hence I was wondering which of the three categories is best used as the reference category in the multiple regression model. I thought of using secondary school, since this category is not significantly related to the DV.
    Does that make sense?
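A small sketch may help with reading those correlations. It assumes (this is not stated in the post) that each correlation was computed between the DV and a 0/1 indicator for one category, and it uses invented, perfectly balanced data with a single factor and no other IVs. Under those assumptions, each indicator's correlation compares that category with all the others pooled, so a near-zero correlation for secondary only says its mean sits near the overall mean. With secondary as the reference, the regression coefficients are the primary-minus-secondary and tertiary-minus-secondary differences in group means.

```python
from math import sqrt

# Toy data, invented for illustration: three balanced education groups.
levels = ["primary", "secondary", "tertiary"]
educ = ["primary"] * 3 + ["secondary"] * 3 + ["tertiary"] * 3
y = [2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]


def mean(v):
    return sum(v) / len(v)


def pearson(a, b):
    """Pearson correlation between two equal-length lists."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (v - mb) for x, v in zip(a, b))
    return cov / sqrt(sum((x - ma) ** 2 for x in a)
                      * sum((v - mb) ** 2 for v in b))


# Correlation of each 0/1 category indicator with the outcome.
indicator_corr = {
    lv: pearson([1.0 if e == lv else 0.0 for e in educ], y) for lv in levels
}

# In a model with only this factor, the dummy coefficients are differences
# between each group's mean and the reference group's mean.
group_mean = {
    lv: mean([yi for e, yi in zip(educ, y) if e == lv]) for lv in levels
}
coef_vs_secondary = {
    lv: group_mean[lv] - group_mean["secondary"]
    for lv in levels if lv != "secondary"
}

for lv in levels:
    print(lv, "indicator correlation:", round(indicator_corr[lv], 3))
print("coefficients with secondary as reference:", coef_vs_secondary)
```

In this toy setup the secondary indicator is uncorrelated with the outcome, yet both coefficients against the secondary reference are clearly nonzero. With additional IVs in the model the coefficients are adjusted differences rather than raw differences in means, so the numbers would differ, but the distinction between the two kinds of comparison remains.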

  5. #4
    hlsmith

    Re: which reference category for factor variables


    Dason, I like that wiki page.


    andik,


    What is your hypothesis? Once you have the output, you can run it each way and see which is easiest for your audience to understand. There is no right or wrong way; some choices just make more intuitive sense.
