+ Reply to Thread
Results 1 to 5 of 5

Thread: Missing Values with Count Data

  1. #1
    Points: 1,070, Level: 17
    Level completed: 70%, Points required for next Level: 30

    Posts
    5
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Missing Values with Count Data




    So, I have a question that I cannot figure out. I've looked in the literature and can't really find a solution. Maybe I'm looking in the wrong place, but could use some direction.

    I have an outcome variable that is a count (i.e. # of deaths). There is missing data that I would like to address. However, the solutions I typically use (maximum likelihood or multiple imputation) produce values that are not an integer. A negative binomial regression requires an integer.

    Is there a technique that anyone is aware of that I can use to replace missing values that will result in an integer? As a long shot, would it be acceptable to round the non-integers to integers?

    Thanks!

    Dave

  2. #2
    Devorador de queso
    Points: 95,940, Level: 100
    Level completed: 0%, Points required for next Level: 0
    Awards:
    Posting AwardCommunity AwardDiscussion EnderFrequent Poster
    Dason's Avatar
    Location
    Tampa, FL
    Posts
    12,937
    Thanks
    307
    Thanked 2,630 Times in 2,246 Posts

    Re: Missing Values with Count Data

    If you're getting non-integer values when doing maximum likelihood imputation my guess is that you're assuming normally distributed data. Change the likelihood to the proper distribution and ML should give you reasonable values.
    I don't have emotions and sometimes that makes me very sad.

  3. #3
    Points: 1,070, Level: 17
    Level completed: 70%, Points required for next Level: 30

    Posts
    5
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Re: Missing Values with Count Data

    Thanks! Unfortunately I am not familiar with missing value analysis for non-normal data. I use SPSS V. 22 (the only one my organization buys!). The options for EM are "normal", "mixed normal", and "student's t". Mixed normal has a field for mixture proportion and standard deviation ratio. Stduents t has a field for degrees of freedom.

    I appreciate you taking the time to respond and don't expect you to walk me through how to make the proper adjustment. However, are you able to point me to a source that can assist me with making these adjustments? Thanks!

  4. #4
    Points: 1,070, Level: 17
    Level completed: 70%, Points required for next Level: 30

    Posts
    5
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Re: Missing Values with Count Data

    Bump. Does anyone have a solution?

  5. #5
    Human
    Points: 12,686, Level: 73
    Level completed: 59%, Points required for next Level: 164
    Awards:
    Master Tagger
    GretaGarbo's Avatar
    Posts
    1,363
    Thanks
    455
    Thanked 462 Times in 402 Posts

    Re: Missing Values with Count Data


    Round the value to the closest integer!

    If you have lots of values and few missing, then that does not matter. If you have few values and many missing then you have probably other strange selection difficulties.

    But I would not be surprised if the EM algorithm calculated an expected value as a non-integer for a discrete distribution.

+ Reply to Thread

           




Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts






Advertise on Talk Stats