+ Reply to Thread
Results 1 to 6 of 6

Thread: SNP - Protein interaction

  1. #1
    Points: 3,056, Level: 34
    Level completed: 4%, Points required for next Level: 144

    Location
    Denmark
    Posts
    3
    Thanks
    0
    Thanked 0 Times in 0 Posts

    SNP - Protein interaction



    Hello there!

    I am a PhD student with a rather complex problem.
    Since my specialty is molecular biology and genetics, I am having a rather hard time with a statistical issue, and is hoping that the experts within this forum can help me out

    This is what I would like to do.
    I have several protein measurements from differnt individuals in a case-control study, and a SNP genotype panel. I would like to see if there is any association between SNP genotype and protein levels, between cases and controls. This would be quite simple if my protein measurements was normal distributed, but unfortunately they arent, and they cant be manipulated to be normalized. Does anyone in here have an idea on how to proceed. I would rather not "group" the protein levels...

    Hope you can help me out.

    Thanks!
    Mads

  2. #2
    R purist
    Points: 13,351, Level: 75
    Level completed: 26%, Points required for next Level: 299
    TheEcologist's Avatar
    Location
    The Netherlands.
    Posts
    1,282
    Thanks
    112
    Thanked 249 Times in 125 Posts
    Now I'm not quite sure how to help you (or if I can), I'll need a data description first.

    How are your SNiP's quantified?
    How are your protein levels quantified?

    Categorical, ordinal, continuous ect ect

    Secondly how is your response distributed then?

    Quote Originally Posted by mvh View Post
    This would be quite simple if my protein measurements was normal distributed, but unfortunately they arent
    What would you do if your protein measurements were normally distributed?

    It of course depends on all of the above, but if you were thinking of running an regression or ANOVA type analysis remember that only your residuals must be normally distributed. The error around the model should be normal, the response itself not (necessarily).
    The true ideals of great philosophies always seem to get lost somewhere along the road..

  3. #3
    Points: 3,056, Level: 34
    Level completed: 4%, Points required for next Level: 144

    Location
    Denmark
    Posts
    3
    Thanks
    0
    Thanked 0 Times in 0 Posts
    How are your SNiP's quantified?
    The SNP are genotyped and presented as AA or AB (heterozygote) or BB, so catagorically

    How are your protein levels quantified?
    The protein levels is continous, and unfortunately the residuals is not normally distributed, they seems more "randomly distributed" (Please forgive my lack of statistical slang or technical words)

    My first thought was to use ANOVA to do the analysis, but since the lack of normally distribution I couldnt.

    I could chose to set some criterias for the protein concentration measurements, and analyse it in concentration groupings, but I would rather not.

    I really hope you can help me out, and appreciate your fast response!

  4. #4
    R purist
    Points: 13,351, Level: 75
    Level completed: 26%, Points required for next Level: 299
    TheEcologist's Avatar
    Location
    The Netherlands.
    Posts
    1,282
    Thanks
    112
    Thanked 249 Times in 125 Posts
    The non-parametric alternative to an ANOVA is the Kruskal-Wallis rank sum test.
    You can try it, it is less powerfull though.

    Also is it possible to post boxplots of your data per case or do you have multiple cases?
    The true ideals of great philosophies always seem to get lost somewhere along the road..

  5. #5
    Points: 3,056, Level: 34
    Level completed: 4%, Points required for next Level: 144

    Location
    Denmark
    Posts
    3
    Thanks
    0
    Thanked 0 Times in 0 Posts
    The non-parametric alternative to an ANOVA is the Kruskal-Wallis rank sum test.
    You can try it, it is less powerfull though.
    I already done KW testing to see if there was a direct correlation between genotype and protein concentration, unfortunately I cant implement case/control status into this analysis, and thats what I am looking for

    Also is it possible to post boxplots of your data per case or do you have multiple cases
    I am not sure what you mean, I have several controls and cases

  6. #6
    R purist
    Points: 13,351, Level: 75
    Level completed: 26%, Points required for next Level: 299
    TheEcologist's Avatar
    Location
    The Netherlands.
    Posts
    1,282
    Thanks
    112
    Thanked 249 Times in 125 Posts

    Oke so it indeed seems that you have gotten yourself a complex problem

    You know I feel like I’m still guessing here as I still don’t know the details of your study (how many cases ect). This type of Complex problems will require more info.

    But in light of what I do know and what I suspect you want to do: I would start out with a modelling study.

    (Maybe you have already done this but) Look at boxplots per case/control situation see if there is a difference or a detectable trend (something that might tell you how the relationship is ergo what potential model candidates are). Secondly try to figure out how the error if this model would look like ( we know its not normal but could it be then e.g. poisson or gamma distributed or ….).
    Once you have a better idea about this then look into GLM’s (generalized linear models), secondly maybe GAMS (http://www.statsoft.com/textbook/stgam.html). I recommend borrowing Statistics: An Introduction using R by Michael J. Crawley (it’s the most comprehensible description of GLM’s I’ve seen so far).
    The true ideals of great philosophies always seem to get lost somewhere along the road..

+ Reply to Thread

Similar Threads

  1. Possible Interaction?
    By jamesmartinn in forum Biostatistics
    Replies: 2
    Last Post: 07-25-2010, 10:22 PM
  2. Replies: 1
    Last Post: 11-11-2009, 07:52 AM
  3. Replies: 0
    Last Post: 05-03-2009, 09:31 AM
  4. three way interaction!!
    By angelaabd in forum Statistics
    Replies: 2
    Last Post: 03-30-2009, 03:36 PM
  5. interaction
    By Erida in forum Statistics
    Replies: 3
    Last Post: 12-04-2005, 04:59 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts








Advertise on Talk Stats