So I thought it over and I have an idea but it seems too simple. What I'm thinking is that I can calculate a probability based on where the unknown protein lies in the list of known proteins. So for the above example, protein X falls between known proteins D and E so the probability that X may be modified is:
(# known proteins with lower % in A) / (Total known proteins)
= 1/5 = 0.2
Is this valid? Seems too easy





Reply With Quote