it is more randomly spread through the genome.
So is there any way to replace the undetermined values with a number that won't skew the data? I was thinking of some method for handling censored data, maybe maximum likelihood estimation (MLE) or something like that. I'm not experienced enough with stats to do this on my own. Thanks.