    Testing for overlapping window types in protein sequences

    Dear community members, I have a problem involving protein sequence data. I am working with the complete set of ~30K proteins for a species (its proteome).

    1. Each protein is a text string over a 20-letter alphabet with a certain length, and each protein has a different length across...