Hi
I recently downloaded the Compara.39_eutherian_mammals_EPO_LOW_COVERAGE.chr* maf (Multiple alignment format) files from ensembl.
I'm trying to process the files, and some of the software I'm trying to use (e.g. mafTools) is having issues with the files because the nucleotide sequence has '.' in it.
e.g. "AGTTCAACTCTC----TTAGAACAGTCTCACTGTGTGTGAACCAATATTGCAAGAAATCACACTCAGAAAACCCATTGCTGCAGAGTCAATTGG........" (NB. the '.' at the end are the '.' i'm talking about)
I'm a bit confused because I was expecting only nucleotides, or '-' or 'N'.
There are N's in the file so it doesnt make sense that '.' means missing.
Does anyone know what the '.' means?
Thanks in advance
I recently downloaded the Compara.39_eutherian_mammals_EPO_LOW_COVERAGE.chr* maf (Multiple alignment format) files from ensembl.
I'm trying to process the files, and some of the software I'm trying to use (e.g. mafTools) is having issues with the files because the nucleotide sequence has '.' in it.
e.g. "AGTTCAACTCTC----TTAGAACAGTCTCACTGTGTGTGAACCAATATTGCAAGAAATCACACTCAGAAAACCCATTGCTGCAGAGTCAATTGG........" (NB. the '.' at the end are the '.' i'm talking about)
I'm a bit confused because I was expecting only nucleotides, or '-' or 'N'.
There are N's in the file so it doesnt make sense that '.' means missing.
Does anyone know what the '.' means?
Thanks in advance