Hi,
I'm analyzing some Illumina data (HiSeq) using Bioconductor and am curious to look a little bit more at the unmapped reads.
We're interested in some segmentally duplicated regions, so I'm curious to see whether I can use Illumina's mapping codes to see which of the unmapped reads didn't match anything at all, and which mapped to multiple locations.
Can anyone point me towards documentation on what the ELAND mapping codes mean for reads that didn't map to a chromosome? (e.g. NM, 255:255:255, 1:71:255, 0:1:0 etc). I suspect there's some useful information there but I don't know what it means.
I'm fairly new to next-gen sequence analysis - apologies if there's an obvious place to find this that I'm ignoring.
thanks very much,
Janet Young
Fred Hutchinson Cancer Research Center,
Seattle, WA, USA
I'm analyzing some Illumina data (HiSeq) using Bioconductor and am curious to look a little bit more at the unmapped reads.
We're interested in some segmentally duplicated regions, so I'm curious to see whether I can use Illumina's mapping codes to see which of the unmapped reads didn't match anything at all, and which mapped to multiple locations.
Can anyone point me towards documentation on what the ELAND mapping codes mean for reads that didn't map to a chromosome? (e.g. NM, 255:255:255, 1:71:255, 0:1:0 etc). I suspect there's some useful information there but I don't know what it means.
I'm fairly new to next-gen sequence analysis - apologies if there's an obvious place to find this that I'm ignoring.
thanks very much,
Janet Young
Fred Hutchinson Cancer Research Center,
Seattle, WA, USA