Hi All,
I recently got some 50nt SOLiD reads. The 53X deep dataset I have been focusing on is from exactly the same isolate as a reference genome. However, only 9% of the reads are aligning! and 1/3rd of the genome is uncovered. By trimming the reads I can get up to around 50% of the reads to align. Apart from this, which might be a problem with the sequencing?, my pileup's look unusual in the fact that all the reverse reads, instead of showing up as commas if they match, are showing up as a lower case letters, I have attached a picture of the tview showing it.
I first changed all -1's to 0's in the quality file, then indexed, ran BWA's solid2fastq.pl and aligned.
A messy way around this is to compliment all lower case letters, but I was wondering if this is meant to happen?, and if anyone knows of something that I'm doing wrong?
I haven't worked with SOLiD before so any advice would be useful.
Thanks,
Rhys
I recently got some 50nt SOLiD reads. The 53X deep dataset I have been focusing on is from exactly the same isolate as a reference genome. However, only 9% of the reads are aligning! and 1/3rd of the genome is uncovered. By trimming the reads I can get up to around 50% of the reads to align. Apart from this, which might be a problem with the sequencing?, my pileup's look unusual in the fact that all the reverse reads, instead of showing up as commas if they match, are showing up as a lower case letters, I have attached a picture of the tview showing it.
I first changed all -1's to 0's in the quality file, then indexed, ran BWA's solid2fastq.pl and aligned.
A messy way around this is to compliment all lower case letters, but I was wondering if this is meant to happen?, and if anyone knows of something that I'm doing wrong?
I haven't worked with SOLiD before so any advice would be useful.
Thanks,
Rhys
Comment