Hello,
So I have recently recieved a whole slide worth of Paired-End SOLID ChIP-seq data, and have encountered problems at the earliest of steps. I have tried to use BOWTIE to align these reads to the hg19 genome, but it is going extremely slow. Upon further inspection it seems that there are a number of reads with "no-calls" or otherwise poor quality values that may or may not be "gumming up" the Bowtie aligner... the question:
Does anyone have suggestions on how to pre-process the data to remove reads that are VERY unlikely to give unique alignments prior to mapping to a reference genome? I'm thinking like... anything with average quality values below 8 gets thrown out...
Any help in how to do this, or aligners that may do this or something similar would be great.
Many thanks,
Bryan
So I have recently recieved a whole slide worth of Paired-End SOLID ChIP-seq data, and have encountered problems at the earliest of steps. I have tried to use BOWTIE to align these reads to the hg19 genome, but it is going extremely slow. Upon further inspection it seems that there are a number of reads with "no-calls" or otherwise poor quality values that may or may not be "gumming up" the Bowtie aligner... the question:
Does anyone have suggestions on how to pre-process the data to remove reads that are VERY unlikely to give unique alignments prior to mapping to a reference genome? I'm thinking like... anything with average quality values below 8 gets thrown out...
Any help in how to do this, or aligners that may do this or something similar would be great.
Many thanks,
Bryan
Comment