Hi
I have been processing some 150bp paired end Nextera XT reads from viral cDNA. The first 20 or so bases at the 5'end are due to Nextera which I just trim off, but I also have an unusual base content at the 3'end of all sequences, consistently in all samples. I have attached the fastqc images for the forward reads of one sample, which is representative of all my samples. Does anyone know what this might be?
I have been processing some 150bp paired end Nextera XT reads from viral cDNA. The first 20 or so bases at the 5'end are due to Nextera which I just trim off, but I also have an unusual base content at the 3'end of all sequences, consistently in all samples. I have attached the fastqc images for the forward reads of one sample, which is representative of all my samples. Does anyone know what this might be?
. The average size of my libraries is usually >350bp, which I assumed was ok for 150bp PE reads? Looking in Tablet there is a large degree of overlap between paired reads- is this indicative of the library being too short? I am mapping the reads to a reference sequence and looking for SNPs so I am not worried about overlap in the read pairs. How would this affect the 3'end of my reads?
Comment