Hi Everyone:
I would be grateful if someone could take a quick look at these FASTQC results. This is exome-seq 100 bp paired-end data. For the most part, FASTQC results seem normal, except a strange distribution for the per sequence GC content graph. From the FASTQC manual, an unusual distribution seems to be suggestive of contamination and a shift in the curve is suggestive of a systematic bias. Could this systematic bias be due to exome enrichment?
Also there seems to be homopolymer AAAA and TTTT repeats as indicated by the KMER content warning? Can anyone speculate on the source of these homopolymers?
Thanks,
MC
I would be grateful if someone could take a quick look at these FASTQC results. This is exome-seq 100 bp paired-end data. For the most part, FASTQC results seem normal, except a strange distribution for the per sequence GC content graph. From the FASTQC manual, an unusual distribution seems to be suggestive of contamination and a shift in the curve is suggestive of a systematic bias. Could this systematic bias be due to exome enrichment?
Also there seems to be homopolymer AAAA and TTTT repeats as indicated by the KMER content warning? Can anyone speculate on the source of these homopolymers?
Thanks,
MC
Comment