Seqanswers Leaderboard Ad

**nucacidhunter** · 05-27-2017, 07:15 PM

I agree with you that with deeper sequencing %duplicate should increase and read length differences are less likely to be the cause as FastQC uses initial 50 sequence of a subset of reads for duplicate calculation.

It would be helpful if you could post the whole FastQC report for both runs as other plots might give some clues about the cause.

**Jaeb** · 05-28-2017, 12:55 AM

Thanks for your fast reply. I did attach now the complete FastQC reports....

**nucacidhunter** · 05-28-2017, 03:21 AM

I think HS4000 reads contain lots of errors due to positional lower quality so the sequences of duplicates do not match and they are reported as unique reads. Also lots of reads seems to have very low quality over the whole length of read. If you trim or filter low quality reads you should get similar duplication rate for both runs.

**GenoMax** · 05-28-2017, 04:08 AM

I suggest that you run clumpify.sh from BBMap to get an exact idea of the duplication. You can allow for errors when doing the sequence match. FastQC does not look at the entire dataset for some of the modules (only a % of data is sampled).

Even though there is a thread for clumpify here the one over at Biostars has the directions clearly defined on one page.

Topics	Statistics	Last Post
Evaluating Genome Sequencing for ECMO Patients in the NICU by seqadmin Started by seqadmin, 12-17-2024, 10:28 AM	0 responses 33 views 0 likes	Last Post by seqadmin 12-17-2024, 10:28 AM
New Genetic Toolkit Refines Studies on Gene Function and Disease by seqadmin Started by seqadmin, 12-13-2024, 08:24 AM	0 responses 48 views 0 likes	Last Post by seqadmin 12-13-2024, 08:24 AM
Study Links Brain Mechanism to Emotional Responses in Animals and Humans by seqadmin Started by seqadmin, 12-12-2024, 07:41 AM	0 responses 34 views 0 likes	Last Post by seqadmin 12-12-2024, 07:41 AM
Study Identifies Ribosomal RNA Fingerprints as Early Cancer Biomarkers by seqadmin Started by seqadmin, 12-11-2024, 07:45 AM	0 responses 46 views 0 likes	Last Post by seqadmin 12-11-2024, 07:45 AM

Seqanswers Leaderboard Ad

Announcement

0% duplicates in RNA-Seq/Drop-seq library

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News