Unconfigured Ad

**tahamasoodi** · 10-14-2012, 11:25 AM

There seems to be no answers for my query!

**dpryan** · 10-14-2012, 12:59 PM

Contamination, really small genome with a high sequencing depth... There are a number of possibilities, you could probably get more if you provided more details. Whether your data is useless or not will likely depend on the nature of the samples and what you intend to do with the data.

**fkrueger** · 10-14-2012, 02:24 PM

Here is a nice blog post about interpreting the duplication plot of FastQC.

**tahamasoodi** · 10-14-2012, 09:46 PM

Thanks dpryan,
I have around 100 cancer samples with equal number of controls for which we are doing WGS. Till now we have completed around 16 samples but when I started analysing them, I get a high level of duplication and in some samples the base quality is also not good.

**tahamasoodi** · 10-31-2012, 12:17 AM

Can anyone throw more light on this?

**tahamasoodi** · 11-03-2012, 07:51 AM

Can I use MarkDuplicates command of Picard for removing the duplicate sequences?

**kopi-o** · 11-03-2012, 10:05 AM

Yes, you can.

**tahamasoodi** · 11-03-2012, 10:07 AM

Will it remove all the duplicate sequences from the fastq file?

**kopi-o** · 11-03-2012, 10:31 AM

It will if you supply REMOVE_DUPLICATES=true (http://picard.sourceforge.net/comman...MarkDuplicates). Otherwise it will just flag them as duplicates in the output file.

**tahamasoodi** · 11-03-2012, 10:55 AM

I have a few samples with over 80% duplication (detected by FASTQC), will picard work for these samples?

**kopi-o** · 11-03-2012, 11:35 AM

I don't see why it wouldn't. By the way, it seems the numbers you get from FastQC usually overstate the duplication you detect with Picard.

Topics	Statistics	Last Post
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, Today, 11:10 AM	0 responses 6 views 0 reactions	Last Post by SEQadmin2 Today, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 42 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 102 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 124 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM

Unconfigured Ad

Sequence Duplication

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News