High GC content and PCR duplicate

ttnguyen

Member

Join Date: Mar 2010

Posts: 41
- Share
- Tweet
#1

High GC content and PCR duplicate

03-07-2011, 11:35 AM

Dear All,

Our lab used ChIP-seq to study a histone variant that is expected to occurs everywhere in the human genome. I found two problems with our ChIP-seq dataset but could not figure out why they happened.

- I used Picard to mark duplicates and found that the duplicate percentage is 66%. I think this is so high. Do you know what is acceptable duplicate level in ChIP-seq data?

- The GC content of our dataset is 56% - much higher than the GC content of the reference genome. However, this is not explained by the duplication problem since the GC content of this dataset after removing duplicates does not decrease. I saw a post saying that Illumina prefer to sequencing higher GC content region. I wonder if Illumina have already fixed this bias?

I would very much appreciate if you could give some possible reasons for these two problems.

Many thanks,

Nguyen
Tags: None

Previous template Next

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 31 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad