
**SNPsaurus** · 12-03-2013, 11:18 PM

Any difference in the insert size of the libraries? The GC bias in PCR amplification might be different depending on insert size.

**lovenlong** · 12-04-2013, 03:23 AM

Originally posted by SNPsaurus:

> Any difference in the insert size of the libraries? The GC bias in PCR amplification might be different depending on insert size.

The insert sizes of libraries for my samples were nearly the same:
G_1, 260±66;
G_2, 280±59;
M_1, 267±62;
M_2, 283±48.

Additionally, the samples themselves cannot be the reason for the GC% differences.
All pooled individuals were derived from one pure rice line that had been selfed for at least 20 generations, and all were checked with 24 SSR markers before being selected for pooling.

Thanks.

**GenoMax** · 12-04-2013, 04:13 AM

Have you asked your sequence provider if there was anything peculiar about the two sequencing runs as far as base signal intensities (I assume this is Illumina sequencing) were concerned?

**lovenlong** · 12-05-2013, 01:27 AM

Originally posted by GenoMax:

> Have you asked your sequence provider if there was anything peculiar about the two sequencing runs as far as base signal intensities (I assume this is Illumina sequencing) were concerned?

Hi,

I'm waiting for their reply now, but it seems they have not come across this kind of thing before.

I'm wondering whether a GC% discrepancy could also show up in the PhiX174 control across different HiSeq 2000 sequencing runs.
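One way to check this would be to compute the GC% of the reads from each run directly (for example, only the reads that map to PhiX). The following is a minimal sketch, assuming plain, unwrapped 4-line FASTQ records; the function name `gc_percent` and the file name in the usage comment are hypothetical, and N calls are left out of the denominator.

```python
from typing import Iterable


def gc_percent(fastq_lines: Iterable[str]) -> float:
    """Return the GC% over all called bases (A, C, G, T) in a FASTQ stream.

    Assumes each record is exactly four lines: header, sequence, '+', qualities.
    """
    gc = 0
    total = 0
    for i, line in enumerate(fastq_lines):
        if i % 4 != 1:  # the sequence is the 2nd line of each 4-line record
            continue
        seq = line.strip().upper()
        gc += seq.count("G") + seq.count("C")
        # Count only called bases so that N does not dilute the percentage.
        total += sum(seq.count(base) for base in "ACGT")
    if total == 0:
        raise ValueError("no called bases found")
    return 100.0 * gc / total


# Usage (hypothetical file name):
# with open("G_1_phix_reads.fastq") as handle:
#     print(f"{gc_percent(handle):.2f}")
```

Running this on the PhiX reads of each run and comparing the results with the 44.7% quoted below would show whether the shift is already present in the control.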

In the evaluation paper by Minoche et al. (Genome Biology 2011, 12:R112, doi:10.1186/gb-2011-12-11-r112), they found a higher GC% than expected in HiSeq datasets:
"The GC content of the unfiltered HiSeq reads was higher than expected: 40% for Bv + PhiX data and 45.5% for At + PhiX. The B. vulgaris reference sequence has a %GC of 35% [8] and that of the A. thaliana genome is 36% (calculated from TAIR10 [9]). The fraction of PhiX reads (44.7% GC) accounts for only 1 to 2% of the data. For the PhiX sample sequenced on the GAIIx the %GC of 45.7% is much closer to the expected value of 44.7%."

This looks really strange.
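A quick mixing calculation with the numbers from the quote suggests the PhiX spike-in alone cannot account for the shift. This is only a sketch: it assumes equal read lengths and no sequencing bias, so that the expected GC% is simply the read-weighted mean of the two genomes. The function name `expected_gc` is made up for illustration.

```python
def expected_gc(genome_gc: float, spike_gc: float, spike_fraction: float) -> float:
    """Read-weighted mean GC% of a genome plus spike-in mixture."""
    return (1.0 - spike_fraction) * genome_gc + spike_fraction * spike_gc


# Values quoted from Minoche et al.: B. vulgaris 35% GC, PhiX 44.7% GC,
# with PhiX reads making up 1 to 2% of the data.
for frac in (0.01, 0.02):
    gc = expected_gc(35.0, 44.7, frac)
    print(f"PhiX fraction {frac:.0%}: expected GC = {gc:.2f}%")
```

Under these assumptions the expected value stays at roughly 35.1 to 35.2%, far below the 40% reported for the Bv + PhiX data.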

Thanks!

**Melissa** · 12-05-2013, 04:18 PM

The datasets make sense to me because you're not sequencing the same individuals in every dataset. Sequencing bias is also a good explanation. M1 is most similar to G1 because mutation doesn't affect many sites in the genome. I think somaclonal variation during the tissue culture stage can also affect genome content. As for G2 and M2, I would expect the genome to change due to selfing (more homozygous). That doesn't necessarily mean a higher GC%, though, as observed in your case.


Large difference (5%) in GC content of illumina readsets derived from the same line.
