  • bias in mapped forward/reverse read ratios

    Hi,

    We see a peculiar bias in the mapping of reads in forward and reverse directions to sites in human genomic DNA. It occurs with both RNAseq and DNAseq, and with both novoalign and maq mapping. The data are 45 bp single-end (SE) GAII reads.

    We thought there should be a 50%/50% split of reads mapping to the genome in forward and reverse directions. The ratios of `.` to `,`, or `A` to `a`, etc. (maq pileup output format) have huge variance, much more than expected by chance.

    Has anyone else seen this? Is it something to do with Solexa chemistry, e.g. the adapter ligation or PCR steps?
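
One way to put a number on "more than expected by chance" is an exact two-sided binomial test against the 50%/50% expectation at each site. The sketch below is only an illustration, not what was run on this data. The function name `strand_bias_p` is hypothetical, and it assumes the forward- and reverse-strand read counts at a site have already been tallied (for example from the pileup column).

```python
from math import comb


def strand_bias_p(fwd: int, rev: int) -> float:
    """Two-sided exact binomial p-value for a 50/50 forward/reverse split.

    fwd and rev are read counts on each strand at one site (assumed to be
    counted beforehand). Returns 1.0 when there are no reads at all.
    """
    n = fwd + rev
    if n == 0:
        return 1.0
    k = min(fwd, rev)
    # P(X <= k) for X ~ Binomial(n, 0.5); the distribution is symmetric,
    # so the two-sided p-value is twice this tail, capped at 1.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for fwd, rev in [(5, 5), (10, 0), (55, 45)]:
        print(fwd, rev, strand_bias_p(fwd, rev))
```

When this is applied to many sites at once, some small p-values are expected by chance alone, so a multiple-testing correction would be needed before calling any single site biased.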

    It is relevant for SNP calling algorithms.

    thanks

    david

  • #2
    Very interesting. We saw that once, but never pursued it further. Can you be specific about the variance you see in fwd vs rev?

    I will look up and see what we generally get..
    --
    bioinfosm

    • #3
      I've seen this too after using sequence capture for sample prep, and it would be interesting to get an explanation for it. I got this reply from Illumina's tech support:
      "My understanding is that by using the Nimblegen sequence capture you enrich for particular regions but it also introduces a bias in the strandedness. We normally never see a bias with regards to strand."
      But I thought the DNA eluted from the seq. cap. array would all be double stranded after the PCR amplification step, and therefore no bias should be seen.

      • #4
        We see bias with both sequence capture DNA and non sequence capture RNAseq. So it's not the sequence capture.
        david

        • #5
          That sounds strange. Are there any particular regions where the bias occurs, and is it always the same strand that gets the higher read numbers?

          • #6
            Not sure I understand what you mean by bias?

            Is it a global bias, so that, say, 70% are F and 30% are R?

            Do you have forward and reverse strand coverage graphs?

            You should see equal F and R globally. If you make separate F and R coverage plots, though, you might see identical coverage peaks offset by the average insert size of your template, because any fragment has a 50% chance of going onto the surface in either orientation during cluster prep.
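
A rough way to check for that offset is to build coverage separately for each strand and look for the shift at which the two profiles line up best. This is only a sketch under stated assumptions. The function names `strand_coverage` and `best_offset` are hypothetical, reads are assumed to be given as (0-based leftmost start, strand) pairs with a fixed read length, and the overlap score is a plain sum of products rather than any particular tool's method.

```python
def strand_coverage(reads, ref_len, read_len):
    """Per-base coverage for each strand.

    reads: iterable of (start, strand), where start is the 0-based leftmost
    mapped position and strand is '+' or '-'.
    Returns (forward_coverage, reverse_coverage) as two lists of length ref_len.
    """
    cov = {"+": [0] * ref_len, "-": [0] * ref_len}
    for start, strand in reads:
        for pos in range(max(start, 0), min(start + read_len, ref_len)):
            cov[strand][pos] += 1
    return cov["+"], cov["-"]


def best_offset(fwd, rev, max_shift):
    """Shift (in bases) of the reverse profile that best overlaps the forward one."""
    def score(shift):
        # Compare fwd[i] with rev[i + shift]; zip stops at the shorter list.
        return sum(f * r for f, r in zip(fwd, rev[shift:]))
    return max(range(max_shift + 1), key=score)


if __name__ == "__main__":
    # Toy data: forward reads start at 10, reverse reads start at 25.
    reads = [(10, "+")] * 3 + [(25, "-")] * 3
    fwd, rev = strand_coverage(reads, ref_len=50, read_len=5)
    print(best_offset(fwd, rev, max_shift=30))
```

With real data the per-strand profiles could also be plotted on top of each other, which makes it easier to tell a simple offset from a genuine difference in read numbers.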

            • #7
              Just thought I would give you an idea of what we are seeing in dvh's group. Attached to this post is a graph with some data from the phiX174 control lane from two separate runs but using the same library. The data we see from other libraries (all kinds of sample preps: DNA, RNA, methylation) tends to be similar to this, although the change is more pronounced than in phiX, probably due to lower coverage.
              Attached Files

              • #8
                That doesn't look right.

                • #9
                  Yes, definitely something strange happening there. We see 55-45 fwd vs reverse mapping, but it's kind of uniform across the reference.
                  --
                  bioinfosm
