Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • SAV % base question

    I am new to looking at SAV. Would someone please explain to me why my % base would look like this. Thanks.
    Attached Files

  • #2
    Can you give more information about the library? I'm not sure, but it looks like a bar code, CCGGTTCA.

    Comment


    • #3
      The library has a standard Illumina adapter on it. I am not sure why there is the change at around 60 bases either.

      Comment


      • #4
        Have you measured the insert size? They could be short. That wouldn't explain everything, but it might explain the change at around 60.

        Comment


        • #5
          Here is what the library looks like. It was made using a protocol based on Khil et al Genome Res. 2012. 22: 957-965.

          The abstract says: Meiotic DNA double-stranded breaks (DSBs) initiate genetic recombination in discrete areas of the genome called recombination hotspots. DSBs can be directly mapped using chromatin immunoprecipitation followed by sequencing (ChIP-seq). Nevertheless, the genome-wide mapping of recombination hotspots in mammals is still a challenge due to the low frequency of recombination, high heterogeneity of the germ cell population, and the relatively low efficiency of ChIP. To overcome these limitations we have developed a novel method—single-stranded DNA (ssDNA) sequencing (SSDS)—that specifically detects protein-bound single-stranded DNA at DSB ends. SSDS comprises a computational framework for the specific detection of ssDNA-derived reads in a sequencing library and a new library preparation procedure for the enrichment of fragments originating from ssDNA. The use of our technique reduces the nonspecific double-stranded DNA (dsDNA) background >10-fold. Our method can be extended to other systems where the identification of ssDNA or DSBs is desired.
          Attached Files

          Comment


          • #6
            Originally posted by sophie mcquade View Post
            I am new to looking at SAV. Would someone please explain to me why my % base would look like this. Thanks.
            What you have is a library with a significant fraction of adapter dimer reads. If you closely examine the pattern of the predominant base call over the fist 60 bases of reads 1 and 2 you will find that they exactly match the sequence of the Illumina adapters. The adapters are each ~60nt long so when the read runs off the end of the adapter the sequencer has tendency to just call A's.

            Now look at you Bioanalyzer trace. Do you see that teeny, tiny little peak at 125nt? That is your adapter dimer. Adapter dimer contaminations which appear insignificant on a BioA trace can account for a significant number of reads because they will cluster with much higher efficiency than your legitimate library molecules.

            Comment


            • #7
              And I guess you did put index information into your samplesheet. However, there was only one sample/one index sequenced. This should explain the pattern after 100 bases.

              Comment


              • #8
                Thanks for all of your suggestions. I had asked the person analyzing the data if he was seeing adapter dimer reads and he said no. Now that I have the % base data I will have to ask him to recheck the data.

                Comment


                • #9
                  Based on the relative peak heights, only 4-5% of your reads are adapter dimers. The effect on the data analysis would be minimal (a reduction in the fraction of aligned reads), which could easily have been missed by your informaticist.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Essential Discoveries and Tools in Epitranscriptomics
                    by seqadmin




                    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                    04-22-2024, 07:01 AM
                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Today, 11:49 AM
                  0 responses
                  12 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, Yesterday, 08:47 AM
                  0 responses
                  16 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  61 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X