Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • specific location coverage from average coverage?

    Hello there,

    I want to know if there is any way to calculate the coverage at a specific nucleotide across the amplicon using the average coverage, with the assumption that the reads are normally distributed across the ampliocn.

    Say I have sequenced a 1.3kb amplicon, with read length 300 nts, and number of reads 10000. using Lander/Waterman equation (LN/G), the average coverage will be ~2300x. This tells us that every nucleotide was covered at least with 2300 reads. My question is that, can I estimate roughly the actual number of reads cover a certain nucleotide across the ampliocn, say nucleotide number 9, or any other nucleotide. Is it doable in a theoretical way without using tools like bedtools or samtools?

    Many thanks

  • #2
    You can't get a useful answer without mapping. Did you amplify some region then randomly shear it? If you're sequencing the amplicons directly, the coverage will all be in the same location. If you're sequencing the randomly sheared amplicons, there's no way to predict what the coverage will look like.

    Comment


    • #3
      mmmm I see..yes this is the plan, to shear the amplicon.

      What If I use a simulated data, that is perfectly normally distributed, can I use this average coverage to calculate a specific point coverage?

      Many thanks

      Comment


      • #4
        Certainly, if you use synthetic data with a normal distribution, it's possible to estimate the coverage at any point along the amplicon from the average.

        Comment


        • #5
          Thanks very much Brian, could you please tell me a key though of how this can be done? Do I have to find first the actual coverage for one point, and use it as a reference? Or I can use the coverage average along with the mean and standard deviation? In both ways, I am not sure how to carry out this process...Confused

          Could please help me further.

          Much appreciated

          Comment


          • #6
            Well, it's possible, once you know the constants, using the formula:



            Using 300bp reads instead of 1bp reads would make it harder; you'd need to modify the formula somehow. But it's an approximation either way, and not one that I think would be useful at all in the real world, so I don't see much point in putting in any effort to solve it.

            Comment


            • #7
              Many thanks

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              59 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              57 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              47 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X