Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Giorgio C
    Member
    • Oct 2010
    • 89

    Hot to calculate coverage on a 454 targeted region sequenced ?

    Hi all,

    I have a region of 250.000 bp targeted with probes and then sequenced on a 454.

    How can i make a statistical calculation of the coverage ?
    Do i have to consider the total number of bp sequenced and number of reads ?

    Or just for example considering 1.000.000 bp sequenced :

    Coverage = 100.000/250.000 bp ?

    Any advices are highly appreciated,
    Thanks
  • sphil
    Senior Member
    • Apr 2010
    • 192

    #2
    Hi,


    coverage = (#reads * length of reads) / length of target sequence

    thus, having 50k reads with length of 250 and 100k target sequence length you get a average coverage of 125. However please keep in mind that some regions can be sequences more easily and all sequences techniques have their own caveats. Keeping that in mind the above approximation is still valid.

    Best

    Phil

    Comment

    • Giorgio C
      Member
      • Oct 2010
      • 89

      #3
      Thank you very much for your reply. So if i undersand well the correct formula is:

      total numbero of bp obteined * avarege length of all reads / length of targeted region

      Is that right ?

      Comment

      • Giorgio C
        Member
        • Oct 2010
        • 89

        #4
        or is :

        total number of bp obteined * total number of reads / length of targeted region ?

        Comment

        • sphil
          Senior Member
          • Apr 2010
          • 192

          #5
          it is: total number of bp obtained / length of target region. Since total number of bp obtained == amount of reads * length of reads.

          Comment

          • Giorgio C
            Member
            • Oct 2010
            • 89

            #6
            Hmm.. i'm sorry but something is not clear to me; If i have this situation:

            - Number of total Bp sequenced = 7.114.424

            - Number of reads = 354

            - Length of targeted region = 225.000 bp


            Is correct only to do 7.114.424 / 225.000 = 31,61 ; Means 31% of coverage ?

            And i don't need to consider the "Number of reads" (354) ?

            I hope I was clear,
            Thanks you for your help

            Comment

            • sphil
              Senior Member
              • Apr 2010
              • 192

              #7
              Originally posted by Giorgio C View Post
              Hmm.. i'm sorry but something is not clear to me; If i have this situation:

              - Number of total Bp sequenced = 7.114.424

              - Number of reads = 354

              - Length of targeted region = 225.000 bp


              Is correct only to do 7.114.424 / 225.000 = 31,61 ; Means 31% of coverage ?

              And i don't need to consider the "Number of reads" (354) ?

              I hope I was clear,
              Thanks you for your help
              yes, you are right. You do not need to consider the number of redas because this should result from: number of reads * read length. To get the coverage you simply do the calculation above, thus 7.114.424 / 225.000 = 31,61. That does not mean that you ave 31% coverage. It means that every base in your target sequence is covered by 31 bases contained in your reads. So you have a coverage of 31, which is not too bad.

              Comment

              • Giorgio C
                Member
                • Oct 2010
                • 89

                #8
                Thank you very much for your reply. You have been very helpful !!!

                Comment

                • jordipt
                  Junior Member
                  • Nov 2009
                  • 2

                  #9
                  Hi there,
                  I know it is quite late but I think it might also be interesting to others to leave a comment on this (still open) thread.
                  When considering coverage of a targetted run you should also include in the equation the % of "on target" reads. That is, in a typical capture experiment not all reads obtained from the sequencer map to your region of interest, in fact, for a region like the aprox 250Kb you mention, on target reads could be about 70-50% of all your reads. Thus the number of bases obtained that you have to use in your calculations should be all those that map to your target region.
                  Best

                  Comment

                  • Giorgio C
                    Member
                    • Oct 2010
                    • 89

                    #10
                    Thanks jordipt for your clarification...it's never late for good suggestions

                    Comment

                    Latest Articles

                    Collapse

                    • GATTACAT
                      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by GATTACAT
                      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                      Yesterday, 11:43 AM
                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, Today, 11:08 AM
                    0 responses
                    6 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-30-2026, 05:37 AM
                    0 responses
                    11 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    19 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    53 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...