Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • SDPA_Pet
    Senior Member
    • Apr 2013
    • 222

    The number of raw reads

    Hello, I am gonna make a table to do some stats of my Illumina pair-end sequencing.

    There is a column (The number of raw reads). This is pair-end sequencing. If I got 10 million from R1 and I will have 10 million from R2. When I say the number of raw reads. should I put 10 million or 20 million (R1+R2).

    Thanks
  • husamia
    Member
    • Apr 2010
    • 66

    #2
    for pre-alignment counts you could count the number of lines in the FASTQ files and divide by 4. I use wc -l

    Comment

    • GenoMax
      Senior Member
      • Feb 2008
      • 7142

      #3
      Originally posted by SDPA_Pet View Post
      Hello, I am gonna make a table to do some stats of my Illumina pair-end sequencing.

      There is a column (The number of raw reads). This is pair-end sequencing. If I got 10 million from R1 and I will have 10 million from R2. When I say the number of raw reads. should I put 10 million or 20 million (R1+R2).

      Thanks
      Illumina "ac"counts for them as 20M reads in their spec sheets (but they come from 10M unique clusters).

      Comment

      • SDPA_Pet
        Senior Member
        • Apr 2013
        • 222

        #4
        Originally posted by husamia View Post
        for pre-alignment counts you could count the number of lines in the FASTQ files and divide by 4. I use wc -l
        I have software to count. That's not a problem. Here is what I am concern. Should I add them number together after I get the number of R1 and R2 reads.

        Comment

        • husamia
          Member
          • Apr 2010
          • 66

          #5
          Originally posted by SDPA_Pet View Post
          I have software to count. That's not a problem. Here is what I am concern. Should I add them number together after I get the number of R1 and R2 reads.
          if R1 and R2 are the paired sequences, then should be count R1 = count R2 = number of clusters

          Comment

          • SDPA_Pet
            Senior Member
            • Apr 2013
            • 222

            #6
            Originally posted by husamia View Post
            if R1 and R2 are the paired sequences, then should be count R1 = count R2 = number of clusters
            I know. I mean when I report the total number of raw reads? should I report the number of clusters or the number of R1 or R2 X2.

            I know the theory behind this. I am asking here is about which way the normal way to report the raw reads.

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #7
              Originally posted by SDPA_Pet View Post
              I know. I mean when I report the total number of raw reads? should I report the number of clusters or the number of R1 or R2 X2.

              I know the theory behind this. I am asking here is about which way the normal way to report the raw reads.
              If you want to track how many fragments were sequenced then at the cluster level. If you want to count total number of reads that came from a lane then at R1+R2 level.

              Comment

              • SDPA_Pet
                Senior Member
                • Apr 2013
                • 222

                #8
                Originally posted by GenoMax View Post
                If you want to track how many fragments were sequenced then at the cluster level. If you want to count total number of reads that came from a lane then at R1+R2 level.
                HI GenoMax,

                Thanks. Besides BBTools, do you know any other software can check some basic information about assembled genomes/metagenomes data files. For example, some basic information such as total number contigs, N50, largest contigs, etc.

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  Originally posted by SDPA_Pet View Post
                  HI GenoMax,

                  Thanks. Besides BBTools, do you know any other software can check some basic information about assembled genomes/metagenomes data files. For example, some basic information such as total number contigs, N50, largest contigs, etc.
                  Try QUAST/MetaQUAST.
                  Last edited by GenoMax; 10-20-2016, 11:50 AM.

                  Comment

                  • SDPA_Pet
                    Senior Member
                    • Apr 2013
                    • 222

                    #10
                    Originally posted by GenoMax View Post
                    Anyother software. I don't like this much

                    Comment

                    Latest Articles

                    Collapse

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, Today, 10:09 AM
                    0 responses
                    9 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, Yesterday, 08:59 AM
                    0 responses
                    16 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-02-2026, 12:03 PM
                    0 responses
                    24 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-02-2026, 11:40 AM
                    0 responses
                    20 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...