Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Quality score after Illumina run - should it be coverted before samtools and gatk?

    Hi all,

    I am a bit confused with the base quality score. After Illumina run, I use BWA to map the reads. I then use downstream applications such as samtools and gatk. But samtools accepts sanger phred (ASCII-33), while Illimina's fastq is ASCII-64. Is it correct that the fastq need to be converted before running BWA to sanger scores? If so, what tools do people use for this?

  • #2
    yes, you should convert first, even before BWA.
    google for fq_all2std.pl

    Comment


    • #3
      thanks a lot!

      Comment


      • #4
        I took a look at the code of fq_all2std.pl . I believe I need the command:
        sol2std Convert Solexa/Illumina FASTQ to the standard FASTQ

        But doesn't it convert the old Illumina (prior to version 1.3) to sanger?

        The conversion table it uses:
        for (-64..64) {
        $conv_table[$_+64] = chr(int(33 + 10*log(1+10**($_/10.0))/log(10)+.499));
        }

        Comment


        • #5
          But doesn't it convert the old Illumina (prior to version 1.3) to sanger?
          Yeah this script converts only solexa format and not illumina(1.3 or 1.5+) one.

          Comment


          • #6
            BWA option -I

            In the recent versions in BWA you no longer need to convert the quality scores. From the BWA manual:

            -I The input is in the Illumina 1.3+ read format (quality equals ASCII-64).

            Comment


            • #7
              So it seems you don't need to convert before BWA. Do you need to convert before GATK or before samtools?

              Comment


              • #8
                There isn't any need to, as all SAM/BAM files have QUAL field: "ASCII of Phred-scaled base QUALity+33". This way you don't need to worry about the quality scale in SAM/BAM.

                (Note that it is possible to get qualities in a SAM/BAM like file that are scaled QUALity+64, BUT these are not real SAM/BAM files. BWA produces a proper SAM/BAM file with the correct quality scale.)

                Comment


                • #9
                  what is the consequence of losing -I option?

                  As asked in the title, what if I missed the -I option while run bwa? is there any remedial work I can do to save it? thx!

                  Comment


                  • #10
                    Current Illumina's qualities in the fastq files changed since the first post on this issue. They are now (Illumina 1.8+) Phred+33, and therefore no need to specify -I for BWA. Just make sure the fastq files you are working with are indeed Illumina 1.8+.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Essential Discoveries and Tools in Epitranscriptomics
                      by seqadmin




                      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                      04-22-2024, 07:01 AM
                    • seqadmin
                      Current Approaches to Protein Sequencing
                      by seqadmin


                      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                      04-04-2024, 04:25 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Today, 08:47 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-11-2024, 12:08 PM
                    0 responses
                    60 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 10:19 PM
                    0 responses
                    59 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 04-10-2024, 09:21 AM
                    0 responses
                    54 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X