Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Quality score after Illumina run - should it be coverted before samtools and gatk?

    Hi all,

    I am a bit confused with the base quality score. After Illumina run, I use BWA to map the reads. I then use downstream applications such as samtools and gatk. But samtools accepts sanger phred (ASCII-33), while Illimina's fastq is ASCII-64. Is it correct that the fastq need to be converted before running BWA to sanger scores? If so, what tools do people use for this?

  • #2
    yes, you should convert first, even before BWA.
    google for fq_all2std.pl

    Comment


    • #3
      thanks a lot!

      Comment


      • #4
        I took a look at the code of fq_all2std.pl . I believe I need the command:
        sol2std Convert Solexa/Illumina FASTQ to the standard FASTQ

        But doesn't it convert the old Illumina (prior to version 1.3) to sanger?

        The conversion table it uses:
        for (-64..64) {
        $conv_table[$_+64] = chr(int(33 + 10*log(1+10**($_/10.0))/log(10)+.499));
        }

        Comment


        • #5
          But doesn't it convert the old Illumina (prior to version 1.3) to sanger?
          Yeah this script converts only solexa format and not illumina(1.3 or 1.5+) one.

          Comment


          • #6
            BWA option -I

            In the recent versions in BWA you no longer need to convert the quality scores. From the BWA manual:

            -I The input is in the Illumina 1.3+ read format (quality equals ASCII-64).

            Comment


            • #7
              So it seems you don't need to convert before BWA. Do you need to convert before GATK or before samtools?

              Comment


              • #8
                There isn't any need to, as all SAM/BAM files have QUAL field: "ASCII of Phred-scaled base QUALity+33". This way you don't need to worry about the quality scale in SAM/BAM.

                (Note that it is possible to get qualities in a SAM/BAM like file that are scaled QUALity+64, BUT these are not real SAM/BAM files. BWA produces a proper SAM/BAM file with the correct quality scale.)

                Comment


                • #9
                  what is the consequence of losing -I option?

                  As asked in the title, what if I missed the -I option while run bwa? is there any remedial work I can do to save it? thx!

                  Comment


                  • #10
                    Current Illumina's qualities in the fastq files changed since the first post on this issue. They are now (Illumina 1.8+) Phred+33, and therefore no need to specify -I for BWA. Just make sure the fastq files you are working with are indeed Illumina 1.8+.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Advancing Precision Medicine for Rare Diseases in Children
                      by seqadmin




                      Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                      12-16-2024, 07:57 AM
                    • seqadmin
                      Recent Advances in Sequencing Technologies
                      by seqadmin



                      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                      Long-Read Sequencing
                      Long-read sequencing has seen remarkable advancements,...
                      12-02-2024, 01:49 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 12-17-2024, 10:28 AM
                    0 responses
                    23 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-13-2024, 08:24 AM
                    0 responses
                    42 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-12-2024, 07:41 AM
                    0 responses
                    28 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-11-2024, 07:45 AM
                    0 responses
                    42 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X