  • Questions on the updated illumina quality score

    The quality encoding of my data from an updated Illumina system is reported as Sanger/Illumina 1.9, which confused me very much. Could I just treat it as Sanger format?
    Can somebody familiar with this give me some details about this kind of encoding?

  • #2
    Hi zeam,

    I would suggest looking at the changes made to CASAVA 1.8 - there is a nice post about it here.

    I know they have switched the quality encodings from Phred+64 to the more standard Sanger encoding (ASCII = Phred+33) starting in CASAVA 1.8.

    Justin
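
To make the offset change concrete, here is a minimal sketch of re-encoding a Phred+64 quality string as Phred+33. The function name and the sample string are made up for illustration, and it assumes the input really is Phred+64 (Illumina 1.3 to 1.7), not the older Solexa scale.

```python
def phred64_to_phred33(qual: str) -> str:
    """Re-encode a Phred+64 quality string as Phred+33 (Sanger).

    Hypothetical helper: assumes every character encodes a Phred score
    with offset 64, so each character is shifted down by 64 - 33 = 31.
    """
    converted = []
    for c in qual:
        phred = ord(c) - 64
        if phred < 0:
            # Characters below '@' cannot occur in Phred+64 data.
            raise ValueError(f"character {c!r} is not valid Phred+64")
        converted.append(chr(phred + 33))
    return "".join(converted)


old = "hhhhggfB"  # made-up Illumina 1.3-1.7 style quality string
print(phred64_to_phred33(old))  # prints IIIIHHG#
```

CASAVA 1.8 output should not need this step, since it is already Phred+33.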


    • #3
      @zeam: The new Illumina quality scores are in Sanger format and encode a Phred quality score from 0 to 93 using ASCII 33 to 126.

      But we are confused by the new quality scores as well. We use BWA for mapping. BWA has the extra option -I for quality scores in the Illumina 1.3+ read format (quality equals ASCII-64). I assume that without that option BWA expects the old Illumina format. Is that correct? How do we use BWA correctly with the new Sanger format?

      Thanks Robby
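
As a rough illustration of the Sanger encoding described above (Phred 0 to 93 stored as ASCII 33 to 126), the sketch below decodes a quality string and guesses the offset from the lowest character seen. Both function names are hypothetical, and the guess is only a heuristic: a Phred+33 file containing nothing below Q26 would be misread as offset 64.

```python
def decode_sanger(qual: str) -> list[int]:
    """Decode a Phred+33 quality string into integer Phred scores."""
    scores = [ord(c) - 33 for c in qual]
    if any(q < 0 or q > 93 for q in scores):
        raise ValueError("character outside the ASCII 33-126 range")
    return scores


def guess_offset(quality_strings: list[str]) -> int:
    """Heuristically guess the ASCII offset (33 or 64).

    Assumption: any character below ';' (ASCII 59) cannot occur in the
    offset-64 formats, so its presence implies offset 33. Otherwise the
    data are only *probably* offset 64.
    """
    lowest = min(ord(c) for q in quality_strings for c in q)
    return 33 if lowest < 59 else 64


print(decode_sanger("!5?IJ"))      # prints [0, 20, 30, 40, 41]
print(guess_offset(["IIII#5"]))    # prints 33
print(guess_offset(["hhhhB"]))     # prints 64
```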


      • #4
        Originally posted by zeam
        The quality encoding of my data from an updated Illumina system is reported as Sanger/Illumina 1.9, which confused me very much. Could I just treat it as Sanger format?
        Can somebody familiar with this give me some details about this kind of encoding?
        I assume you are referring to pipeline v.1.8 (I am sure there is a v.1.9 somewhere in Illumina labs in alpha/beta testing).
        If that is correct then your quality values will be in sanger format. You will also discover that if your facility uses v.3 chemistry then the valid range of quality values has been expanded beyond the previous max value of 40. You will see quality values of 41 (and up at some point in time), which are now possible.


        • #5
          Originally posted by Robby
          But we are confused by the new quality scores as well. We use BWA for mapping. BWA has the extra option -I for quality scores in the Illumina 1.3+ read format (quality equals ASCII-64). I assume that without that option BWA expects the old Illumina format. Is that correct? How do we use BWA correctly with the new Sanger format?
          Not quite. They haven't updated the BWA documentation to say that "1.3+" should read "1.3-1.7". With 1.8, just leave out -I and you'll be fine.
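
The rule above can be sketched as a small helper that builds a `bwa aln` argument list and adds -I only for pipeline versions 1.3 through 1.7. The function name, the version tuple, and the file names are assumptions made for illustration; nothing is executed here, and pre-1.3 data are rejected because they are not covered in this thread.

```python
def bwa_aln_command(reference: str, fastq: str,
                    pipeline_version: tuple[int, int]) -> list[str]:
    """Build (but do not run) a `bwa aln` command line.

    Hypothetical helper: pipeline_version is (major, minor), e.g. (1, 8).
    -I is added only for Illumina 1.3-1.7 reads (Phred+64).
    """
    if pipeline_version < (1, 3):
        raise ValueError("pre-1.3 quality scores are not handled in this sketch")
    cmd = ["bwa", "aln"]
    if pipeline_version < (1, 8):
        cmd.append("-I")  # qualities are ASCII-64
    cmd += [reference, fastq]
    return cmd


# Placeholder file names, for illustration only.
print(" ".join(bwa_aln_command("ref.fa", "reads.fq", (1, 5))))  # bwa aln -I ref.fa reads.fq
print(" ".join(bwa_aln_command("ref.fa", "reads.fq", (1, 8))))  # bwa aln ref.fa reads.fq
```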


          • #6
            Hi all,

            I noticed that BWA assigns a mapping quality of 0 when it finds a "J" (or at least a bunch of them) in the quality string. So far I've opted for changing all J to I and then mapping with the default BWA settings, so that it assumes Sanger format. I think a patch will be needed to correct this bug.

            Let me know if you have observed this as well.
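
For anyone trying the same workaround, here is a minimal sketch that caps qualities at Q40, which in Phred+33 turns every "J" (Q41) into "I" (Q40). It touches only the quality line of each record, so a "J" in a read header is left alone. The function names are hypothetical, and it assumes plain four-line FASTQ records with no wrapped sequence lines.

```python
def cap_quality(qual: str, max_q: int = 40, offset: int = 33) -> str:
    """Replace every quality character above max_q with the max_q character."""
    cap = chr(max_q + offset)  # 'I' for Q40 in Phred+33
    return "".join(min(c, cap) for c in qual)


def cap_fastq_lines(lines: list[str]) -> list[str]:
    """Cap qualities in a list of FASTQ lines (4 lines per record assumed)."""
    out = []
    for i, line in enumerate(lines):
        text = line.rstrip("\n")
        # The 4th line of each record (index 3, 7, ...) holds the qualities.
        out.append(cap_quality(text) if i % 4 == 3 else text)
    return out


record = ["@readJ1", "ACGTAC", "+", "IJJI#J"]  # made-up record
print(cap_fastq_lines(record))  # prints ['@readJ1', 'ACGTAC', '+', 'IIII#I']
```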


            • #7
              Originally posted by GenoMax
              I assume you are referring to pipeline v.1.8 (I am sure there is a v.1.9 somewhere in Illumina labs in alpha/beta testing).
              If that is correct then your quality values will be in sanger format. You will also discover that if your facility uses v.3 chemistry then the valid range of quality values has been expanded beyond the previous max value of 40. You will see quality values of 41 (and up at some point in time), which are now possible.
              Hi,

              Are the scores on a different scale or are there just more of them? I want to filter scores with a cutoff of 20. Previously, with the Phred+64 scores, I would test with ASCII-64 > 20. So can I do the same with the Phred+33 scores, i.e. ASCII-33 > 20?

              Thanks,
              Thadeous
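
As far as I know, both encodings store the same Phred score and only the ASCII offset differs, so the test carries over with 33 in place of 64. A minimal sketch of that check follows. The function name is hypothetical, and it requires every base to be strictly above the cutoff, matching the "> 20" test in the question.

```python
def passes_cutoff(qual: str, cutoff: int = 20, offset: int = 33) -> bool:
    """Return True if every base has Phred quality strictly above cutoff."""
    return all(ord(c) - offset > cutoff for c in qual)


print(passes_cutoff("IIJ6"))             # prints True  ('6' is Q21 in Phred+33)
print(passes_cutoff("II5"))              # prints False ('5' is Q20, not > 20)
print(passes_cutoff("hhU", offset=64))   # prints True  ('U' is Q21 in Phred+64)
```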
