Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nexgengirl
    Member
    • Apr 2010
    • 31

    BFAST and Variant Calling

    Hi,

    I have a question regarding variant calling with SOLiD data. For the mapping, I used BFAST and for variant calling I'm trying to use GATK.

    The problem occurs when I run the quality score recalibration and also the unified genotyper. Whenever I run these I get an output that says many of my reads were filtered because they don't have a mapping quality (The MappingQualityUnavailableFilter contains nearly 90% of my reads, also see here: http://www.broadinstitute.org/gsa/wi...TK_release_1.1).

    In other words the mapping quality was 255 which in a sam/bam file means the quality is unavailable.

    Here (http://sourceforge.net/apps/mediawik...apping_Quality) it seems the mapping quality of 255 is actually very good.

    I'm confused as to how to work around this problem. Any help would be appreciated. Thanks.
  • Chipper
    Senior Member
    • Mar 2008
    • 323

    #2
    I had the same problem, my solution was to change the mapping quality. This issue is supposed to be fixed in the latest version of BFAST, so an better option would be to realign reads with mapQV 255. Or just use Samtools mpileup.

    Comment

    • nexgengirl
      Member
      • Apr 2010
      • 31

      #3
      Thanks Chipper.

      If I may ask what did you find was an appropriate change for the mapping quality? Could it merely be changed to 254 or is it more complex than that? Thanks again.

      Comment

      • Chipper
        Senior Member
        • Mar 2008
        • 323

        #4
        As long as it is not 255 it is ok, but if the alignments with score 255 are unreliable it would be better to set it to a lower value.

        Comment

        • nilshomer
          Nils Homer
          • Nov 2008
          • 1283

          #5
          Try upgrading to the newest version as Chipper suggests, then report back.

          Comment

          • nexgengirl
            Member
            • Apr 2010
            • 31

            #6
            I will try again using the new version (0.7.0a). Before I used 0.6.5a. My only concern is that the manual for the new version on page 39 also says "If a read has one alignment, then the mapping quality is set to 255." Is there another option I should specify to avoid getting a score of 255?

            Comment

            • nilshomer
              Nils Homer
              • Nov 2008
              • 1283

              #7
              I would welcome feedback, but I think the calculation should produce a lot fewer 255s. If you find that it does, perhaps I should update the manual.

              Comment

              • nexgengirl
                Member
                • Apr 2010
                • 31

                #8
                Thanks Chipper and nilshomer! I reran using the new version and now there are no reads failing this filter: MappingQualityUnavailableFilter

                It seems to be working much better now.

                Thanks again.

                Comment

                • priesgo
                  Member
                  • Aug 2012
                  • 22

                  #9
                  Let me retake this old issue.

                  I am working with 1000 genomes project alignments data and they have done their SOLiD alignment with bfast 0.64e, so I have two options: I redo the alignment myself with a newer version of bfast as it is said above; or I try to handle the 255 before GATK.

                  Handling the 255 mapping qualities requires replacing this values with something else in the interval [0, 254]. Any ideas on this?

                  I am trying to do it with samtools calmd mapping quality capping option.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM
                  • SEQadmin2
                    Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                    by SEQadmin2

                    Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                    Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                    05-06-2026, 09:04 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 06-02-2026, 12:03 PM
                  0 responses
                  21 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 11:40 AM
                  0 responses
                  14 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 05-28-2026, 11:40 AM
                  0 responses
                  29 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 05-26-2026, 10:12 AM
                  0 responses
                  31 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...