Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nexgengirl
    Member
    • Apr 2010
    • 31

    BFAST and Variant Calling

    Hi,

    I have a question regarding variant calling with SOLiD data. For the mapping, I used BFAST and for variant calling I'm trying to use GATK.

    The problem occurs when I run the quality score recalibration and also the unified genotyper. Whenever I run these I get an output that says many of my reads were filtered because they don't have a mapping quality (The MappingQualityUnavailableFilter contains nearly 90% of my reads, also see here: http://www.broadinstitute.org/gsa/wi...TK_release_1.1).

    In other words the mapping quality was 255 which in a sam/bam file means the quality is unavailable.

    Here (http://sourceforge.net/apps/mediawik...apping_Quality) it seems the mapping quality of 255 is actually very good.

    I'm confused as to how to work around this problem. Any help would be appreciated. Thanks.
  • Chipper
    Senior Member
    • Mar 2008
    • 323

    #2
    I had the same problem, my solution was to change the mapping quality. This issue is supposed to be fixed in the latest version of BFAST, so an better option would be to realign reads with mapQV 255. Or just use Samtools mpileup.

    Comment

    • nexgengirl
      Member
      • Apr 2010
      • 31

      #3
      Thanks Chipper.

      If I may ask what did you find was an appropriate change for the mapping quality? Could it merely be changed to 254 or is it more complex than that? Thanks again.

      Comment

      • Chipper
        Senior Member
        • Mar 2008
        • 323

        #4
        As long as it is not 255 it is ok, but if the alignments with score 255 are unreliable it would be better to set it to a lower value.

        Comment

        • nilshomer
          Nils Homer
          • Nov 2008
          • 1283

          #5
          Try upgrading to the newest version as Chipper suggests, then report back.

          Comment

          • nexgengirl
            Member
            • Apr 2010
            • 31

            #6
            I will try again using the new version (0.7.0a). Before I used 0.6.5a. My only concern is that the manual for the new version on page 39 also says "If a read has one alignment, then the mapping quality is set to 255." Is there another option I should specify to avoid getting a score of 255?

            Comment

            • nilshomer
              Nils Homer
              • Nov 2008
              • 1283

              #7
              I would welcome feedback, but I think the calculation should produce a lot fewer 255s. If you find that it does, perhaps I should update the manual.

              Comment

              • nexgengirl
                Member
                • Apr 2010
                • 31

                #8
                Thanks Chipper and nilshomer! I reran using the new version and now there are no reads failing this filter: MappingQualityUnavailableFilter

                It seems to be working much better now.

                Thanks again.

                Comment

                • priesgo
                  Member
                  • Aug 2012
                  • 22

                  #9
                  Let me retake this old issue.

                  I am working with 1000 genomes project alignments data and they have done their SOLiD alignment with bfast 0.64e, so I have two options: I redo the alignment myself with a newer version of bfast as it is said above; or I try to handle the 255 before GATK.

                  Handling the 255 mapping qualities requires replacing this values with something else in the interval [0, 254]. Any ideas on this?

                  I am trying to do it with samtools calmd mapping quality capping option.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                    Here are nine questions we think about, in roughly the order they matter, before...
                    06-18-2026, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Yesterday, 11:10 AM
                  0 responses
                  8 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  43 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  104 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  125 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...