  • BFAST and Variant Calling

    Hi,

    I have a question regarding variant calling with SOLiD data. For the mapping, I used BFAST and for variant calling I'm trying to use GATK.

    The problem occurs when I run quality score recalibration and also the UnifiedGenotyper. Whenever I run these, the output says that many of my reads were filtered because they don't have a mapping quality (the MappingQualityUnavailableFilter catches nearly 90% of my reads; also see here: http://www.broadinstitute.org/gsa/wi...TK_release_1.1).

    In other words, the mapping quality was 255, which in a SAM/BAM file means the mapping quality is unavailable.
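To check how many records are affected before running GATK, something like the following sketch could be used. It assumes the alignments are available as SAM text lines (for example from `samtools view -h`); the function name `count_mapq_unavailable` and the toy records are made up for illustration.

```python
def count_mapq_unavailable(sam_lines, unavailable=255):
    """Count SAM alignment records whose MAPQ equals `unavailable`.

    Returns a tuple (n_unavailable, n_total). Header lines (starting
    with '@') and lines with fewer than 11 fields are skipped.
    """
    n_unavailable = 0
    n_total = 0
    for line in sam_lines:
        if line.startswith("@"):
            continue  # SAM header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        n_total += 1
        # MAPQ is the fifth mandatory SAM column (index 4).
        if int(fields[4]) == unavailable:
            n_unavailable += 1
    return n_unavailable, n_total


# Toy records, not real data.
example = [
    "@SQ\tSN:chr1\tLN:1000\n",
    "r1\t0\tchr1\t100\t255\t4M\t*\t0\t0\tACGT\tIIII\n",
    "r2\t0\tchr1\t200\t37\t4M\t*\t0\t0\tACGT\tIIII\n",
    "r3\t16\tchr1\t300\t255\t4M\t*\t0\t0\tACGT\tIIII\n",
]
n_unavailable, n_total = count_mapq_unavailable(example)
print(f"{n_unavailable} of {n_total} records have MAPQ 255")
```

A high fraction here would be consistent with most reads being dropped by the MappingQualityUnavailableFilter.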

    However, according to this page (http://sourceforge.net/apps/mediawik...apping_Quality), a mapping quality of 255 actually seems to be very good.

    I'm confused as to how to work around this problem. Any help would be appreciated. Thanks.

  • #2
    I had the same problem; my solution was to change the mapping quality. This issue is supposed to be fixed in the latest version of BFAST, so a better option would be to realign reads with mapQV 255. Or just use Samtools mpileup.

    • #3
      Thanks Chipper.

      If I may ask, what did you find was an appropriate replacement value for the mapping quality? Could it simply be changed to 254, or is it more complex than that? Thanks again.

      • #4
        As long as it is not 255 it is OK, but if the alignments with score 255 are unreliable, it would be better to set it to a lower value.

        • #5
          Try upgrading to the newest version as Chipper suggests, then report back.

          • #6
            I will try again using the new version (0.7.0a); previously I used 0.6.5a. My only concern is that the manual for the new version also says, on page 39, "If a read has one alignment, then the mapping quality is set to 255." Is there another option I should specify to avoid getting a score of 255?

            • #7
              I would welcome feedback, but I think the calculation should produce a lot fewer 255s. If you find that it does, perhaps I should update the manual.

              • #8
                Thanks Chipper and nilshomer! I reran using the new version, and now no reads are failing the MappingQualityUnavailableFilter.

                It seems to be working much better now.

                Thanks again.

                • #9
                  Let me revive this old issue.

                  I am working with 1000 Genomes Project alignment data, and they did their SOLiD alignment with bfast 0.64e, so I have two options: redo the alignment myself with a newer version of bfast, as suggested above, or try to handle the 255 values before GATK.

                  Handling the 255 mapping qualities requires replacing these values with something else in the interval [0, 254]. Any ideas on this?

                  I am trying to do it with the samtools calmd mapping quality capping option.
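One way to replace the 255 values with something in [0, 254] is to rewrite the MAPQ column of the SAM text directly. The following is only a sketch under that assumption: the function name `reassign_mapq` and the default of 254 are made up for illustration, and, as noted earlier in the thread, a lower value may be more appropriate if the alignments scored 255 are unreliable.

```python
def reassign_mapq(line, new_mapq=254):
    """Return a SAM line with MAPQ 255 replaced by `new_mapq`.

    Header lines and records with any other MAPQ are returned unchanged.
    The default of 254 is an illustrative choice, not a recommendation.
    """
    if not 0 <= new_mapq <= 254:
        raise ValueError("new_mapq must be in the interval [0, 254]")
    if line.startswith("@"):
        return line  # SAM header line
    body = line.rstrip("\n")
    newline = line[len(body):]  # keep the original line ending, if any
    fields = body.split("\t")
    # MAPQ is the fifth mandatory SAM column (index 4).
    if len(fields) >= 11 and fields[4] == "255":
        fields[4] = str(new_mapq)
    return "\t".join(fields) + newline


# Toy record, not real data.
record = "r1\t0\tchr1\t100\t255\t4M\t*\t0\t0\tACGT\tIIII\n"
print(reassign_mapq(record, new_mapq=60), end="")
```

Applied line by line to the output of `samtools view -h`, the rewritten text could then be converted back to BAM with `samtools view -b` before running GATK.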
