Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bwa sampe: proper pair but on different contigs!!??!!

    Dear all,


    Does anyone have an idea how the following is possible:
    I have reads mapped in a proper pair (as indicated by the sam-flag) but they map to different contigs!!!???

    HWUSI-EAS300R:7:1:15:1404#0 147 FW_DM_LINE_Jockey 128 29 74M FW3_DM_LINE_Jockey 3131 0 TGCAAGATCGCTTAAATACATAGTGAATTGTTATCTTAAATAATAAAACTATGAGTCAGAATGACACTCGCGCC Y^S[]^\[]a_XSZ[_]]_`_`]```_^a^`^`[aa__`]V]```aa\a_`]aaaaaaaa`Ta\a`aaaba`aa XT:A:U NM:i:0 SM:i:29 AM:i:29 X0:i:1 X1:i:0 XM:i:0 XO:i:0 XG:i:0 MD:Z:74
    HWUSI-EAS300R:7:1:15:1495#0 147 Gypsy4_LTR_LTR_Gypsy 112 60 74M Gypsy4_I_LTR_Gypsy 6216 0 CATTCCACTGCCCGGAGCGTGTGAAGCGCAATGTCAGCATTCTGCCGTGAGCGCTGCTTCAAAAGACGGGCTAC XUPM^NHLSMW\SWSPM\MW]PW\TZ\aPMP^MS^S]]Z^M_^X]^Z^]Z^]`a]^Z_\aaS]Z`Sa]a`_a\a XT:A:U NM:i:3 XN:i:1 SM:i:37 AM:i:37 X0:i:1 X1:i:0 XM:i:3 XO:i:0 XG:i:0 MD:Z:5T32C22G12
    HWUSI-EAS300R:7:1:22:1504#0 147 FW_DM_LINE_Jockey 85 29 74M FW3_DM_LINE_Jockey 3125 0 AACTAAATAAAAAATCTGAAAGCGAAAGAGACGCTCTATGCGATGCAAGATCGCTTAAATACATAGTGAATTGT ]N^I_^WG[[[_YNFQP[XGM\_^^S\a__^``_Y[a^\_a_```aaa`a]a`a````ba_baa`a_bbaabaa XT:A:U NM:i:0 SM:i:29 AM:i:29 X0:i:1 X1:i:0 XM:i:0 XO:i:0 XG:i:0 MD:Z:74
    HWUSI-EAS300R:7:1:25:1975#0 83 BLOOD_I_LTR_Gypsy 145 29 13M3D61M BLASTOPIA_LTR_LTR_Gypsy 271 0
    Hope anyone can help on this!!
    best ro

  • #2
    Could be a bug in the mapping tool used. What tool and what version was it?

    Comment


    • #3
      Mapper: bwa
      Version: 0.57
      command bwa aln -n 0.01 -o 2 -e 12 -d 12 -t 2 etc

      Comment


      • #4
        Is there any obvious link between the contigs, in particular are they subsequent entries in the FASTA reference file?

        Comment


        • #5
          I was under the impression that BWA concatenates all the references together and aligns reads against that long string. Might it have something to do with that?

          Comment


          • #6
            Yes they are subsequent entries in the fasta file! It is the insert of a LTR transposon followed by the LTR, i.e.: this sequences are frequently found in exactly this order in the different species.
            This could be an explanation for the problem than. If BWA is concatenating the sequences and measuring the distance between the mates, than it finds the difference is correct, while ignoring the fact that a contig boundary is crossed, and thus assigns the flag mapped in a proper pair.

            Comment


            • #7
              Originally posted by GoneSouth View Post
              Yes they are subsequent entries in the fasta file!
              Given Lee Sam's post you can probably see why I asked that

              i.e. This is probably a bug in BWA, wrongly marking the reads as "properly paired".

              Comment


              • #8
                Yes I do, many thanks for all your help!!
                Now that I know whats going on I can handle this in my sam parser.
                And maybee the people from Sanger will find some time to fix this in one of the next versions - I will send a bug report.
                thanks ro

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Recent Advances in Sequencing Analysis Tools
                  by seqadmin


                  The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                  05-06-2024, 07:48 AM
                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 06:35 AM
                0 responses
                10 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, Yesterday, 02:46 PM
                0 responses
                16 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 05-07-2024, 06:57 AM
                0 responses
                15 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 05-06-2024, 07:17 AM
                0 responses
                18 views
                0 likes
                Last Post seqadmin  
                Working...
                X