  • Yexzm
    Junior Member
    • Dec 2017
    • 5

    Extract the XS field from bam

    Hello everyone,

    BWA mem generates an "XS" field (the suboptimal alignment score) for each read. When I use samtools view, it is presented this way:
    - NS500801:90:HY7JVBGXY:2:21205:8003:11253 147 chrM 958 60 76M = 920 -114 CCCCCTCCCCAATAAAGCTAAAACTCACCTGAGTTGTAAAAAACTCCAGTTGACACAAAATAGACTACGAAAGTGG >>;@B@CC1C??=??AAC=???>C@C>CC@BAAA?@<>>>>>=B?BB@@@?A=B=B>>>><@A=B<=;A>=@=;>= BD:Z:IIIMPOLKNKJJJBIMOMIBBJLKKIKLMKJKJIIKHAAAAILKKKLJIHJKHHHH@@GGIHLLLKKLJCKOJLJJ PG:Z:MarkDuplicates RG:Z:id BI:Z:LLLPTSOOSROPQHOTSQOGGNPPQNQPROLPNMMNNFFFFLNNONPOMLNOMLMNEEKLNMOOPOONMHOROPNN NM:i:0 AS:i:76 XS:i:55

    Does anyone know an easy way to extract it, for example with R? I know I could use samtools view + awk, but that would take a long time.

    Thanks in advance!
  • lindenb
    Senior Member
    • Apr 2010
    • 143

    #2
    using bioalcidaejdk: http://lindenb.github.io/jvarkit/BioAlcidaeJdk.html

    Code:
    java -jar dist/bioalcidaejdk.jar -e 'stream().forEach(R->println(R.getAttribute("XS")));'  in.bam


    • Yexzm
      Junior Member
      • Dec 2017
      • 5

      #3
      Hi lindenb, thank you for your answer,

      How can I get the read name too? I would like to have a table with the read name in the first column and the XS in the second.


      • lindenb
        Senior Member
        • Apr 2010
        • 143

        #4
        > How can I get the read name too ?

        Code:
        ... println(R.getReadName()+" "+R.getAttribute("XS"))


        • Yexzm
          Junior Member
          • Dec 2017
          • 5

          #5
          Thank you very much for your help!

          The output file is too big, so I'm trying to get the chromosome as well, so that I can split the output per chromosome. I tried "getReferenceIndex", but it returns "null". Do you know how I could do this?
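
          One possible route, sketched here rather than tested against bioalcidaejdk: in htsjdk the contig name of a record is normally available through SAMRecord.getReferenceName(), so that may be worth trying in place of getReferenceIndex. Independently of that, the chromosome is simply the third column (RNAME) of the text that samtools view prints, and the XS tag is one of the optional TAG:TYPE:VALUE fields from column 12 onwards. The class below is a hypothetical helper (the name XsByChromosome and its methods are made up for illustration, they are not part of jvarkit or htsjdk) that reads SAM text on standard input and prints read name, chromosome and XS as tab-separated columns. It assumes tab-delimited SAM lines and an integer XS tag as written by BWA mem.

          ```java
          import java.io.BufferedReader;
          import java.io.IOException;
          import java.io.InputStreamReader;

          // Hypothetical helper for illustration; not part of jvarkit or htsjdk.
          public class XsByChromosome {

              /**
               * Parses one tab-delimited SAM text line.
               * Returns {readName, chromosome, XS}, or null for header lines,
               * malformed lines, and records without an XS:i: tag.
               */
              public static String[] parse(String samLine) {
                  String[] f = samLine.split("\t");
                  // Header lines start with '@'; alignment lines have 11 mandatory columns.
                  if (samLine.startsWith("@") || f.length < 11) {
                      return null;
                  }
                  // Optional tags start at column 12 (index 11), in no fixed order.
                  for (int i = 11; i < f.length; i++) {
                      if (f[i].startsWith("XS:i:")) {
                          // f[0] = QNAME (read name), f[2] = RNAME (chromosome)
                          return new String[] { f[0], f[2], f[i].substring(5) };
                      }
                  }
                  return null;
              }

              public static void main(String[] args) throws IOException {
                  BufferedReader in = new BufferedReader(new InputStreamReader(System.in));
                  String line;
                  while ((line = in.readLine()) != null) {
                      String[] row = parse(line);
                      if (row != null) {
                          System.out.println(String.join("\t", row));
                      }
                  }
              }
          }
          ```

          With Java 11 or later this can be run straight from the source file, for example "samtools view in.bam | java XsByChromosome.java > xs.tsv". If the BAM is coordinate-sorted and indexed, giving samtools view a region (e.g. "samtools view in.bam chrM") restricts the input to one chromosome, which yields one smaller table per chromosome without any further splitting.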
