Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Sam to Bam using Picard - help!

    Hi,

    I am trying to convert from sam to bam using Picard but am getting an error message (see below). Can anyone interpret this for me?

    Thanks!

    Kath

    [skgtkaf@login01 13655]$ java -jar /home/skgtkaf/ngs/picard-tools-1.35/SamFormatConverter.jar I=13655_q20.sam O=13655_q20.bam
    [Wed Nov 24 10:54:33 CET 2010] net.sf.picard.sam.SamFormatConverter INPUT=13655_q20.sam OUTPUT=13655_q20.bam TMP_DIR=/tmp/skgtkaf VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
    [Wed Nov 24 10:54:33 CET 2010] net.sf.picard.sam.SamFormatConverter done.
    Runtime.totalMemory()=128843776
    Exception in thread "main" net.sf.samtools.SAMFormatException: Error parsing text SAM file. CIGAR should have zero elements for unmapped read.; Line 165
    Line: GA04_0001:4:1:1082:7316#0 133 chr12 27877005 0 35M43S =27877005 0 AAANNATNTNNTNNAATGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTANNANNACNNNNNNNNNN %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% XC:i:35
    at net.sf.samtools.SAMTextReader.reportErrorParsingLine(SAMTextReader.java:220)
    at net.sf.samtools.SAMTextReader.access$500(SAMTextReader.java:40)
    at net.sf.samtools.SAMTextReader$RecordIterator.parseLine(SAMTextReader.java:424)
    at net.sf.samtools.SAMTextReader$RecordIterator.next(SAMTextReader.java:268)
    at net.sf.samtools.SAMTextReader$RecordIterator.next(SAMTextReader.java:240)
    at net.sf.samtools.SAMFileReader$AssertableIterator.next(SAMFileReader.java:604)
    at net.sf.samtools.SAMFileReader$AssertableIterator.next(SAMFileReader.java:582)
    at net.sf.picard.sam.SamFormatConverter.doWork(SamFormatConverter.java:65)
    at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156)
    at net.sf.picard.sam.SamFormatConverter.main(SamFormatConverter.java:73)

  • #2
    What aligner are you using? From the SAM flag of that read (133) that read is unaligned and so picard wants it to have no defined CIGAR (which makes sense!) Have you tried samtools? May its less strict

    Comment


    • #3
      I think you should change the VALIDATION_STRINGENCY option, read more about it here: http://seqanswers.com/forums/showthread.php?t=4246

      Comment


      • #4
        Thanks for the suggestions. I used BWA to align the reads. I didn't realise you could use SAMtools to convert from SAM to BAM.

        Comment


        • #5
          Ok, setting VALIDATION_STRINGENCY=LENIENT allows the .bam file to be produced. Thanks very much for your help!

          Comment


          • #6
            How did you set the validation=lenient?

            Comment


            • #7
              Originally posted by muzz56 View Post
              How did you set the validation=lenient?
              just include that string "VALIDATION_STRINGENCY=LENIENT" somewhere in your command.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              51 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              45 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X