  • FastqToSam Picard tool error

    Hi all,

    I've been running into a problem with the FastqToSam Picard tool:

    My run command is: java -Xmx4g -jar /n/sw/picard-tools-1.98//FastqToSam.jar FASTQ=file.R1.fastq.gz FASTQ2=file.R2.fastq.gz SAMPLE_NAME=2a OUTPUT=text.bam

    The error I get is: Exception in thread "main" net.sf.picard.PicardException: Missing sequence line at line 54573 in fastq /path/file.R2.fastq.gz

    Anyone know what's going on?

  • #2
    Is there something wrong with the format of the fastq file at the line indicated by the error message?


    • #3
      Yes, I just checked that.
      It seems that there is no read2 sequence on that line, although read1 exists.

      Should I delete those lines in the R1 and R2 fastq files?

      PS: This is what R2 looks like around the line that is giving the error:
      ////////<//E//////<///<//<//E////<//<///////////////////<///
      @NS500422:212:HJG22BGXX:1:11101:1063:7774 2:N:0:TAGGCATG

      +

      (No sequence and no quality)
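
To see how many records look like the one above, a short scan of the file can help. This is only a minimal sketch, assuming strict 4-line FASTQ records (header, sequence, `+`, quality) where an empty read still occupies its four lines, as in the excerpt above. The function name `find_empty_reads` and the demo data are made up for illustration and are not part of Picard or BBMap.

```python
import io


def find_empty_reads(handle):
    """Yield (line_number, header) for every record whose sequence line is empty.

    Assumes strict 4-line FASTQ records; line numbers are 1-based and point
    at the (empty) sequence line.
    """
    line_no = 0
    while True:
        record = [handle.readline() for _ in range(4)]
        if not record[0]:  # end of file
            break
        header = record[0].strip()
        sequence = record[1].strip()
        if header and not sequence:
            yield line_no + 2, header
        line_no += 4


# Hypothetical demo data: the second record has no sequence and no quality.
demo = io.StringIO("@r1\nACGT\n+\nIIII\n@r2\n\n+\n\n@r3\nGG\n+\nII\n")
empty_reads = list(find_empty_reads(demo))
```

For a gzipped file, the handle could come from `gzip.open("file.R2.fastq.gz", "rt")` in the standard library instead of `io.StringIO`.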


      • #4
        Is it possible that your R1/R2 files have got out of sync? Just deleting those lines may not be enough. You should use repair.sh from BBMap to check and repair the files.


        • #5
          You can get rid of that with BBMap's reformat program:

          reformat.sh in=file.R#.fastq.gz out=reformatted.fq.gz minlen=1

          Alternatively, you can output directly as sam (or, if samtools is installed, as bam):

          reformat.sh in=file.R#.fastq.gz out=reformatted.bam minlen=1

          The "minlen=1" flag will throw away 0-length reads.


          • #6
            OK, but what about the corresponding read1?
            For example, what if the read on line 2 of file.R2 has length 0, but the read on line 2 of file.R1 has length > 0?


            • #7
              Reformat always keeps pairs together, to maintain consistency. So, both of them will be discarded.
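
The pair-wise behaviour described here can be illustrated with a small sketch. This is not BBMap's implementation, just a hedged illustration of the idea: read R1 and R2 in lockstep and drop both mates whenever either one is shorter than `minlen`. It assumes strict 4-line FASTQ records and files that are already in sync; `read_records`, `filter_pairs`, and the demo data are hypothetical names used only for this example.

```python
import io


def read_records(handle):
    """Yield (header, sequence, separator, quality) tuples from 4-line FASTQ."""
    while True:
        record = [handle.readline().rstrip("\n") for _ in range(4)]
        if not record[0]:  # end of file (or a blank header line)
            return
        yield tuple(record)


def filter_pairs(r1_handle, r2_handle, minlen=1):
    """Yield (read1, read2) pairs in which both sequences have length >= minlen.

    If either mate is too short, the whole pair is skipped, so the two
    outputs stay in sync. zip() stops at the shorter input, so this sketch
    does not detect files with different numbers of records.
    """
    pairs = zip(read_records(r1_handle), read_records(r2_handle))
    for read1, read2 in pairs:
        if len(read1[1]) >= minlen and len(read2[1]) >= minlen:
            yield read1, read2


# Hypothetical demo data: the second pair has an empty read 2.
demo_r1 = io.StringIO("@a/1\nACGT\n+\nIIII\n@b/1\nTTTT\n+\nIIII\n")
demo_r2 = io.StringIO("@a/2\nGGCC\n+\nIIII\n@b/2\n\n+\n\n")
kept_pairs = list(filter_pairs(demo_r1, demo_r2))
```

In the demo, read `@b/1` is discarded even though it has a sequence, because its mate `@b/2` is empty.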


              • #8
                Thank you so much Brian! This seems to have solved the problem.
