Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • New Member with Trimmomatic questions

    Hello, I would like to start off by saying hello to all and thank you so much for providing such indepth support. I have done some scouring on this site and am impressed with some of the resolutions members are able to provide.

    That being said I am trying to us trimmomatic 0.36 to trim the output from the first step in the CROPS pipeline. I am doing this instead of using the CROPS script itself because I am using Platform LSF and trimmomatic 0.36 is available.

    Trimmomatic is telling me "Sequence and quality length don't match". I am not sure what would be causing this as I am using unmodified files from step 1 of CROPS.

    This is the .err file I am getting from the mainframe and wonder if anyone has an idea on what the issue could be?

    Picked up _JAVA_OPTIONS: -Xmx512m
    TrimmomaticSE: Started with arguments:
    -phred33 /nethome/j.fiorentino3/GBS_Data/HFM7HBBXX/parsed/HFM7HBBXXCTGC_001.R1parsed.fastq.gz /nethome/j.fiorentino3/GBS_Data/HFM7HBBXX/parsed/HFM7HBBXXCTGC.trimmed.fq.gz ILLUMINACLIP:TruSeq3-SE.fa:2:30:10 LEADING:20 TRAILING:20 SLIDINGWINDOW:4:20 MINLEN:36
    Automatically using 1 threads
    Using Long Clipping Sequence: 'AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTA'
    Using Long Clipping Sequence: 'AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC'
    ILLUMINACLIP: Using 0 prefix pairs, 2 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
    Exception in thread "main" java.lang.RuntimeException: Sequence and quality length don't match: 'AATGGAGGGTGCTCAGCAAACACATAAACACACAACCATGTCCCATCAGCCTCCTGGAATGGTAGTGATCTAGATTATTGTGAGACAAAG' vs 'JJAFJFJJFJFFJJFJJJJFJFJJ77FAFJJFJJJJJFFJA<<FJA<FA-AF7FAFJJJAFJJAAJJF<AJJ7A'
    at org.usadellab.trimmomatic.fastq.FastqRecord.<init>(FastqRecord.java:25)
    at org.usadellab.trimmomatic.fastq.FastqParser.parseOne(FastqParser.java:89)
    at org.usadellab.trimmomatic.fastq.FastqParser.next(FastqParser.java:179)
    at org.usadellab.trimmomatic.TrimmomaticSE.processSingleThreaded(TrimmomaticSE.java:60)
    at org.usadellab.trimmomatic.TrimmomaticSE.process(TrimmomaticSE.java:222)
    at org.usadellab.trimmomatic.TrimmomaticSE.run(TrimmomaticSE.java:306)
    at org.usadellab.trimmomatic.Trimmomatic.main(Trimmomatic.java:85)

  • #2
    You appear to have one read (at least) where the sequence length does not match the quality score line length (corrupt fastg record). You can do something like
    Code:
     $ grep -A 3 "AATGGAGGGTGCTCAGCAAACACATAAACACACAACCATGTCCCATCAGCCTCCTGGAATGGTAGTGATCTAGATTATTGTG" your_file
    and examine that record for corruption.

    Comment


    • #3
      Hi,

      In line with questions by new users of Trimmomatic, I was wondering what "Using Long Clipping Sequence" means? Appreciate help in understanding this.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      30 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      32 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      28 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      53 views
      0 likes
      Last Post seqadmin  
      Working...
      X