Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Does BBDuk work on concatenated fastq files

    Basically, I have around 200 samples and wanted to sequence at a depth of 50 million reads per sample. This was not possible on one run so I chose to do all 200, 14 times. I have concatenated all the R1 together and all the R2 together from one sample and so on.... so now I have 200 R1 and 200 R2.

    I was wondering if BBDuk can deal with these files as each file is made up of 14 fastq output with 14 headings.

    Thank you in advance.

  • #2
    BBduk will work fine. Hopefully you concatenated the files in exactly the same order for both R1/R2 files.

    Comment


    • #3
      Adaptor trimming is not working

      Thank you for getting back to me. The adaptor trimming is not working sadly.

      This is what my script looks like:

      Ordered=t #Set to true to output reads in same order as input
      Ktrim=r #once a reference kmer is matched in a read, that kmer and all the bases to the right will be trimmed
      K=21 #specifies the kmer size
      Mink=8 #"mink" allows it to use shorter kmers at the ends of the read
      Hdist=2 #number of permitted mismatches


      for Prefix in `ls -1 *_R1.fastq.gz | sed 's/_R1.fastq.gz//'`
      do

      bbduk.sh -Xmx128g in1=$Prefix\_R1.fastq.gz in2=$Prefix\_R2.fastq.gz out1=$Prefix\_clean_R1.fastq.gz out2=$Prefix\_clean_R2.fastq.gz ref=$adapters ordered=$Ordered ktrim=$Ktrim k=$K mink=$Mink hdist=$Hdist tpe tbo

      done

      Remember my R1 and R2 files consist of concatenated sequences from different runs. Do you think this could be the reason?

      Many thanks

      Comment


      • #4
        As I said before as long as the files are concatenated in same order AND they had the same number of reads in sync across R1/R2 files to begin with this should work without any problems. If things are not working you need to make sure that the reads in your files are in sync. You can check on that using a different bbtool called "repair.sh".

        BBduk.sh needs very little memory there is no need to assign 128G for this job. 4G would be perfectly fine.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM
        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        17 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        46 views
        0 likes
        Last Post seqadmin  
        Working...
        X