Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Originally posted by hartmaier View Post
    I haven't done it yet, but based on the discussion above, I like the idea of removing the external adapters first, then going back for the internal ones - I think this will work:

    cutadapt -a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -m 20 $fastq1 | cutadapt -b CTGTCTCTTATACACATCT -b AGATGTGTATAAGAGACAG -m 20 > $fastq1_trimmed

    cutadapt -a GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT -m 20 $fastq2 | cutadapt -b CTGTCTCTTATACACATCT -b AGATGTGTATAAGAGACAG -m 20 > $fastq2_trimmed

    This would be followed by steps to ensure pairs in each file. I use novoalign to align which allows alignment of two orientations.

    Thoughts?
    It works perfectly for me ! Thanks a lot !
    Diego'

    Comment


    • #17
      Originally posted by Diegodescarpates View Post
      It works perfectly for me ! Thanks a lot !
      Diego'
      Awesome. I have also used it successfully now. I've also found a neat little unix trick to allow both files to run at the same time (in order to utilize multiple processors) - see below. The '2>$1 | cat' is a little trick I found in another message board and redirects stderror to stdout and only moves onto the next step in a pipeline once both are finished...this effectively halves the time for trimming since both run at once.

      Code:
      (
      cutadapt ... > fastq1_trimmed &
      cutadapt ... > fastq2_trimmed &
      ) 2>$1 | cat

      Also, remember that trimming will almost always produce orphan reads which will cause error when you align. I immediately send trimmed output to the filtering script found here: http://seqanswers.com/forums/showthread.php?t=24076

      You are now ready to send to your favorite aligner.

      Comment


      • #18
        Code:
        (
        cutadapt ... > fastq1_trimmed &
        cutadapt ... > fastq2_trimmed &
        ) 2>$1 | cat
        Hi all,

        Could anyone provide a perl script solution for this task ?

        Thanks !

        Diego'

        Comment


        • #19
          Can this tool, NextClip, be a good option for Nextera mate-pair sequencing data analysis?
          Source code, user guide and example data are available from https://github.com/richardmleggett/nextclip/.

          (although I have not tested it yet)

          Comment


          • #20
            Originally posted by MesutOezil View Post
            Can this tool, NextClip, be a good option for Nextera mate-pair sequencing data analysis?
            Source code, user guide and example data are available from https://github.com/richardmleggett/nextclip/.

            (although I have not tested it yet)
            Thanks for sharing. I haven't seen NextClip. After a quick read it really appears the adapter trimming strategy seems similar to what came about in this thread. However, it appears to be making tradeoffs for the sake of de novo builds. For example, placing emphasis on reads with the junction adapter. This is very conservative. My final Nextera libraries have an average size of 700-800 base pairs (and extending well past 1kb). Since the shearing is random, the junction adapter can occur anywhere in this fragment. Standard 100bp paired end sequencing will likely miss the junction adapter in many fragments. They do (kind of) mention this is the discussion. The PCR duplication rate is very important but there are other tools already available for that (picard tools). I align with Novoalign which allows 2 insert sizes to be specified, allowing for "paired-end" contaminants to be aligned and excluded from downstream analyses. That said, the Nextera seems much, much better in this regard to the previous Illumina mate pair kit.
            So, my take away…if you don't have a reference, or you are doing a de novo build, this is probably useful, although at some cost.

            Comment


            • #21
              Now skewer can handle Nextera Long Mate Pair (LMP) reads, trimming all the adapters elegantly.

              skewer is very easy to use. Please have a try!

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM
              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              30 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              32 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              53 views
              0 likes
              Last Post seqadmin  
              Working...
              X