Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • fabrice
    Member
    • Oct 2009
    • 86

    Paired-end Illumina RNA-seq adapter trimming

    Does anyone can recommend a tool for Paired-end Illumina RNA-seq adapter trimming? Most of tools are for single-end sequence. I think if there a tools used for paired-end data.
    The shortcoming of the single end tools:
    1, The single end tools sometimes will make the paired-end data mess.
    2, It does not consider paired-end information, if there is a tool for paired-end, such remove will be accurately.
  • Simon Anders
    Senior Member
    • Feb 2010
    • 995

    #2
    HTSeq has facilities useful for this.

    You can write a little Python script like this:

    Code:
    import itertools
    import HTSeq
    
    in1 = iter( HTSeq.FastqReader( "mydata_1.fastq" ) )
    in2 = iter( HTSeq.FastqReader( "mydata_2.fastq" ) )
    out1 = open( "trimmed_1.fastq", "w" )
    out2 = open( "trimmed_2.fastq", "w" )
    
    for read1, read2 in itertools.izip( in1, in2 ):
       read1.trim_right_end( "ACGGTC" )
       read2.trim_left_end( "TTCGAC" )
       read1.write_to_fastq_file( out1 )
       read2.write_to_fastq_file( out2 )
          
    out1.close()
    out2.close()
    I haven't tested the script, and you will have to adjust it to your needs, of course.

    Note that you can make the trimming tolerant to read errors by either adding a second argument, giving the allowed fraction of mismatches, or using trim_right_end_with_quals, which accepts a maximum sum of mismatch qualities. See here and here for an explanation.

    Comment

    • fabrice
      Member
      • Oct 2009
      • 86

      #3
      Simon Anders,
      Thanks for your reply.
      Could you please also give an example how to trim 3' reads which have low quality? For example, set a cutoff value. Bellow this cutoff, the nucleotide will be trimmed.

      Thanks.

      Comment

      • saharnm
        Junior Member
        • Jul 2011
        • 1

        #4
        Take a look at "http://solexaqa.sourceforge.net/". It works on both single and paired-end reads.

        Comment

        • vbiaudet
          Member
          • Apr 2011
          • 14

          #5
          We have the same problem for paired-end of 100bp and sizing of 200 (fragment size) from Illumina GAIIx RNAseq. In many case we think we got an adapter in 3' that have been sequenced... we think that because the mapping is very bad in 3'. BUT we know that the Quality score is very good (Q> 30). The problem is the 3'end can contains just some bases from the adapter and not the complete sequence... when the list of adapter is known did you find a tool to solve this problem ?

          Thanks, vb

          Comment

          • ataheri
            Junior Member
            • Nov 2011
            • 7

            #6
            Trimmomatic

            We are using Trimmomatic as it is able to do trimming and adaptor removal for paired-end reads. At first we tried fastx-toolkit but then we switched to trimmomatic as a better choice for paied-end samples.

            Comment

            • relipmoc
              Member
              • Jul 2011
              • 58

              #7
              skewer

              You may also try skewer which is an adapter trimmer dedicated to Illumina paired-end data
              Last edited by relipmoc; 09-23-2013, 05:23 PM. Reason: title

              Comment

              • saran.ela123
                Junior Member
                • Aug 2013
                • 5

                #8
                May i know from where to download skewer ?

                Comment

                • FlorDV
                  Junior Member
                  • Jan 2015
                  • 1

                  #9
                  Hi saran.ela123

                  you can download it from here: http://sourceforge.net/projects/skewer/files/

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM
                  • SEQadmin2
                    Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                    by SEQadmin2

                    Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                    Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                    05-06-2026, 09:04 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Yesterday, 08:59 AM
                  0 responses
                  13 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 12:03 PM
                  0 responses
                  21 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 11:40 AM
                  0 responses
                  18 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 05-28-2026, 11:40 AM
                  0 responses
                  31 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...