Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Overrepresented sequences from FastQC report

    I have a question regarding overrepresented sequences from FastQC report. I really appreciate for your replies in advance. Please see attached.

    First, I am going to remove Illumina Single End PCR Primer 1 using cutadapt. After this, there are three TruSeq Adapters left. In fact, they share same sequences but just different length (69, 67, 65bp). In this case, can I just put the longest one among three as input then the rest two shorter ones are removed at the same time? Or should I do it the opposite way?
    Attached Files

  • #2
    I suggest longest to shortest. That way if the read matches the longer adapter then the adapter will be removed completely. If the read matches a short adapter then it would be caught later in the process.

    Comment


    • #3
      Just want to make sure, do you mean I can just put the longest sequence as input, right? Thanks for your reply.

      Comment


      • #4
        I would put in all three sequences starting from the longest to the shortest. Note that I don't use cutadapt very much and instead use trimmomatic (mainly because it worked properly with paired-end sequences which is the bulk of my work) however the times I do use cutadapt I tend to throw all possible vectors at it just to make sure I get clean results.

        Comment


        • #5
          I get it. Thanks a LOT.

          Comment


          • #6
            A more flexible approach is to not use a specific adapter sequence at all. Pretty much all Illumina adapters start with a common seed sequence (I think some of the small RNA ones do something different and strange, but I may be mis-remembering), and then diverge into the more specific barcodes and application specific variants.

            In trim galore (which is our wrapper around cutadapt which makes it a bit easier to work with, and adds some extra options), we default to using this seed sequence (AGATCGGAAGAGC) to trim with. This way it doesn't matter which variant you're using as they'll all get trimmed and you don't need to worry which barcodes you used in a particular run.

            Comment


            • #7
              Simon has a good point. You can just use the seed sequence since any read that contained an entire adapter beyond the seed sequence would be a very poor read indeed. I was focusing too much on your initial comment of having 3 adapters to get rid of instead of focusing on the best solution.

              Comment


              • #8
                Thanks all. This really helped.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Yesterday, 08:47 AM
                0 responses
                12 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                60 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                60 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 09:21 AM
                0 responses
                54 views
                0 likes
                Last Post seqadmin  
                Working...
                X