Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TruSeq Adaptors reported by FastQC are true adaptors?

    I asked a question about using fastx_clipper to get rid of the adaptor sequence last time and it did not work.

    I googled TruSeq adaptor sequences (http://www.omicsoft.com/downloads/ng...on_list/v1.txt) and compared them with our reported TruSeq Adaptors. Because our sequences have length between 49-52, all reported adaptor have length 50. I choose the first 50 nt from TruSeq adaptor to compare with ours, the results are as follows:
    4 reported TrueSeq Adaptors are exactly the same as the first 50nt of TruSeq adaptor;
    3 reported TrueSeq Adaptors have almost the same sequence as the first 50 nt except the 42nd nt, they all replace A by C;
    2 reported TrueSeq Adaptors, their 2nd-50 nt are the same as TruSeq Adaptor 1-49nt

    I decided to try to align my reads without get rid of adaptor sequence by novoalign (using default setting) to see whether those sequences reported as TruSeq Adaptors are in the result of alignment. Unfortunately, they are. But if I specify the reported adaptor sequence as adaptor in novoalign, these sequence will be removed.

    So my questions now is whether those sequences reported as TruSeq Adaptor by FastQC are true Adaptor sequences or not?

  • #2
    If Ii remind correctly fastQC allows for some mismatches , you may also wish to compare identified adapters to fastQC database contaminants within fastQC folder
    Pbseq

    Comment


    • #3
      FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

      Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

      Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

      Comment


      • #4
        Thank you very much, Simon. That makes more sense to me.

        I want to correct the observation I mentioned earlier. I double checked manuscript of novoalign, the reported adaptor sequences are not aligned to any region, I am assuming they are true adaptors.


        Originally posted by simonandrews View Post
        FastQC allows some flexibility in its matches, it also doesn't require a match to exist over the whole length of the sequence. The summary of the match will tell you how good a match it actually found.

        Many of the illumina adapters are very similar to each other, differing by only a few bases so FastQC often finds a multitude of possible hits, so it just picks the first of the best set of hits to report.

        Given that the program only does these searches for sequences which occur at very high levels in a library it's pretty unusual to get a complete false positive for the presence of an adapter sequence, although the identification of the exact adapter used may well not be correct.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Strategies for Sequencing Challenging Samples
          by seqadmin


          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
          03-22-2024, 06:39 AM
        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:37 PM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 06:07 PM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2024, 10:03 AM
        0 responses
        50 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-21-2024, 07:32 AM
        0 responses
        67 views
        0 likes
        Last Post seqadmin  
        Working...
        X