Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • FastQC, Kmer count, Trimmomatic: no success in trimming, still fail Kmer

    I'm using RNAseq for DGE and failed in trying to map paired end reads in TopHat2. Samples passed the first two parts of fastqc, but failed per base and Kmer count. Some samples did pass per base.

    I tried trimming them to remove adaptors, but nothing improved the Kmer plot. http://imgur.com/Fhbx534 All of the kmer plots from this data set (18) look similar. I used the adaptor settings built into trimmomatic, but none worked. I then emailed genewhiz and asked for their (truseq) adaptors, but trimming for those sequences didn't improve the plots.

    Ex per seq quality score http://imgur.com/5L5PaJU,Hg2mHk0#0
    per base http://imgur.com/5L5PaJU,Hg2mHk0#1

    Am I doing something wrong? Should I just trim everything across certain bases?

    Edit: reads are paired

    adaptor input file:
    >PrefixPE
    GATCGGAAGAGCACACGTCTGAACTCCAGTCAC
    >Prefix PE
    AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
    Last edited by skmotay; 10-09-2014, 06:22 AM. Reason: links for images

  • #2
    Are your reads paired? If so, you can trim them with BBDuk via overlap, without specifying the adapter sequence, like this:

    bbduk.sh in1=r1.fq in2=r2.fq out1=trimmed1.fq out2=trimmed2.fq tpe tbo

    That package comes with both TruSeq and Nextera adapters, which are another possibility that you can try.

    Comment


    • #3
      Yes, reads are paired.

      I think I should have mentioned that I'm working in Galaxy, which I think limits what tools I can use. I don't see BBDuk in my tools.

      Comment


      • #4
        Ah, too bad. I'll see if I can get BBTools put into Galaxy. Though you can still download it and try out the Nextera adapter file. I would recommend, though, that you contact the source of the data to find out what kind of adapters were actually used - that will save a lot of time.

        Comment


        • #5
          I emailed genewhiz and they sent me the adaptor sequences (listed I original post). I'm not quite sure I wrote the adaptor sequence file correctly and I couldn't find a clear example.

          Comment


          • #6
            Does Fastqc give you a warning for overrepresented sequences?

            I don't know the default -k setting for fastqc but maybe it helps to increase it.
            My impression is that the adapter trimming worked. But the overrepresented k-mers test fails due to other reasons

            Comment


            • #7
              They all pass overrepresented sequences.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:37 PM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 06:07 PM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              51 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              67 views
              0 likes
              Last Post seqadmin  
              Working...
              X