Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • FastQC, Kmer count, Trimmomatic: no success in trimming, still fail Kmer

    I'm using RNAseq for DGE and failed in trying to map paired end reads in TopHat2. Samples passed the first two parts of fastqc, but failed per base and Kmer count. Some samples did pass per base.

    I tried trimming them to remove adaptors, but nothing improved the Kmer plot. http://imgur.com/Fhbx534 All of the kmer plots from this data set (18) look similar. I used the adaptor settings built into trimmomatic, but none worked. I then emailed genewhiz and asked for their (truseq) adaptors, but trimming for those sequences didn't improve the plots.

    Ex per seq quality score http://imgur.com/5L5PaJU,Hg2mHk0#0
    per base http://imgur.com/5L5PaJU,Hg2mHk0#1

    Am I doing something wrong? Should I just trim everything across certain bases?

    Edit: reads are paired

    adaptor input file:
    >PrefixPE
    GATCGGAAGAGCACACGTCTGAACTCCAGTCAC
    >Prefix PE
    AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT
    Last edited by skmotay; 10-09-2014, 06:22 AM. Reason: links for images

  • #2
    Are your reads paired? If so, you can trim them with BBDuk via overlap, without specifying the adapter sequence, like this:

    bbduk.sh in1=r1.fq in2=r2.fq out1=trimmed1.fq out2=trimmed2.fq tpe tbo

    That package comes with both TruSeq and Nextera adapters, which are another possibility that you can try.

    Comment


    • #3
      Yes, reads are paired.

      I think I should have mentioned that I'm working in Galaxy, which I think limits what tools I can use. I don't see BBDuk in my tools.

      Comment


      • #4
        Ah, too bad. I'll see if I can get BBTools put into Galaxy. Though you can still download it and try out the Nextera adapter file. I would recommend, though, that you contact the source of the data to find out what kind of adapters were actually used - that will save a lot of time.

        Comment


        • #5
          I emailed genewhiz and they sent me the adaptor sequences (listed I original post). I'm not quite sure I wrote the adaptor sequence file correctly and I couldn't find a clear example.

          Comment


          • #6
            Does Fastqc give you a warning for overrepresented sequences?

            I don't know the default -k setting for fastqc but maybe it helps to increase it.
            My impression is that the adapter trimming worked. But the overrepresented k-mers test fails due to other reasons

            Comment


            • #7
              They all pass overrepresented sequences.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Recent Advances in Sequencing Analysis Tools
                by seqadmin


                The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                Yesterday, 07:48 AM
              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 06:57 AM
              0 responses
              7 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 07:17 AM
              0 responses
              13 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-02-2024, 08:06 AM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-30-2024, 12:17 PM
              0 responses
              21 views
              0 likes
              Last Post seqadmin  
              Working...
              X