Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Help needed on Fastqc-adapter content

    Dear all,

    i have reads from nextseq 500. I have removed adaptors using fastq-mcf. I run Fastqc for the trimmed reads. I am getting fail result for adapter content,k-mer and per base seqeunce content.
    Whether iam missing any adapters to remove?. I can see the probelm with first 30bp. I have attached adapter seq and screenshopt of fastqc results. Any help will be highly useful to me.

    Thanks

    Regards
    Manoj
    Attached Files

  • #2
    Is this RNAseq data? The bias seen at the beginning of reads is a common observation because of the "random" primers not being so random (http://seqanswers.com/forums/showthr...t=4846&page=14) and does not need trimming. A similar bias is also seen with tagmentation reactions and is normal.

    Comment


    • #3
      Thanks. It is exon data. Can i remove first 20bp. Is there any other way to remove that bias. i have mapped with bwa and got mapping quality of 60%. So i need to remove this bias in the beginning of the reads.

      Comment


      • #4
        Does the mapping quality (60%) refer to the percentage of reads that are mapping? I doubt the low % is because of the first 20 bp but if you want to give it a try you could remove them and see if it improves your mapping.

        What kind of reference genome are you mapping to? Is it a reasonably complete one (e.g. human)?

        More than likely you may have some other issue (primer dimers etc) that may be preventing reads from mapping.

        Comment


        • #5
          Yes. only 60% of paired reads mapped to human genome(hg19). how to identify primer dimer in my reads and remove those from my reads.

          Comment


          • #6
            There are two problems here. You have the typical RNA-Seq kmer bias problem at the start of your reads showing up in your Kmer and base composition plots. This isn't something you can fix, or that would be improved by trimming so don't worry about this.

            Secondly you have read-through adapter contamination of your reads as shown by the adapter content plot however this is not produced by the normal common Illumina adapter, but comes from the transposase which Illumina uses to fragment their libraries in some of their kits. This means that your data would benefit from being trimmed to remove this and this will help with the mapping efficiency. You'll need to modify the default options for the trimming program you use to specify that it's the transposase sequence (CTGTCTCTTATA) which you want to remove. Pretty much all trimming programs will allow you to specify this sequence. After trimming you should see the adapter content plot be flat in the trimmed data and hopefully your mapping efficiency will go up too.

            Comment


            • #7
              Thanks for your help. I have removed the transposase sequence and now i can see the adapter content plot flat. I will map the reads and see the maping quality.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Advancing Precision Medicine for Rare Diseases in Children
                by seqadmin




                Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                12-16-2024, 07:57 AM
              • seqadmin
                Recent Advances in Sequencing Technologies
                by seqadmin



                Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                Long-Read Sequencing
                Long-read sequencing has seen remarkable advancements,...
                12-02-2024, 01:49 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 12-17-2024, 10:28 AM
              0 responses
              33 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-13-2024, 08:24 AM
              0 responses
              49 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-12-2024, 07:41 AM
              0 responses
              34 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-11-2024, 07:45 AM
              0 responses
              46 views
              0 likes
              Last Post seqadmin  
              Working...
              X