SEQanswers

05-27-2009, 01:53 PM   #1
caddymob
Member
 
Location: USA

Join Date: Apr 2009
Posts: 36
3' Adapter Trimming

Hi everyone,

I am using Maq for alignments and have found the 3' adapter trimming to be very informative about my overall run/sample prep quality. However, I am not clear on how it actually works, and I have a couple of questions.

For instance, one lane has 13,145,392 quality-filtered reads. Using the adapter trimming option, I get 10,270,661 reads with possible adapter contamination and a total of 2,949,120 paired reads mapping (3,404,692 total mapped). So the number of mapped reads is greater than the number of reads NOT containing adapter: 13,145,392 - 10,270,661 = 2,874,731.

So, does Maq simply trim off any adapter and continue with alignment if the read is of sufficient length? Am I reading this correctly?

Next, I aligned the exact same reads to 3 different regions, and with everything else being equal I get a different count of possible adapter contamination for each. For the lane mentioned above, I get a) 10,270,661, b) 10,473,317 and c) 10,299,171 adapter counts. Again, these are the exact same reads, just aligned to a different region. The differences are not huge, but they are differences nonetheless.

I whipped together a super simple Perl script to count 3' adapters in my FASTQs and get nowhere near the same number.

Code:
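use strict;
use warnings;

my $adapt = 'GATCGGAA';   # adapter sequence to search for
my $count = 0;

# Read every line of the FASTQ file(s) given on the command line (or STDIN)
while (my $line = <>) {
  chomp $line;
  # Count lines that begin with the adapter sequence
  if ($line =~ m/^$adapt/) {
    $count++;
  }
}
print "\nThere are $count sequences with adapter!\n\n";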
Again, super simple, but with this I get 6,615,038 reads containing adapter for the aforementioned lane.
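One possible reason the simple count differs: the script above tests every line of the file (headers and quality strings included) and only matches at the start of a line. Below is a rough sketch of a stricter count that looks only at the sequence line of each 4-line FASTQ record and accepts either the full adapter anywhere in the read or a partial adapter hanging off the 3' end. This is not Maq's algorithm, which I have not checked; the helper names, the tiny in-memory FASTQ and the minimum overlap of 4 bases are all assumptions made for illustration.

```perl
use strict;
use warnings;

my $ADAPTER = 'GATCGGAA';   # same adapter string as the script above

# Return the 0-based position where the adapter starts in the read, or -1.
# A full-length match anywhere counts; otherwise a prefix of the adapter of
# at least $min_overlap bases (assumed cutoff) at the 3' end of the read counts.
sub adapter_start {
    my ($read, $adapter, $min_overlap) = @_;
    my $pos = index($read, $adapter);
    return $pos if $pos >= 0;
    for (my $len = length($adapter) - 1; $len >= $min_overlap; $len--) {
        next if $len > length($read);
        return length($read) - $len
            if substr($read, -$len) eq substr($adapter, 0, $len);
    }
    return -1;
}

# Count reads with adapter in a 4-line-per-record FASTQ stream.
sub count_adapter_reads {
    my ($fh, $adapter, $min_overlap) = @_;
    my ($total, $hits) = (0, 0);
    while (defined(my $header = <$fh>)) {
        my $seq  = <$fh>;
        my $plus = <$fh>;   # '+' separator line, ignored
        my $qual = <$fh>;   # quality line, ignored
        last unless defined $seq;
        chomp $seq;
        $total++;
        $hits++ if adapter_start($seq, $adapter, $min_overlap) >= 0;
    }
    return ($total, $hits);
}

# Made-up example records: full adapter, 4-base partial adapter, no adapter.
my $fastq = join "\n",
    '@read1', 'ACGTACGTGATCGGAAGAGC', '+', 'I' x 20,
    '@read2', 'ACGTACGTACGTACGTGATC', '+', 'I' x 20,
    '@read3', 'ACGTACGTACGTACGTACGT', '+', 'I' x 20,
    '';
open my $fh, '<', \$fastq or die "cannot open in-memory FASTQ: $!";
my ($total, $hits) = count_adapter_reads($fh, $ADAPTER, 4);
close $fh;
print "$hits of $total reads contain adapter\n";   # prints "2 of 3 reads contain adapter"
```

To run it on a real lane, open the FASTQ file instead of the in-memory string and pass that handle to count_adapter_reads. A shorter minimum overlap will report more reads, since short adapter prefixes also occur by chance in genuine sequence.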

Does anybody have some insight into any of these issues? Thanks everyone!