SEQanswers

Go Back   SEQanswers > Applications Forums > RNA Sequencing



Similar Threads
Thread Thread Starter Forum Replies Last Post
Problem removing duplicate reads? (samtools and picard) cbl Bioinformatics 19 09-17-2015 12:01 PM
example for using Picard removing duplicate reads? fabrice Bioinformatics 9 10-18-2013 03:32 AM
Duplicate Reads myronpeto Bioinformatics 7 03-07-2013 08:36 AM
Removing similar sequence reads loba17 Bioinformatics 4 10-17-2011 08:31 AM
Removing duplicate reads from multigig .csfasta Bueller_007 Bioinformatics 7 06-26-2010 04:07 PM

Reply
 
Thread Tools
Old 10-08-2010, 09:58 AM   #1
hong_sunwoo
Member
 
Location: Suwon, Korea

Join Date: Jan 2010
Posts: 11
Default Removing duplicate reads for tophat?

Hello.

I have two human single end RNA seq data generated from Solexa.
Yesterday, I ran fastqc using my data and the program reported that two data have a little amount of duplicated reads.

Should I remove this before processing Tophat?
And is there any programs which can be used for such purpose?

Thanks!!
hong_sunwoo is offline   Reply With Quote
Old 10-08-2010, 10:25 AM   #2
shurjo
Senior Member
 
Location: Rockville, MD

Join Date: Jan 2009
Posts: 126
Default

A small amount of duplication is going to be present in almost any RNA-Seq library. I would not remove these before running TopHat or any other aligners.

There are threads on this forum where this issue has been discussed in great detail, inncluding one which has a link to a mathematically rigorous treatment of the question by lh3, a senior user. Searching the forum with "duplicate reads" should point you in the right direction.

Best of luck,

Shurjo
shurjo is offline   Reply With Quote
Old 10-09-2010, 01:46 AM   #3
hong_sunwoo
Member
 
Location: Suwon, Korea

Join Date: Jan 2010
Posts: 11
Default

Thanks shurjo!
hong_sunwoo is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 06:12 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO