Is it ok to split an aligned bam by chromosome and then run MarkDuplicates on each of the files?
-
I wouldn't worry about splitting by chromosome and running MarkDuplicates; in fact, it seems quite astute to me. Do you expect gene duplicates, and is that why you are doing it? Or is it due to limited computational resources, or some other issue?
I have not tried this, but I would be interested to see whether differences occur between split-by-chromosome and full BAMs. Can you report back if you do that?
-
Originally posted by Heisman:
No, in the sense that you'll miss paired-end (PE) reads whose two mates map to different chromosomes. This is a key advantage of using MarkDuplicates over the samtools method.
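To gauge how much this matters for a given data set, one could count the pairs whose mates align to different chromosomes. The following is only a minimal sketch: it assumes plain SAM text as input (for example, the output of `samtools view`), and the function name `count_interchromosomal` is hypothetical, not part of any tool.

```python
def count_interchromosomal(sam_lines):
    """Return (mapped_paired_reads, reads_whose_mate_is_on_another_reference).

    Assumes tab-separated SAM records; header lines starting with '@' are skipped.
    """
    total_paired = 0
    split_pairs = 0
    for line in sam_lines:
        if line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete SAM record
        flag = int(fields[1])
        rname, rnext = fields[2], fields[6]
        # Keep only paired reads (0x1) that are primary alignments
        # (neither secondary 0x100 nor supplementary 0x800).
        if not flag & 0x1 or flag & 0x900:
            continue
        # Both the read (0x4) and its mate (0x8) must be mapped.
        if flag & 0x4 or flag & 0x8 or rnext == "*":
            continue
        total_paired += 1
        # In SAM, RNEXT is "=" when the mate is on the same reference.
        if rnext not in ("=", rname):
            split_pairs += 1
    return total_paired, split_pairs
```

Passing `sys.stdin` as `sam_lines` would let the function read from a pipe. A high second number suggests that many pairs would be separated by a per-chromosome split.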
-
I've been asked to make a pipeline run faster on a distributed system. We can parallelize most of the steps (alignment, some of the GATK steps, etc.), but MarkDuplicates is quite time-consuming. If I split by chromosome and then run it on multiple machines, it finishes much faster, but I don't want this to affect the results.
I am planning tests to compare the resulting deduplicated BAMs soon. I will post the results.
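One way such a comparison could be set up is to collect the reads flagged as duplicates (SAM flag 0x400) in each output and diff the two sets. This is a rough sketch under stated assumptions: both inputs are SAM text, the per-chromosome outputs have already been concatenated into one stream, and the function names are hypothetical.

```python
def duplicate_keys(sam_lines):
    """Collect (read name, mate number) for primary alignments flagged as duplicates."""
    dups = set()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete SAM record
        flag = int(fields[1])
        if flag & 0x900:
            continue  # ignore secondary and supplementary alignments
        if flag & 0x400:
            # 0x40 = first in pair, 0x80 = second in pair, 0 = unpaired/unknown
            mate = 1 if flag & 0x40 else 2 if flag & 0x80 else 0
            dups.add((fields[0], mate))
    return dups


def compare_duplicate_calls(full_run, split_run):
    """Compare duplicate calls from the whole-BAM run and the split-by-chromosome run."""
    full = duplicate_keys(full_run)
    split = duplicate_keys(split_run)
    return {
        "only_full": full - split,
        "only_split": split - full,
        "both": full & split,
    }
```

If `only_full` and `only_split` both come back empty, the two approaches marked the same reads; any entries there are the reads worth inspecting by hand.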
-
Read through this FAQ for possible tips/explanations to make it faster: http://sourceforge.net/apps/mediawik...=Main_Page#FAQ
bruce01, I have no idea how common they are, but if you want to remove the duplicates, I don't think you can split the BAM file up by chromosome (I could in theory be wrong; I've never considered doing this).