Seqanswers Leaderboard Ad

**lethalfang** · 10-23-2013, 02:27 PM

Originally posted by gevielr View Post

I'm looking for a script that I could use to remove all homopolymer reads from my 100 bp PE reads. These are unusually overabundant in my sample.

I can't seem to find a script that will do this. I've tried to use fastx_clipper, but defining the adapter as a homopolymer, and that didn't work (defined adaptor too long?).

Any ideas?

I wrote a script in python3 that does that. You may find that useful. If you're interested, email me [email protected]

**gevielr** · 10-23-2013, 03:38 PM

Great, thanks!! I'll shoot you an email.

**Wallysb01** · 10-23-2013, 04:00 PM

You could also use dust via prinseq. You might have to play around with the scoring to get a sense of the type of reads you're losing, but it should become pretty clear when you're dumping only very low complexity stuff, like say trinucleotide repeats and shorter.

**bfantinatti** · 07-31-2014, 10:37 AM

Dust

I am using DUST to perform this task.
But DUST do not remove the reads with low complexity. It put the low complexity bases in lowercase.
I don't know how to remove those lowercase reads after running DUST.

How do you guys do to remove those reads after running DUST?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

discarding homopolymer reads

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News