**Brian Bushnell** · 10-27-2014, 03:39 PM

The dedupe program in the BBTools package will handle this. It removes duplicate contigs when they are identical or when one is fully contained within the other, up to a maximum edit distance or Hamming distance that you specify, and it also handles reverse complements.

Syntax:
dedupe.sh in=assembly.fa out=deduplicated.fa
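
To make the matching criteria concrete, here is a minimal Python sketch of the exact-match case only: a contig is dropped if it, or its reverse complement, is identical to or fully contained in a longer contig that was already kept. This is an illustration of the idea, not how dedupe is implemented; the function names are hypothetical, there is no edit-distance or Hamming-distance tolerance, and the all-against-all comparison is only practical for small inputs.

```python
from typing import Dict

# Translation table for complementing DNA bases (N stays N).
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def dedupe_contigs(contigs: Dict[str, str]) -> Dict[str, str]:
    """Keep only contigs that are not duplicated by, or contained in,
    a longer kept contig in either orientation (exact matches only)."""
    # Process longest first so a containing contig is seen before
    # anything it contains; ties are broken by name for determinism.
    ordered = sorted(contigs.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    kept: Dict[str, str] = {}
    for name, seq in ordered:
        forward = seq.upper()
        reverse = reverse_complement(forward)
        redundant = any(
            forward in other.upper() or reverse in other.upper()
            for other in kept.values()
        )
        if not redundant:
            kept[name] = seq
    return kept


if __name__ == "__main__":
    example = {
        "a": "AAACCCGGGT",
        "b": "CCCGG",       # contained in a
        "c": "ACCCGGGTTT",  # reverse complement of a
        "d": "TTTTGGGG",    # unrelated
    }
    print(list(dedupe_contigs(example)))
```

In this toy input only contigs a and d survive, since b is a substring of a and c is the reverse complement of a.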

**cmbetts** · 10-27-2014, 04:55 PM

I don't have any solution for your Trinity issues, since I've mainly done human/mouse RNA-Seq, but here are a few possibilities for your strandedness issue (I'm assuming that you're using a dUTP-based method):

ActD freshness) The protocol is only ~80% strand-specific without actinomycin D (ActD) to prevent spurious 2nd strand synthesis, and ActD has a really terrible shelf life in solution at -20 °C.
Nucleotide carryover from 1st strand) If you don't sufficiently remove dTTP after the 1st strand step, it can be incorporated into the 2nd strand cDNA, preventing UDG digestion.
USER/UDG freshness) If the UDG enzyme has gone off, or wasn't incubated long enough, you could retain some of the 2nd strand cDNA.

It very likely could be a combination of the three. I'm not sure how you're determining correct strand vs. antisense, but I've seen >99% correct strand, based on ERCCs, using all fresh ingredients.
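
For reference, the ERCC-based number is just the fraction of spike-in reads that map to the expected strand. ERCC spike-ins are synthetic transcripts of known orientation, so reads on the opposite strand come from library prep rather than biology. A minimal sketch, assuming you have already counted sense and antisense reads over the ERCC sequences (the function name and the counts below are made up for illustration):

```python
def strand_specificity(sense_reads: int, antisense_reads: int) -> float:
    """Fraction of reads on the expected strand, between 0 and 1."""
    if sense_reads < 0 or antisense_reads < 0:
        raise ValueError("read counts must be non-negative")
    total = sense_reads + antisense_reads
    if total == 0:
        raise ValueError("no reads to evaluate")
    return sense_reads / total


if __name__ == "__main__":
    # Hypothetical counts, not data from this thread.
    print(f"{strand_specificity(9950, 50):.1%}")
```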

**nucacidhunter** · 10-27-2014, 08:32 PM

I would add the possibility of a biological process (antisense transcription) to cmbetts' comments. It is well known that in some regions both strands are transcribed.

**evt8** · 11-05-2014, 04:28 PM

Thanks all for your helpful responses. dedupe sounds like what we are after, and it's very helpful to know about potential library prep issues. We've discussed the observation with our sequencing service provider and will pass these suggestions on.

Remove reverse complement redundancy in stranded transcriptome
