Seqanswers Leaderboard Ad

**GenoMax** · 01-30-2015, 04:20 AM

So BBSplit analysis did not work that well?

**cyanoevo** · 01-30-2015, 04:26 AM

Not quite... It seemed to work, and mapped a portion of reads to the references, but when I assembled from the mapped reads there were still lots of contigs that were clearly from contaminants. I must say, some samples have worked better than others (I am working on several different strains) which is why I'm beginning to doubt my extractions...

**GenoMax** · 01-30-2015, 05:07 AM

So that is certainly one possible explanation. You could do another round of BBSplit with assembled contaminants to see if you can weed some additional sequences out. Then go back and do the assembly again.

**GenoMax** · 01-30-2015, 05:10 AM

Give SPAdes a try to see if it does a better job of assemblies. At least it may put the bacterial contaminants together better so you can remove them.

**cyanoevo** · 01-30-2015, 06:31 AM

Thanks, I'll have a go with SPADES. Currently, my workflow is something like this:

1) Trim adapters + poor quality reads - Trimmomatic

2) map reads to multiple cyanbacteria genomes - BBsplit

3) assemble mapped reads - ABYSS/SPADES

4) Separate contigs based on taxonomic affiliation - PhyloPythia

I was then thinking about using the initial reads to try and extend the cyanobacterial contigs. I've tried this using IMAGE (which didn't work) and PRICE (which ran out of memory on my 32 GB desktop). Currently the assemblies are generating huge numbers of contigs, many of which are short and obviously want to get this number down without throwing away useful data...

Does all this sound like a reasonable way of going about things? There is such a huge amount of available software out there it's hard to see the wood for the trees sometimes...

**GenoMax** · 01-31-2015, 06:23 AM

You may need to try and iterate between 2 and 3 to see if you can improve things. If the cynobacterial DNA is underrepresented in the current library then you may need to do another prep.

**Brian Bushnell** · 01-31-2015, 10:00 AM

When you say there are contigs from contaminants... what kind of contaminants are you talking about? The wrong strain, or the wrong phylum entirely, or synthetic lab molecules?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Bad assembly or bad sequence data?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News