Seqanswers Leaderboard Ad

**westerman** · 01-27-2015, 08:56 AM

Combining reads into one set of files would not be a good idea. However assemblers such as ABySS will happily take two or more sets of files and treat each one as separate entities.

I am not saying that ABySS is the best assembler for your work -- although it is my 'go-to' assembler for large projects -- but do suggest giving it a try. In your case I would tell ABySS that I had 4 different paired-end libraries (the HiSeq data) and a single-end library (the MiSeq merged reads).

As an answer to your final paragraph, "Do you think an alternative would be to assemble the HiSeq and MiSeq data separately and then combine them using an OLC ...", yeah, that should work as well. minimus/bambus would be what I would use. Not sure if they are 'best' though.

**bioBob** · 01-28-2015, 05:09 AM

I would check out MaSuRCA. As input, you would give it the raw reads, not trimmed or stitched together. Each read set would be a unique library.

I have done this with a few different genomes with varying success. When I had better success, it was generally not by sequencing a single library with longer reads (overlapped or not), it was by sequencing a new library with longer reads. My opinion on why is because you end up averaging out library prep biases when you have more libraries.

In the end, you will probably find that you still need a completely different data type to get to a decent assembly. MP, long (>1k) reads, targeted high depth, etc.

**panos_ed** · 01-29-2015, 03:47 AM

Thank you both guys for the hints!

westerman, I'll give ABySS a go and see what I get.

bioBob, I had tried MaSuRCA about a year ago, but was really disappointed (very, very buggy!). I heard though there's a new version that has lots of bug fixes. I think I'll also give that a try.

**westerman** · 01-29-2015, 07:52 AM

Let us know about MaSuRCA. The one time I tried it I was disappointed as well.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 37 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Hybrid assembly using HiSeq and MiSeq data

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News