Seqanswers Leaderboard Ad

**GenoMax** · 02-19-2014, 03:14 PM

If the first data set was done a few years ago you would want to check what format the quality values are in. They may be in "illumina" format and would need to be converted to "sanger" quality if you are going to do any combined analysis. (Ref: http://en.wikipedia.org/wiki/FASTQ_format#Encoding)

Depending on the amount of data available for each of those runs (and time you can spend) you could do all mentioned options in parallel. With a bacterial genome it should not be very time consuming affair.

**Wallysb01** · 02-20-2014, 09:36 AM

Depending on what type of data each is in terms of SE/PE or 50bp/100bp, etc, it maybe worth it to just completely ignore the old data. Though you don’t mention it, I’d bet you have absurd coverage, and it is possible to “over assemble” a genome with ridiculous coverage.

Like GenoMax said, you might as well do all of them with a bacterial genome, but without more specifics its hard to recommend which route is likely to be better.

**Dagga** · 03-06-2014, 05:56 PM

Hi All,

The I have assembled both sets of data individually and it seems the data for the second run is not as good as there is some contamination. So I have a further question.

If I was going to use my first assemble as a reference, how do I map the reads of the second run to this?

Also - if I use my first run as a reference, can the contigs be lengthed using the read reads or will my contigs be limited in size to the reference and can no longer be expanded, even with new reads.

Cheers

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Best way to perform assembly with two sets of raw data

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News