
**GenoMax** · 02-01-2018, 06:27 AM

Welcome to SA!

You can use FastQC for assessment of quality of initial read data. This is pretty much the de facto program people use. The lab that made FastQC has many informative blog posts about data quality/observations at their QC Fail site.

Quast is what you would want to use for assessment of assemblies.

If these are bacterial genomes then Mauve allows you to do genome-wide alignments to quickly identify rearrangements.

I recommend BBMap suite (multiple threads here on various tools included). This suite has many tools that allow you to work with NGS data.

**berthenet** · 02-01-2018, 07:08 AM

Just lost my post, I got logged out while writing it. Note to self: always copy my answer to the clipboard before posting...

So, let's write it all again.

Thanks for the links you shared. Some of them I already knew, and I'll go and have a look at the others. I do work with bacterial genomes.

So most of my assemblies look fine in terms of number of contigs (<100 contigs) once I filter out the smallest ones (<1000 bp). However, for some of them the number of contigs remains really high, and when I check the total length of the assembly, I obtain 3 genomes of more than 2.4 Mb when I expect approximately 1.65 Mb. I checked the 30 largest contigs for one of these outlier strains by running blastn against the NCBI database. I noticed that some of the contigs don't match the species of interest. These contigs have a low coverage value (indicated in the name of the contig): around 1, against more than 200 for contigs matching the species of interest.

Do you usually filter your contigs based on this coverage value? Is that why I have weird sizes?
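To make the question concrete, here is a rough sketch of the kind of filter I have in mind. It assumes the contig names follow a SPAdes-style pattern such as `NODE_1_length_52345_cov_213.4` (that naming is an assumption, adjust the regular expression for other assemblers). The function names and the `min_cov=10.0` cutoff are only illustrative, not recommended values; the 1000 bp length cutoff is the one mentioned above.

```python
import re

# Assumption: headers look like NODE_<n>_length_<bp>_cov_<coverage>
# (SPAdes-style). Change this pattern if your assembler names contigs differently.
HEADER_RE = re.compile(r"length_(\d+)_cov_([\d.]+)")


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_contigs(records, min_length=1000, min_cov=10.0):
    """Keep contigs that pass both the length and the coverage cutoff.

    min_cov is an illustrative threshold, not a recommended value.
    Contigs whose header does not match the assumed pattern are dropped.
    """
    kept = []
    for header, seq in records:
        match = HEADER_RE.search(header)
        if match is None:
            continue
        coverage = float(match.group(2))
        if len(seq) >= min_length and coverage >= min_cov:
            kept.append((header, seq))
    return kept


def total_length(records):
    """Sum of contig lengths, i.e. the total assembly size in bp."""
    return sum(len(seq) for _, seq in records)
```

To run it on a real assembly, open the FASTA file and pass the file handle to `read_fasta`, then compare `total_length` before and after filtering to see how much of the extra 0.75 Mb or so comes from low-coverage contigs.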

Hello from a french bio-informatician looking for help with NextSeq500 Illumina data
