|Thread||Thread Starter||Forum||Replies||Last Post|
|Low GC, fragments, low read alignment||jmah||Bioinformatics||0||03-15-2017 03:18 PM|
|short reads Assembly in contigs||mido1951||Illumina/Solexa||12||11-04-2015 07:13 AM|
|Low mapping percentage of reads on assembled contigs||morning latte||Bioinformatics||20||03-23-2014 09:01 AM|
|Minimum short read required for transcriptome assembly||edge||Bioinformatics||4||08-25-2013 09:16 PM|
|Minia: ultra-low memory contigs assembly||rayanc||De novo discovery||15||04-01-2013 05:31 AM|
|03-15-2017, 04:47 PM||#1|
Join Date: Sep 2016
Transcriptome assembly: Low GC, short contigs, low read alignment
Sorry if this is a repeat. I posted earlier but now can't find it under the posts/threads listed under my username...
I am trying to troubleshoot two de novo Trinity assemblies. They were sequenced during the same run for two species of sponge, and I obtained 2x150 bp reads to a depth of 124x. We already have a whole transcriptome for each species assembled, but for our purposes I would like a de novo assembly. The GC content of my new assemblies are 3-7% lower than our old assemblies. Furthermore, my assemblies have many short contigs (ie. N50: 800 bp, cf. to 1800 bp of the old assemblies, median length: 300 vs 800 bp, mean length: 600 vs 1200 bp). The nail on the coffin is that there are few reads aligned in proper paired orientation when mapped back to my de novo assemblies: ~50% in proper pairs.
I am most worried about the GC content. GC content of the reads are similar to our old transcriptomes and only lower after assembly. I have changed adapter trimming parameters and tried out the jaccard clip setting for Trinity, but my assembly stats remain almost identical each run.
Has anyone received assemblies with low GC and short contigs before? If so, what did you do to fix that?
Thanks! If there's any more information that can prove helpful, please let me know.
|gc content, transcriptome assembly, trinity 2.0.6|