Go Back   SEQanswers > Applications Forums > RNA Sequencing

Similar Threads
Thread Thread Starter Forum Replies Last Post
cuffmerge assembly vs denovo assembly of RNAseq data skm Bioinformatics 0 10-16-2013 09:16 PM
Denovo assembly Thenna Bioinformatics 2 05-06-2013 06:09 AM
High duplication percentage klebsiella Illumina/Solexa 2 01-17-2013 04:45 AM
High percentage of N calls in the library apredeus Bioinformatics 2 12-08-2012 03:27 PM
crossmatch/phrap: improper assembly pag Bioinformatics 12 09-13-2012 08:51 AM

Thread Tools
Old 11-20-2013, 07:26 AM   #1
Junior Member
Location: Barcelona

Join Date: Nov 2013
Posts: 1
Default deNovo assembly. High percentage of improper pairs.


I'm trying to obtain the best de novo transcriptome assembly for my data. I pooled together all the reads from my different samples and individuals and filtered the data using Trimmomatic (and removed Illumina adaptors). The reads are 150bp and only reads longer than 100bp were used.

general stats
Total trinity transcripts: 1295606
Total trinity components: 776170
Contig N50: 981
#read_type count pct
proper_pairs 119877694 44.02
improper_pairs 111202610 40.84
left_only 25080118 9.21
right_only 16146667 5.93

Total aligned reads: 272307089

My question is, which reasons could be behind obtaining such a high number of improper pairs aligned and how could we improve our assembly?
biojl is offline   Reply With Quote

bowtie2, de novo transcriptome, illumina, rnaseq, trinity assembly

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 08:22 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO