Go Back   SEQanswers > Applications Forums > RNA Sequencing

Similar Threads
Thread Thread Starter Forum Replies Last Post
STAR RNA-seq aligner yields different read length than fastqc/zcat? LeonDK RNA Sequencing 2 01-26-2015 01:25 AM
How does read length affect sequencing depth for paired end RNA-Seq? bob-loblaw RNA Sequencing 2 02-15-2013 03:02 PM
picard error: Mismatch between read length and quals length writing read shawpa Bioinformatics 0 08-20-2012 05:52 AM
RNA-seq data with read of >= 50-bp length Xi Wang Bioinformatics 0 01-15-2010 12:01 AM

Thread Tools
Old 12-07-2015, 10:16 AM   #1
Location: New Brunswick

Join Date: Aug 2012
Posts: 16
Default RNA seq PE vs SE read length

Hello everyone,

We are about to send some samples off for sequencing, and I would like some input before we spend all of our money.

We have RNA from a wild type and a mutant bacteria, with four biological replicates each (eight samples total). We wish to examine all the differentially expresses genes between the genotypes, and I was wondering what is the best read length to use?

Our genome is sequenced and assembled (4.9 Mbp), and we will be generating rRNA-depleted stranded libraries to be multiplexed on a single Illumina HiSeq lane.

What would be the optimal method to sequence? 50bp vs 100bp, SE vs PE?
Tom_C is offline   Reply With Quote
Old 12-07-2015, 12:36 PM   #2
Brian Bushnell
Super Moderator
Location: Walnut Creek, CA

Join Date: Jan 2014
Posts: 2,707

50bp SE would be sufficient for the vast majority of genes in most bacteria. 2x100bp is always better, of course, but only marginally for bacterial RNA expression.

But if you want, you can use KmerCountExact like this: in=genome.fa k=50 khist=stdout

For E.coli, I got this output:

#Depth  Count
1       4536984
2       11541
3       5674
4       1023
5       1692
6       1635
7       4075
8       30
9       117
10      961
11      68
For 4639675 bases, 4536984 kmers occurred exactly once in the genome. That means 97.8% of error-free 50bp reads will map uniquely. It only increases to 4552752 for k=100, which is not much of an improvement. Longer, paired reads are more useful when dealing with differentially-spliced or highly repetitive eukaryotes.
Brian Bushnell is offline   Reply With Quote

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 07:44 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO