
**chadn737** · 03-05-2013, 09:36 AM

Why not do a de novo assembly?

**jgibbons1** · 03-05-2013, 02:16 PM

@cyendrek I've had the same issue before. Rather than using bowtie2 (which I typically use too), try using seqmap and rseq. Both programs are in the rseq package: http://www-personal.umich.edu/~jianghui/rseq/

You first need to map the reads to the reference (or in your case the unigene/EST set) using seqmap. Then you can generate RPKM values and read counts per gene/EST using rseq. I've used this pipeline quite a bit in the past, so let me know if you have any problems.

Here's an example command line of seqmap allowing 2 mismatches:

[user]$ seqmap 2 ReadFile.fasta Reference.fasta Output.seqmap /eland:3

Here's an example command line of rseq assuming read length is 50 bp:

[user]$ rseq comp_exp -r 50 Reference.fasta Output.seqmap

This will create a file with the "comp_exp" extension that has the number of mapped reads, the number of uniquely mapped reads and RPKM values (among other stats).
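For anyone unsure what the RPKM column means: RPKM is usually defined as reads per kilobase of transcript per million mapped reads. Below is a small Python sketch of that arithmetic, assuming the standard definition. Whether rseq computes it in exactly this way is not confirmed here, and the function name and the numbers are hypothetical, chosen only for illustration.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads.

    Standard definition (an assumption about what rseq reports):
        RPKM = read_count * 1e9 / (gene_length_bp * total_mapped_reads)
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Hypothetical example: 500 reads on a 2,000 bp EST, 10 million mapped reads.
print(rpkm(500, 2000, 10_000_000))  # 25.0
```

The 1e9 factor is just the per-kilobase (1e3) and per-million (1e6) scalings combined.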

A few words of wisdom: your reads must be in FASTA format, so convert FASTQ to FASTA first (I use the FASTX-Toolkit for this). Also, seqmap uses a lot of memory, so I usually break my read file up into batches of 5-10 million reads. I then map these batches independently against the reference, merge the output files, and run rseq on the merged output.
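The convert-and-split step can be sketched in Python as below. This is not the FASTX-Toolkit, just a minimal stand-in that assumes plain four-line FASTQ records (no wrapped sequence lines). The function names and the output file naming are hypothetical.

```python
from itertools import islice


def fastq_to_fasta(fastq_lines):
    """Yield (name, sequence) pairs from four-line FASTQ records."""
    lines = iter(fastq_lines)
    while True:
        record = list(islice(lines, 4))
        if not record:
            return
        if len(record) < 4:
            raise ValueError("truncated FASTQ record")
        header = record[0].rstrip("\r\n")
        sequence = record[1].rstrip("\r\n")
        if not header.startswith("@"):
            raise ValueError(f"unexpected FASTQ header: {header!r}")
        yield header[1:], sequence


def write_batches(records, batch_size, prefix):
    """Write FASTA records to numbered files of at most batch_size reads.

    Returns the list of file paths written, e.g. prefix.001.fasta.
    """
    paths = []
    out = None
    try:
        for index, (name, sequence) in enumerate(records):
            if index % batch_size == 0:
                # Start a new batch file every batch_size reads.
                if out is not None:
                    out.close()
                path = f"{prefix}.{len(paths) + 1:03d}.fasta"
                out = open(path, "w")
                paths.append(path)
            out.write(f">{name}\n{sequence}\n")
    finally:
        if out is not None:
            out.close()
    return paths


# Example usage (file names are placeholders):
# with open("ReadFile.fastq") as handle:
#     write_batches(fastq_to_fasta(handle), 5_000_000, "ReadFile.batch")
```

Each batch file can then be passed to seqmap in place of ReadFile.fasta. Reading the input as a stream keeps memory use low even for large read files.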

Good luck!


read counts without gff3 file
