**SylvainL** · 03-03-2016, 01:51 AM

Hi,

did you map your reads against the closely related genome or directly on your transcriptome assembly? If you mapped to the transcriptome, simply use bedtools to get the number of reads on each transcript...

**colindaven** · 03-03-2016, 02:32 AM

Suggestions

#1 Remap reads to the closely related genome. Satisfied with the mapping rate?

#2 Use gmap (easy) or Maker to map your de novo assembled transcripts to the related genome. Again, satisfied? View both sets in a genome browser.

#3 If unsatisfied with #2, perhaps use Trinity genome-guided or cufflinks to recreate the transcripts.

#4 Quantify the transcripts from #2 or #3, e.g. using featureCounts.

Forget blast for this kind of approach.

**moldach** · 03-03-2016, 12:06 PM

Originally posted by SylvainL

Hi,

did you map your reads against the closely related genome or directly on your transcriptome assembly? If you mapped to the transcriptome, simply use bedtools to get the number of reads on each transcript...

I mapped directly on my transcriptome assembly.

I couldn't find any reference to getting the number of reads on each transcript (maybe it's just worded differently?) from the documentation of bedtools. However, I found a Biostars link that suggested using the multicov sub-command in the bedtools suite.

However, according to the documentation, multicov from BEDtools requires a genome annotation file. For example:

>bedtools multicov -bams run.bam -bed genes.bed

Are you talking about another sub-command or can multicov be run without the bed file?
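One possible way around the missing annotation, when the reads were mapped to the transcriptome itself, is a BED file with one interval per transcript, built from the FASTA index (the .fai file written by samtools faidx). The following is only a minimal sketch under that assumption; the function name fai_to_bed is hypothetical and not part of bedtools or samtools.

```python
def fai_to_bed(fai_lines):
    """Turn FASTA index (.fai) lines into BED lines covering each whole sequence.

    A .fai line is tab-separated; the first two columns are the sequence
    name and its length. BED intervals are 0-based and half-open, so a
    whole transcript of length N is written as 0..N.
    """
    bed = []
    for line in fai_lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        name, length = line.split("\t")[:2]
        bed.append(f"{name}\t0\t{int(length)}\t{name}")
    return bed
```

Writing the returned lines to a file (say transcripts.bed, a placeholder name) would give something that can be passed to the -bed option of the multicov command shown above.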

**moldach** · 03-03-2016, 12:44 PM

Originally posted by colindaven

Suggestions

#1 Remap reads to closely related genome. Satisfied with mapping rate?

Some time ago I tried mapping to the closely related, un-annotated genome. Unfortunately, I used Bowtie2. I now know better; for RNA-seq reads against a genome you need to use a splice-junction aware aligner.

Originally posted by colindaven

Suggestions

#2 Use gmap (easy) or Maker to map your de novo assembled transcripts to the related genome. Again, satisfied? View both sets in a genome browser.

So GMAP maps and aligns with this command:

>gmap -d <genome> -A <cdna_file>

And it would output SAM files.

What I don't understand is how (or if) GMAP annotates this genome?
The documentation for maker on the other hand clearly states it annotates but I can't find anything in the GMAP documentation.

Will gmap and Maker output an annotation file including chromosomal coordinates of features (GTF)? The featureCounts documentation says that such a file is required.

**moldach** · 03-07-2016, 12:16 PM

Can anyone help?

**SylvainL** · 03-07-2016, 11:35 PM

Hi,

since you aligned directly on your transcriptome, I guess your reference contains all the transcripts, so you can get the counts for each one by using samtools idxstats.
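As a rough illustration of that suggestion: samtools idxstats prints one tab-separated line per reference sequence (name, length, mapped reads, unmapped reads), plus a final "*" line for reads with no reference. A small sketch that turns this output into per-transcript counts could look like the following; the function name parse_idxstats is hypothetical.

```python
def parse_idxstats(lines):
    """Return {transcript_name: mapped_read_count} from samtools idxstats output.

    Each line has four tab-separated columns: reference name, reference
    length, number of mapped reads, number of unmapped reads. The trailing
    "*" line (reads without a reference) is skipped.
    """
    counts = {}
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 4 or fields[0] == "*":
            continue
        name, _length, mapped, _unmapped = fields[:4]
        counts[name] = int(mapped)
    return counts
```

idxstats needs a coordinate-sorted, indexed BAM file. Its output can be redirected to a text file and the open file handle passed to the function above.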

**colindaven** · 03-08-2016, 12:27 AM

A GMAP command which produces GFF3 output might look like this:

~/gmap-2015-07-23/bin/gmap -f gff3_gene -D gmap/ -d mygenome.fasta.gmap -B 5 -t 12 --intronlength=50000 --totallength=1000000 -p 3 --npaths=1 transcripts.fa > transcripts.gff3

This is a nice GFF3 which can be used directly by "bedtools multicov".

If you want to use featureCounts for read counting, try using ngsutils to convert from GFF3 to GTF.

NGSUtils - gtfutils

http://ngsutils.org/modules/gtfutils/

**shi** · 03-09-2016, 02:20 PM

featureCounts works with both GTF and GFF formats. I think it should be fine if you directly provide your GFF3 annotation to the featureCounts program.
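When a GFF3 file is given to featureCounts directly, the feature type (-t) and grouping attribute (-g) usually have to match what is actually in the file, since the default grouping attribute gene_id is a GTF convention. A quick way to check is to list which attribute keys appear on a given feature type. This is only a sketch; the function name attribute_keys is hypothetical, and which keys a particular GFF3 contains is something to verify rather than assume.

```python
def attribute_keys(gff3_lines, feature_type="exon"):
    """Count attribute keys (column 9) on GFF3 features of one type.

    Returns a plain dict such as {"ID": 10, "Parent": 10}, which shows
    the candidate values for the featureCounts -g option.
    """
    keys = {}
    for line in gff3_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip comments, directives and blank lines
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != feature_type:
            continue
        for pair in cols[8].split(";"):
            if "=" in pair:
                key = pair.split("=", 1)[0].strip()
                keys[key] = keys.get(key, 0) + 1
    return keys
```

If, for example, the exon lines turn out to carry a Parent attribute, a call along the lines of `featureCounts -t exon -g Parent -a transcripts.gff3 -o counts.txt reads.bam` (file names are placeholders) would group exons by their parent transcript.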


Assigning reads to genes in the absence of genomic annotation
