SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
"allele balance ratio" and "quality by depth" in VCF files efoss Bioinformatics 2 10-25-2011 11:13 AM
Relatively large proportion of "LOWDATA", "FAIL" of FPKM_status running cufflink ruben6um Bioinformatics 3 10-12-2011 12:39 AM
The position file formats ".clocs" and "_pos.txt"? Ist there any difference? elgor Illumina/Solexa 0 06-27-2011 07:55 AM
"Systems biology and administration" & "Genome generation: no engineering allowed" seb567 Bioinformatics 0 05-25-2010 12:19 PM
SEQanswers second "publication": "How to map billions of short reads onto genomes" ECO Literature Watch 0 06-29-2009 11:49 PM

Reply
 
Thread Tools
Old 04-21-2011, 11:38 AM   #1
PFS
Member
 
Location: USA

Join Date: Mar 2010
Posts: 55
Default "coverage" of introns, intergenic regions for RNASEQ

I would like to determine the % of reads mapping to exons, introns and intergenic regions. I know BEDTools can do this, but I am not sure how to get the GFF files with the information for exons, introns and intergenic regions separately. Any help? Is there a better/easier way?
PFS is offline   Reply With Quote
Old 04-22-2011, 01:07 PM   #2
Michael.James.Clark
Senior Member
 
Location: Palo Alto

Join Date: Apr 2009
Posts: 213
Default

One easy way is through UCSC.

When you go to get the Refseq or UCSC knownGene table in BED format, for example, you can choose to report exons, introns, UTRs, etc.

Some of the tables will not have this information, making your life difficult. Like the Ensembl table in UCSC doesn't have this option as far as I can see.

For Ensembl if you prefer it, the GTF file you can get for their db has useful meta info columns 2 and 3 that can help you do this as well.
__________________
Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
Projects: U87MG whole genome sequence [Website] [Paper]
Michael.James.Clark is offline   Reply With Quote
Old 09-07-2011, 01:51 PM   #3
unidodo
Junior Member
 
Location: USA

Join Date: Jul 2010
Posts: 8
Default

UCSC bed files potentially cause the over-counting problem.
unidodo is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 02:05 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO