Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • JonB
    Member
    • Jan 2010
    • 85

    Getting level of spliced genes from gff-file

    Hi,

    I have a list of genes and I want to check how many of them are spliced, i.e. have multiple exons. Many of these genes also exist in several isoforms, but I mostly just need to know if a gene is multi-exonic or not.

    Any tips on how to efficiently calculate this? I have used bedtools before, but cannot see that it can do this task for me. Maybe cufflinks is an alternative, but I am not familiar with everything it can do.

    Thanks,

    Jon
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    Presuming you have a GTF or GFF file, you can use the GenomicRanges package in R. Just load the GTF/GFF file into a GRanges object, split() by the gene name, reduce() that and then lapply() length(). Multiexonic genes will have lengths > 1 (the result of applying length() will be the number of exons).

    Comment

    • JonB
      Member
      • Jan 2010
      • 85

      #3
      Thanks Devon,

      I struggle to understand how to split by the gene name, and also to understand exactly how the output of this contains (GRangesList?). Should I split using "ID" or "gene"? Do you mind showing me on this exctract from my gff3 file? Thanks a lot.

      Code:
      ##gff-version 3
      ###
      scis2053	noncoding	gene	27485	28677	.	-	.	ID=scign013105;Name=scign013105
      scis2053	noncoding	mRNA	27485	28677	5921	-	.	ID=scitn013105.1;Parent=scign013105;Name=scitn013105.1
      scis2053	noncoding	exon	27485	28677	.	-	.	Parent=scitn013105.1
      ###
      scis543	noncoding	gene	28381	29172	.	+	.	ID=scign020729;Name=scign020729
      scis543	noncoding	mRNA	28381	29172	3731	+	.	ID=scitn020729.1;Parent=scign020729;Name=scitn020729.1
      scis543	noncoding	exon	28381	29172	.	+	.	Parent=scitn020729.1
      ###
      scis396	noncoding	gene	286737	287339	.	-	.	ID=scign021128;Name=scign021128
      scis396	noncoding	mRNA	286737	287339	3015	-	.	ID=scitn021128.1;Parent=scign021128;Name=scitn021128.1
      scis396	noncoding	exon	286737	287339	.	-	.	Parent=scitn021128.1
      ###
      scis673	noncoding	gene	85677	115116	.	+	.	ID=scign002358;Name=scign002358
      scis673	noncoding	mRNA	113016	115116	6254	+	.	ID=scitn002358.1;Parent=scign002358;Name=scitn002358.1
      scis673	noncoding	exon	113016	113049	.	+	.	Parent=scitn002358.1
      scis673	noncoding	exon	113444	114538	.	+	.	Parent=scitn002358.1
      scis673	noncoding	exon	114973	115116	.	+	.	Parent=scitn002358.1
      scis673	noncoding	mRNA	85677	115099	3835	+	.	ID=scitn002358.2;Parent=scign002358;Name=scitn002358.2
      scis673	noncoding	exon	85677	85697	.	+	.	Parent=scitn002358.2
      scis673	noncoding	exon	113896	114538	.	+	.	Parent=scitn002358.2
      scis673	noncoding	exon	114973	115099	.	+	.	Parent=scitn002358.2

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by SEQadmin2


        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

        Here are nine questions we think about, in roughly the order they matter, before...
        06-18-2026, 07:11 AM
      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        06-02-2026, 10:05 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Yesterday, 05:37 AM
      0 responses
      6 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-26-2026, 11:10 AM
      0 responses
      16 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-17-2026, 06:09 AM
      0 responses
      51 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-09-2026, 11:58 AM
      0 responses
      110 views
      0 reactions
      Last Post SEQadmin2  
      Working...