Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How can I use RefGene.gtf with DEXSeq

    I try to convert the hg19_refGene.gtf file to a gff file with DEXseq_prepare_annotation.py. It doesn't work : my output file is empty. In DEXSeq's vignette they say that the script is made to be used with ensembl.gtf file. However I have conducted a full analysis at the gene level with the refGene.gtf file and I don't want to restart with a new mapping made on ensembl.gtf.

    Is there any way to use refGene.gtf file with DEXseq_prepare_annotation.py ?

    Many thanks

  • #2
    Does it give you an error? As I recall, refGene has the unfortunate property of having duplicate identifiers on multiple chromosomes/strands...which completely breaks any script trying to prepare inputs for DEXSeq.

    Rule of thumb: Always use Ensembl and never UCSC.

    Comment


    • #3
      It doesn't give an error but the gff file is empty. So, I can't use DEXSeq with refGene ?

      Comment


      • #4
        You might need to munge the GTF file a bit. Play around with the python script to see why it's giving an empty output. The fix might end up being simple.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 08:47 AM
        0 responses
        11 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        59 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        54 views
        0 likes
        Last Post seqadmin  
        Working...
        X