Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • rflrob
    Member
    • May 2010
    • 50

    Cufflinks not masking features in the mask GTF file?

    Hi all,
    I'm running cufflinks with the mask option:
    Code:
       cufflinks --num-threads 50 --output-dir analysis-multi/CaS3A -u -M Reference/AAA/suspect.gtf  --frag-bias-correct Reference/AAA/mel_only.fa  -G Reference/AAA/mel_only.gtf analysis-multi/CaS3A/assigned_dmel_rescued.bam
    but I still end up with, for instance, relatively high (and variable) FPKMs in mitochondrial genes, which should be filtered out from the reads.

    Code:
    $ grep mt:tRNA:K mel_only.gtf
    dmel_dmel_mitochondrion_genome	FlyBase	exon	3768	3838	.	+	.	transcript_id "FBtr0100864"; gene_id "FBgn0013697"; gene_name "mt:tRNA:K";
    $ grep mt:tRNA:K suspect.gtf
    dmel_dmel_mitochondrion_genome	FlyBase	exon	3768	3838	.	+	.	transcript_id "FBtr0100864"; gene_id "FBgn0013697"; gene_name "mt:tRNA:K";
    I did find an old, essentially un-answered post on this forum mentioning a similar issue; anyone have any other deep insight?
  • pengchy
    Senior Member
    • Feb 2009
    • 116

    #2
    I have compared the speed of running with " -M -g " parameters and without them.
    The "without" group only lasted 3-5 hours, but the "with" group have been running 48 hours without any output.

    Comment

    • sschavan
      Junior Member
      • Oct 2012
      • 7

      #3
      cufflinks masking option for mitochondrial genes

      Hi all,

      I have a similar problem, I tried using the cufflinks masking option for mitochondrial genes using the below command:

      cufflinks --mask-file=/illumina/genomes/Homo_sapiens/Ensembl/GRCh37/Annotation/Genes/mask_MR.gtf -o . -p 6 -g /illumina/genomes/Homo_sapiens/Ensembl/GRCh37/Annotation/Genes/genes.gtf -u ../tophat_HS034/accepted_hits.bam

      In the resultant cufflinks output file, genes.fpkm_tracking and isoforms.fpkm_tracking I see that the MT were still present so I couldn't get the masking option to work

      I also tried both -g / -G , didn't work for them too. I wasn't sure if I am doing something incorrectly.

      I was wondering if there is anyone who could get masking option to work and could provide any guidance, that would be highly appreciated.

      Thanks
      Shweta

      Comment

      • sindrle
        Senior Member
        • Aug 2013
        • 266

        #4
        Can I ask how you created this file?

        /illumina/genomes/Homo_sapiens/Ensembl/GRCh37/Annotation/Genes/mask_MR.gtf

        Comment

        • sschavan
          Junior Member
          • Oct 2012
          • 7

          #5
          I just 'awk' the lines having Chromosome column value as "MT" from the genes.gtf file to get a subset of mitochondrial entries. I obtained a subset from the Ensembl gtf file, which had "MT" as Chromosome in the column1.

          awk -F "\t" '$1=="MT"{print }' < genes.gtf > mask_MR.gtf

          Shweta

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Today, 11:10 AM
          0 responses
          5 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          41 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          102 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          123 views
          0 reactions
          Last Post SEQadmin2  
          Working...