![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Multi-occurence of transcripts in Cufflinks output file | Jesse K. | Bioinformatics | 0 | 05-27-2014 12:57 AM |
What is the actually output of a Cufflinks GFT file? | KnowNothing2 | RNA Sequencing | 0 | 10-18-2013 04:28 PM |
cufflinks output against annotation file | masylichu | Bioinformatics | 1 | 09-19-2012 03:43 AM |
BWASW more reads in the output SAM file than in the input file | nanto | Bioinformatics | 2 | 09-18-2012 01:41 AM |
Zero FPKM at some Cufflinks output file | fuad193 | Bioinformatics | 1 | 06-07-2012 12:16 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Sweden Join Date: Jun 2014
Posts: 86
|
![]()
I've been mostly doing differential expression with featureCounts + DESeq2 so far, but now I want to calculate the FPKM of my various samples / replicates as well. I was told I should do this with Cufflinks, so I gave it a go. While I can get some results, I'm a bit unsure exactly how I got them and exactly what they represent...
First, the input files, coming last in the command (I'm running from the Terminal in OS X Mavericks). As far as I understand, I can either run a separate cufflinks command for each replicate for each sample, but I should also be able to run it as a single command, in some way. I read the documentation and searched around on SEQanswers, but I didn't really get any definitives. For example, are these different commands valid? Code:
cufflinks <options> <input1.bam> <input2.bam> cufflinks <options> <input1.bam>,<input2.bam> cufflinks <options> <input*.bam> Secondly, the output. I've gotten the four files specified in the documentation, but the one I'm supposed to be interested in (genes.fpkm_tracking) only has one "FPKM" column. I suppose this makes sense when you run each replicate as a separate command, but having used some trial and error (mostly error, I suppose ![]() And, lastly, the --GTF or --GTF-guide options. Which do I use? I'm using Illumina paired-end, stranded data (so I'm using the --library-type fr-firststrand option), and I'm interested in knowing the FPKM for each replicate for each sample. I've thus far used the --GTF option and the same reference annotation as was used in the alignment (Tophat2). Is this the correct thinking? Thanks in advance! Last edited by ErikFas; 10-27-2014 at 07:44 AM. |
![]() |
![]() |
![]() |
#2 |
I like code
Location: San Diego, CA, USA Join Date: Sep 2009
Posts: 438
|
![]()
I think you have to run cufflinks one time per BAM file. That's how I have always done it and that's how it would handle things internally anyways since each set of alignments would be interpreted independently of the others. If you can supply multiple BAMs in a single command then I'd expect the program to pool them and evaluate them as a single sample.
--GTF is a mode for quantification only. --GTF-guide is a mode for quantification plus de-novo assembly of isoforms from alignments using a supplied GTF as a guide (like giving it a starting point for de-novo assembly). --GTF-guide uses a sligntly different assembly strategy than the default (without --GTF or --GTF-guide).
__________________
/* Shawn Driscoll, Gene Expression Laboratory, Pfaff Salk Institute for Biological Studies, La Jolla, CA, USA */ |
![]() |
![]() |
![]() |
#3 |
Member
Location: Sweden Join Date: Jun 2014
Posts: 86
|
![]()
Okay, then I'll continue doing a single command for every file, with the -G flag. Thanks for the clarification!
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|