Hello All:
I am not sure that I use the cufflink, cuffcompare and cuffdiff output files are corrected here.
Here is my two work flows:
1) work with unknow gene model:
cufflink -p 4 -m 149 accepted_hits_s1.sam
cufflink -p 4 -m 149 accepted_hits_s3.sam
2) Using UCSC annotation GTF file
cuffcompare -r refseqGene.gtf -R -s /indexes/ transcripts_s1.gtf transcripts_s3.gtf
3) Using combined.gtf from the output file of cuffcompare:
cuffdiff -p 4 -m 149 combined.gtf accepted_hits_s1.sam accepted_hits_s3.sam
4) Cuffdiff output file: 0_1_splicing.diff contains about 75 records with test_stat=OK
Workflow two with know gene model(refseqGene.gtf ):
1) cufflink -p 4 -m 149 -G refseqGene.gtf accepted_hits_s1.sam
cufflink -p 4 -m 149 -G refseqGene.gtf accepted_hits_s3.sam
2) Using UCSC annotation GTF file
cuffcompare -r refseqGene.gtf -R -s /indexes/ transcripts_s1.gtf transcripts_s3.gtf
3) Using combined.gtf from the output file of cuffcompare:
cuffdiff -p 4 -m 149 combined.gtf accepted_hits_s1.sam accepted_hits_s3.sam
4) Cuffdiff output file: 0_1_splicing.diff contains about 445 records with test_stat=OK
I should get more records in the first workflow than second workflow because the first one is run as de noval cufflink without any annotation GTF file. I get about 75 records of splicing.diff in the de noval (workflow one) and about 445 records of splicing.diff file in the know gene model (workflow two).
Am I doing any steps wrong in these two workflows?
Thanks for any comments!
R
I am not sure that I use the cufflink, cuffcompare and cuffdiff output files are corrected here.
Here is my two work flows:
1) work with unknow gene model:
cufflink -p 4 -m 149 accepted_hits_s1.sam
cufflink -p 4 -m 149 accepted_hits_s3.sam
2) Using UCSC annotation GTF file
cuffcompare -r refseqGene.gtf -R -s /indexes/ transcripts_s1.gtf transcripts_s3.gtf
3) Using combined.gtf from the output file of cuffcompare:
cuffdiff -p 4 -m 149 combined.gtf accepted_hits_s1.sam accepted_hits_s3.sam
4) Cuffdiff output file: 0_1_splicing.diff contains about 75 records with test_stat=OK
Workflow two with know gene model(refseqGene.gtf ):
1) cufflink -p 4 -m 149 -G refseqGene.gtf accepted_hits_s1.sam
cufflink -p 4 -m 149 -G refseqGene.gtf accepted_hits_s3.sam
2) Using UCSC annotation GTF file
cuffcompare -r refseqGene.gtf -R -s /indexes/ transcripts_s1.gtf transcripts_s3.gtf
3) Using combined.gtf from the output file of cuffcompare:
cuffdiff -p 4 -m 149 combined.gtf accepted_hits_s1.sam accepted_hits_s3.sam
4) Cuffdiff output file: 0_1_splicing.diff contains about 445 records with test_stat=OK
I should get more records in the first workflow than second workflow because the first one is run as de noval cufflink without any annotation GTF file. I get about 75 records of splicing.diff in the de noval (workflow one) and about 445 records of splicing.diff file in the know gene model (workflow two).
Am I doing any steps wrong in these two workflows?
Thanks for any comments!
R
Comment