Unconfigured Ad

**kmcarr** · 08-10-2012, 05:12 AM

Richard,

This is most likely caused by a mismatch between the reference names between the fasta file and the gtf file you are using.

The official TAIR10 genome sequence release names the mitochondrial and plastid (chloroplast) chromsomes ChrM and ChrC respectively. These are the names used in TAIR10_chr_all.fas. It would appear that your reference GTF file, arabidopsis_thaliana.TAIR10.60.gtf, is from a different source and uses different names (Mt and Pt). You will need to make sure the chromosome names match exactly between your FASTA and GTF files.

**Richard Barker** · 08-10-2012, 06:56 AM

Thanks for the swift response (again)

Your advice worked perfectly, i searched the directories near where i downloaded the TAIR10.fasta file and found a TAIR10_GFF3_genes.gff file.

The following script appears to be working, but whats the difference between a GFF and GTF file?

cuffmerge -g TAIR10_GFF3_genes -s TAIR10_chr_all.fas -p 6 run297_transcript_cuffmerge.txt

**kmcarr** · 08-10-2012, 07:07 AM

The following script appears to be working, but whats the difference between a GFF and GTF file?

Cufflinks documentation

**Richard Barker** · 08-10-2012, 07:22 AM

Ooops spoke too soon now i get the following error message?

richard@ubuntu:~/RNA_seq_analysis/Cuffmerge$ cuffmerge -g TAIR10_GFF3_genes.gff -s TAIR10_chr_all.fas -p 6 run297_transcript_cuffmerge.txt

[Fri Aug 10 07:41:48 2012] Beginning transcriptome assembly merge
-------------------------------------------

[Fri Aug 10 07:41:48 2012] Preparing output location ./merged_asm/
[Fri Aug 10 07:41:52 2012] Converting GTF files to SAM
[07:41:52] Loading reference annotation.
[07:41:53] Loading reference annotation.
[07:41:54] Loading reference annotation.
[07:41:56] Loading reference annotation.
[07:41:57] Loading reference annotation.
[07:41:58] Loading reference annotation.
[07:41:59] Loading reference annotation.
[07:42:00] Loading reference annotation.
[Fri Aug 10 07:42:02 2012] Quantitating transcripts
You are using Cufflinks v2.0.2, which is the most recent release.
Command line:
cufflinks -o ./merged_asm/ -F 0.05 -g TAIR10_GFF3_genes.gff -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 6 ./merged_asm/tmp/mergeSam_filefWraGs
[bam_header_read] EOF marker is absent.
[bam_header_read] invalid BAM binary header (this is not a BAM file).
File ./merged_asm/tmp/mergeSam_filefWraGs doesn't appear to be a valid BAM file, trying SAM...
[07:42:02] Loading reference annotation.
[07:42:03] Inspecting reads and determining fragment length distribution.
Processed 47416 loci.
> Map Properties:
> Normalized Map Mass: 194074.00
> Raw Map Mass: 194074.00
> Fragment Length Distribution: Truncated Gaussian (default)
> Default Mean: 200
> Default Std Dev: 80
[07:42:05] Assembling transcripts and estimating abundances.
Processed 47416 loci.
[Fri Aug 10 07:53:04 2012] Comparing against reference file TAIR10_GFF3_genes.gff
You are using Cufflinks v2.0.2, which is the most recent release.
Warning: couldn't find fasta record for 'Chr1'!
Warning: couldn't find fasta record for 'Chr2'!
Warning: couldn't find fasta record for 'Chr3'!
Warning: couldn't find fasta record for 'Chr4'!
Warning: couldn't find fasta record for 'Chr5'!
Warning: couldn't find fasta record for 'ChrC'!
Warning: couldn't find fasta record for 'ChrM'!
[Fri Aug 10 07:53:20 2012] Comparing against reference file TAIR10_GFF3_genes.gff
You are using Cufflinks v2.0.2, which is the most recent release.
Warning: couldn't find fasta record for 'Chr1'!
Warning: couldn't find fasta record for 'Chr2'!
Warning: couldn't find fasta record for 'Chr3'!
Warning: couldn't find fasta record for 'Chr4'!
Warning: couldn't find fasta record for 'Chr5'!
Warning: couldn't find fasta record for 'ChrC'!
Warning: couldn't find fasta record for 'ChrM'!

**Richard Barker** · 08-10-2012, 07:32 AM

I've found a TAIR10_GFF file (ftp://ftp.arabidopsis.org/home/tair/...enome_release/) which was also near the location where i downloaded my genome fasta file (ftp://ftp.arabidopsis.org/home/tair/...omosome_files/) and one was able to completed the alignment!
Thanks for your help!

**Richard Barker** · 08-15-2012, 01:39 PM

Shouldn't the cuffmerge out put have the gene names (Arabidopsis ATG codes?). What methods are there for adding your genome annotation, i thought that was the reason for using the GFF/gtf files during TopHat and/or cuffmerge?

**shinigam123** · 08-02-2017, 08:49 AM

I have the same problem, How you solve it?

Originally posted by Richard Barker View Post

Ooops spoke too soon now i get the following error message?

richard@ubuntu:~/RNA_seq_analysis/Cuffmerge$ cuffmerge -g TAIR10_GFF3_genes.gff -s TAIR10_chr_all.fas -p 6 run297_transcript_cuffmerge.txt

[Fri Aug 10 07:41:48 2012] Beginning transcriptome assembly merge
-------------------------------------------

[Fri Aug 10 07:41:48 2012] Preparing output location ./merged_asm/
[Fri Aug 10 07:41:52 2012] Converting GTF files to SAM
[07:41:52] Loading reference annotation.
[07:41:53] Loading reference annotation.
[07:41:54] Loading reference annotation.
[07:41:56] Loading reference annotation.
[07:41:57] Loading reference annotation.
[07:41:58] Loading reference annotation.
[07:41:59] Loading reference annotation.
[07:42:00] Loading reference annotation.
[Fri Aug 10 07:42:02 2012] Quantitating transcripts
You are using Cufflinks v2.0.2, which is the most recent release.
Command line:
cufflinks -o ./merged_asm/ -F 0.05 -g TAIR10_GFF3_genes.gff -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 6 ./merged_asm/tmp/mergeSam_filefWraGs
[bam_header_read] EOF marker is absent.
[bam_header_read] invalid BAM binary header (this is not a BAM file).
File ./merged_asm/tmp/mergeSam_filefWraGs doesn't appear to be a valid BAM file, trying SAM...
[07:42:02] Loading reference annotation.
[07:42:03] Inspecting reads and determining fragment length distribution.
Processed 47416 loci.
> Map Properties:
> Normalized Map Mass: 194074.00
> Raw Map Mass: 194074.00
> Fragment Length Distribution: Truncated Gaussian (default)
> Default Mean: 200
> Default Std Dev: 80
[07:42:05] Assembling transcripts and estimating abundances.
Processed 47416 loci.
[Fri Aug 10 07:53:04 2012] Comparing against reference file TAIR10_GFF3_genes.gff
You are using Cufflinks v2.0.2, which is the most recent release.
Warning: couldn't find fasta record for 'Chr1'!
Warning: couldn't find fasta record for 'Chr2'!
Warning: couldn't find fasta record for 'Chr3'!
Warning: couldn't find fasta record for 'Chr4'!
Warning: couldn't find fasta record for 'Chr5'!
Warning: couldn't find fasta record for 'ChrC'!
Warning: couldn't find fasta record for 'ChrM'!
[Fri Aug 10 07:53:20 2012] Comparing against reference file TAIR10_GFF3_genes.gff
You are using Cufflinks v2.0.2, which is the most recent release.
Warning: couldn't find fasta record for 'Chr1'!
Warning: couldn't find fasta record for 'Chr2'!
Warning: couldn't find fasta record for 'Chr3'!
Warning: couldn't find fasta record for 'Chr4'!
Warning: couldn't find fasta record for 'Chr5'!
Warning: couldn't find fasta record for 'ChrC'!
Warning: couldn't find fasta record for 'ChrM'!

**Richard Barker** · 08-02-2017, 08:52 AM

I used the pipeline that was made in the CyVerse Discovery environment. It's easy to use and really fast!

**shinigam123** · 08-02-2017, 09:02 AM

Can you tell me what that pipeline is, do not I know it?
regards

**Richard Barker** · 08-02-2017, 09:06 AM

They have the HTprocess and Kalisto if you're in a rush

**shinigam123** · 08-02-2017, 09:22 AM

But what was the problem, the inputs gff anda fasta? I need the output merged.gtf without warnings

**vivekkeshri** · 04-02-2019, 11:11 PM

Cuffmerge output

I am trying to execute Cuffmerge (cuffmerge -p 5 -g Homo.gtf assemblies.txt), but unable to get FPKM values in output file ("merged.gtf).
Please let me know how to solve this problem.

**vivekkeshri** · 07-29-2019, 04:49 AM

Please let me know about how "Cuffdiff -L" [-L/--labels: comma-separated list of condition labels] command works. How it is labeling / merging the bam files.
Thanks

Topics	Statistics	Last Post
Long-Read RNA Sequencing Uncovers a Hidden Layer of Immune Cell Regulation by SEQadmin2 Started by SEQadmin2, 06-02-2026, 12:03 PM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 12:03 PM
DNA Methylation Study Reveals How Epigenetic Changes Pass Between Generations by SEQadmin2 Started by SEQadmin2, 06-02-2026, 11:40 AM	0 responses 14 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 11:40 AM
MetaBeeAI Helps Scientists Process Research Literature Faster by SEQadmin2 Started by SEQadmin2, 05-28-2026, 11:40 AM	0 responses 29 views 0 reactions	Last Post by SEQadmin2 05-28-2026, 11:40 AM
Scientists Solve a 25-Year Mystery in RNA Interference by SEQadmin2 Started by SEQadmin2, 05-26-2026, 10:12 AM	0 responses 31 views 0 reactions	Last Post by SEQadmin2 05-26-2026, 10:12 AM

Unconfigured Ad

Cuffmerge error coultn't finda data file for Mt or Pt?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News