Seqanswers Leaderboard Ad

**Nicolas** · 02-08-2012, 01:56 PM

Could it be multiple isoforms of the same gene?

**nwalwort** · 02-08-2012, 02:02 PM

Hey Nicolas,

thats what I was thinking it might be. However, I looked at the read pileups with IGV, and cufflinks is just assembling different transcripts de novo of the same gene based on clustering reads. So, for example, looking at one gene that I am mapping reads too, instead of calculating the FPKM for all reads hitting that gene, cufflinks is splitting the gene up into thirds based on where reads are piling up and calculating three different FPKMs for each region of the gene and then reporting it as different "genes" (transcripts). So, rather than different isoforms, it looks like it is just splitting up genes based on where reads fall. I am also using a mapping program unaware of splicing. I am trying my luck with a few other programs to compare.

**Nicolas** · 02-08-2012, 02:13 PM

Could you post the command you're using?
Which Cufflinks mode are you using, de novo (default), with a reference annotation (-G) or RABT (-g)?
Is there a complete coverage of your gene? If not (and if you're using de novo mode), then Cufflinks has no information supporting the fact that the 3 regions are actually one single gene...

Please provide more info.

**nwalwort** · 02-08-2012, 02:28 PM

Hello Nicolas, my command is below:

cufflinks -N -u seq1_380-380_r1_out.sorted.sam

It is in default mode i think. I aligned my reads to a multifasta file with annotated genes in hopes that it would be sufficient for cufflinks to assign reads to only these genes but I was wrong, and cufflinks assembled transcripts because no GTF was supplied. I was trying to search for anything on how to obtain or generate a reference GTF file for my bacterium, but I cannot seem to find it. Surely, that would probably fix my problem. do you know how I might generate one with an annotated reference genome in fasta format. Thank you for your inquiries! I am still quite new to this

**Nicolas** · 02-09-2012, 07:08 AM

Does your multifasta file contains one entry per gene?
If so, it should be easy to count the number of reads mapping to each entry (samtools idxstats <aln.bam> for instance). You can then normalize by exon size and library depth to achieve something similar to FPKM.
I don't think Cufflinks could do what you want, but I am also not sure you really need it!

**nwalwort** · 02-09-2012, 08:47 AM

Thank you for the replies Nicolas. Much appreciated. good luck with everything

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

Cufflinks reporting differnt FPKMS for the same gene

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News