Seqanswers Leaderboard Ad

**Michael.Ante** · 03-10-2016, 12:57 AM

Hi Muthukumar,

In principle, you are rigth. A fragment is given by the read/read-pair. Unfortunately, each read/read-pair can map to several positions on your annotation and cause a bit of ambiguity.
Therefore, there are many ways to count the reads/read-pairs for a certain gene/transcript or feature. You may start with the Tuxedo-Suite pipeline http://www.ncbi.nlm.nih.gov/pubmed/22383036. Other methods are Salmon, featureCount, RSem, and many many more.

Cheers,

Michael

**GenoMax** · 03-10-2016, 04:45 AM

@Muthukumar: You don't want to do this by hand. There are software packages featureCounts and htseq-count that do this for one (or more) aligned BAM files. Both packages require a genomic feature definition file (GFF/GTF). If you are using a model organism then they are easy to find. Make sure you use one that matches the genome build used for your alignments.

**Muthukumar** · 03-11-2016, 01:06 AM

thanking you for answering the question. I am already following the nature protocol which u were mentioned. when i ran a command for cuffdiff and cufflinks , I got one column as FPKM , I want to do check my manually calculated FPKM and cuffdiff generated FPKM are same.Unfortunately I am not getting the exactly same answer.

Here is the procedure that I was followed for calculation of FPKM.

1. I counted the reads using IGV for specific gene.
(here I want to clarify one doubt I am using pairwise end seq data, some of the reads were found both on left and right i mean overlapping reads for some genes whether I have to calculate as 2 reads or 1 read

for instance:

--------------> (read 1) <-------------- (read L)
------------------->(read R)
______________Exon1_______|___________________________|____Exon2___________

for exon2 whether should I have 2 fragments combine into 1 or into separately. Pls tell calrify me. For my manual calculation I calculated as 2 fragments .

2. I calculated using following formula

# of fragments
FPKM = ___________________________ * 10^9
length of gene. Total no of reads

Whether above formula is r8?

One more doubt => my gene of interest contains 17 exons , All 17 exons are not having read fragments and some of the fragments for a exon is small and some of the fragments are lengthy. So whether I have the count the small reads also pls rectify me?

**Michael.Ante** · 03-11-2016, 04:16 AM

Actually, the FPK-values are different, since Cuffdiff performs some extra heuristics.
As GenoMax posted, you should assess the read counts not manually, but use an accepted tool.
The FPKM is usually computed on transcript level and taking its exons' length as part of the denominator.
The read-length is something which should be controlled for in the alignment. Therefore, if your aligner reported a read/read-pair to map there, I would take it as a valid read/read-pair. In case you are doubting it, you might re-align your data with length-filtered reads (e.g. using bbduk.sh from bbmap).

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Today, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

how to identify number of fragments are produced for given gene in RNA-seq?

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News