**thinkRNA** · 05-28-2010, 02:21 PM

When you ran cufflinks without the -G option, did it detect expression of more novel transcripts than when you ran it with the -G option?
If the answer is no, that explains your results.

I am interested in knowing how cufflinks assembles novel transcripts. Does it look for ORFs and splice sites when assembling them? It is important to know this if we want to trust its "novel" transcripts.

I am also interested in exploring tophat/cufflinks/cuffcompare workflow parameters, so here are a few other "exploratory" parameters.

I am using 75bp reads and want to determine differential expression between 2 states:
1) tophat's -g (--max-multihits) parameter is set by default to 40, which means any read hitting no more than 40 places in the genome is kept. I think this is way too high, so I bring it down to 10. I would also like to know how my results would differ if -g were set to 1. Has anyone tried this?
2) I also increase tophat's -a parameter from 8 to 10, which means a spliced read must have at least 10 bases on each side of the splice site for the junction to be considered. I wonder whether, for 75bp reads, 10 is strict enough.
3) tophat calls bowtie without its "--best" parameter. I am interested in calling bowtie (from tophat) with the --best option to see if the alignments differ. Does anyone have any thoughts on this?
4) I know for certain that the last 15 bases of my reads have bad quality overall. Should I trim these before aligning them? Does trimming make a difference, or do bowtie/tophat take the quality scores into consideration?
5) Cufflinks' -Q parameter is set to 0 by default. This is a critical parameter because it lets cufflinks take the alignment (mapping) quality into consideration. I don't know what a good value to try would be.
6) Your idea?
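
For point 4, here is a minimal sketch of hard-trimming a fixed number of bases from the 3' end of each read before alignment. It assumes plain 4-line FASTQ records; the function names (`read_fastq`, `trim_3prime`) are made up for illustration and are not part of tophat or bowtie. Whether trimming actually changes the alignments is something to test, not something this snippet settles.

```python
from typing import Iterable, Iterator, Tuple

# (header, sequence, quality): hypothetical record layout for this sketch
FastqRecord = Tuple[str, str, str]


def read_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield (header, sequence, quality) from 4-line FASTQ records."""
    it = iter(lines)
    for header in it:
        header = header.rstrip("\n")
        if not header:
            continue  # skip blank lines between records
        seq = next(it).rstrip("\n")
        next(it)  # '+' separator line, ignored
        qual = next(it).rstrip("\n")
        yield header, seq, qual


def trim_3prime(record: FastqRecord, n_bases: int) -> FastqRecord:
    """Remove n_bases from the 3' end of both sequence and quality string."""
    header, seq, qual = record
    if n_bases <= 0:
        return record
    return header, seq[:-n_bases], qual[:-n_bases]


def format_fastq(record: FastqRecord) -> str:
    """Turn a record back into 4-line FASTQ text."""
    header, seq, qual = record
    return f"{header}\n{seq}\n+\n{qual}\n"


if __name__ == "__main__":
    # Invented 75bp read whose last 15 qualities are poor ('#').
    example = ["@read1\n", "A" * 75 + "\n", "+\n", "I" * 60 + "#" * 15 + "\n"]
    for rec in read_fastq(example):
        print(format_fastq(trim_3prime(rec, 15)), end="")
```

Running the same tophat command on trimmed and untrimmed reads and then comparing junctions.bed would show whether the trimming matters for your data.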

I think an important discussion would be: how does one evaluate these runs? What plots or statistics can give an idea of whether the program is doing a good job? Some ideas:
1) If you know certain genes will be down/up in your experimental conditions, check them.
2) Evaluate tophat's junctions.bed to determine how splice site support changes with different parameters.
3) If you know of certain "problem" genes that have complicated isoform expression or belong to paralogous gene families, evaluate whether reads are being correctly aligned to them and whether splicing changes are being detected.
4) Look at the uniformity of coverage within a gene. There should be genes that have even coverage across all exons.
5) If you have replicate sample runs, you should theoretically see no differences in gene expression between them using your workflow. A scatter plot of FPKM between replicates should be more or less a diagonal line. Of course, the same plot between your control and treated samples should show deviations from the diagonal.
6) Any other ideas that have helped you?
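
For point 5, a rough sketch of how the replicate check could be reduced to a single number: the Pearson correlation of log2-transformed FPKM values over the genes shared by two runs. The dictionaries of gene ID to FPKM, the pseudocount of 1 and the function names are all assumptions for illustration; parsing the actual cufflinks output is left out on purpose.

```python
import math
from typing import Dict, List


def log2_fpkm(values: List[float], pseudocount: float = 1.0) -> List[float]:
    """log2(FPKM + pseudocount); the pseudocount keeps zero FPKM defined."""
    return [math.log2(v + pseudocount) for v in values]


def pearson(x: List[float], y: List[float]) -> float:
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant values")
    return cov / math.sqrt(var_x * var_y)


def replicate_correlation(fpkm_a: Dict[str, float],
                          fpkm_b: Dict[str, float]) -> float:
    """Correlate log2 FPKM over the genes present in both replicates."""
    shared = sorted(set(fpkm_a) & set(fpkm_b))
    x = log2_fpkm([fpkm_a[g] for g in shared])
    y = log2_fpkm([fpkm_b[g] for g in shared])
    return pearson(x, y)


if __name__ == "__main__":
    # Invented FPKM values, only to show the call.
    rep1 = {"geneA": 10.0, "geneB": 0.0, "geneC": 250.0, "geneD": 3.5}
    rep2 = {"geneA": 12.0, "geneB": 0.2, "geneC": 230.0, "geneD": 4.0}
    print(f"replicate r = {replicate_correlation(rep1, rep2):.3f}")
```

A value close to 1 between replicates, and a lower one between control and treated, would be in line with the expectation above. A single correlation hides where the disagreement is, so it complements the scatter plot rather than replacing it.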

**gtb** · 05-31-2010, 06:20 PM

Robin,
You say the refseqGene.gtf file contains the UCSC annotations. Where exactly did you get the refseqGene.gtf file?

**dariober** · 06-01-2010, 12:21 AM

Originally posted by thinkRNA

3) tophat calls bowtie without its "--best" parameter. I am interested in calling bowtie (from tophat) with the --best option to see if the alignments differ. Does anyone have any thoughts on this?

See if this thread can give you some help http://seqanswers.com/forums/showthr...=tophat+strata

Dario

**pinki999** · 07-02-2012, 05:58 AM

Hi ThinkRNA,

Did you figure out how TopHat deals with this situation?

4) I know for certain that the last 15 bases of my reads have bad quality overall. Should I trim these before aligning them? Does trimming make a difference, or do bowtie/tophat take the quality scores into consideration?

**IBseq** · 07-16-2012, 03:21 PM

refseqGene.gtf file

Originally posted by gtb

Robin,
You say the refseqGene.gtf file contains the UCSC annotations. Where exactly did you get the refseqGene.gtf file?

In Galaxy, go to Get Data -> UCSC Main, set your parameters (clade, genome, assembly), select group "Genes and Gene Prediction Tracks" and track "RefSeq Genes", choose output format GTF, then send the output to Galaxy. It will appear as a new job in your history.

run cufflink, cuffcompare and cuffdiff workflow