Dear all:
new in this... just a quick question to ask for suggestions to tackle potential differential expression of two transcripts from the same gene, what I did:
1. mapped and generated .bam files using tophat2
2. "grep genename the.human.gtf.file" to get exon info from the original gtf file used in tophat2, this info saved to gene.bed file (keep only those that are exon)
3. "bedtools multicov -bams bamfiles -bed gene.bed > output.bed" this is looped to get information for all the bam files...
the issues:
1. the bed file I generated has multiple transcripts ID, they share most exons. my idea right now is to look at exons that are different from each other
2. For short exons, the count info is not reliable since many samples has no match
3. If I want to "normalize" by total match for the sample and exon length, will this be enough?
Thanks and appreciate any suggestions on packages or strategies... this is a time series study it will be nice to show different transcripts change over time....
new in this... just a quick question to ask for suggestions to tackle potential differential expression of two transcripts from the same gene, what I did:
1. mapped and generated .bam files using tophat2
2. "grep genename the.human.gtf.file" to get exon info from the original gtf file used in tophat2, this info saved to gene.bed file (keep only those that are exon)
3. "bedtools multicov -bams bamfiles -bed gene.bed > output.bed" this is looped to get information for all the bam files...
the issues:
1. the bed file I generated has multiple transcripts ID, they share most exons. my idea right now is to look at exons that are different from each other
2. For short exons, the count info is not reliable since many samples has no match
3. If I want to "normalize" by total match for the sample and exon length, will this be enough?
Thanks and appreciate any suggestions on packages or strategies... this is a time series study it will be nice to show different transcripts change over time....