Hi,
I was wondering about the output of htseq-count to count reads associated with transcripts.
I run htseq-count to estimate reads associated with genes with the command below:
htseq-count --stranded=yes --mode=intersection-nonempty file.sam file.gtf >> gene_id.txt
example I found one gene ENSGxxx with associated 700 reads count.
To check how many reads are associated with the different transcripts of ENSGxxx gene i run the command below:
htseq-count --stranded=yes --idattr=transcript_id --mode=intersection-nonempty file.sam file.gtf >> transcript id.txt
I would expect that the sum of the reads associated with each transcript is almost similar to the reads associated with the gene, but it seems not the case..
example:
gene 700 counts
transcript1 0 counts
transcript2 4 counts
transcript3 72 counts
any suggestions?
Ciao!
paolo
I was wondering about the output of htseq-count to count reads associated with transcripts.
I run htseq-count to estimate reads associated with genes with the command below:
htseq-count --stranded=yes --mode=intersection-nonempty file.sam file.gtf >> gene_id.txt
example I found one gene ENSGxxx with associated 700 reads count.
To check how many reads are associated with the different transcripts of ENSGxxx gene i run the command below:
htseq-count --stranded=yes --idattr=transcript_id --mode=intersection-nonempty file.sam file.gtf >> transcript id.txt
I would expect that the sum of the reads associated with each transcript is almost similar to the reads associated with the gene, but it seems not the case..
example:
gene 700 counts
transcript1 0 counts
transcript2 4 counts
transcript3 72 counts
any suggestions?
Ciao!
paolo
Comment