Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
DEXSeq Using Counts File From htseq-count FuzzyCoder Bioinformatics 20 01-04-2016 12:18 AM
total number of counts with HTseq oliviera Bioinformatics 17 07-26-2013 08:33 AM
htseq-count gets more reads? deepsea Bioinformatics 3 03-29-2012 12:27 PM
htseq count missing last mapped fragment mapardo RNA Sequencing 2 11-08-2011 06:27 AM
htseq-count with warning for every read to represent all of zero counts in output hibachings2013 RNA Sequencing 10 07-15-2011 11:19 AM

Thread Tools
Old 03-21-2013, 06:13 AM   #1
Junior Member
Location: the Netherlands

Join Date: Mar 2013
Posts: 1
Default Discrepancy between HTSeq-count counts and total mapped reads

Hi All,

There seems to be a rather large discrepancy between the number of mapped Illumina reads and the total number of HTSeq counts (gene counts + counts with no_feature + ambiguous counts + too_low_aQual + not_aligned + alignment_not_unique).

For instance:
I used Tophat2 to map paired-end illumina reads. My two FASTQ files contain a total of 39785174 reads of which 4177881 were unmapped (unmapped.bam). A total of 35607293 reads were mapped (accepted_hits.bam), of which 31390707 were uniquely mapped (grep -w "NH:i:1" accepted_hits.sam | sort | uniq | wc -l).

My input HTSeq-count command was:
htseq-count --mode=union --stranded=no --type=exon --idattr=gene_id accepted_hits.nsorted.sam Solanum_tuberosum.3.0.17.gtf > HTSeq_counts_UNION_ENSEMBL_GTF.out

The HTSeq output is:
Total sum of gene counts 13319749
no_feature 3150201
ambiguous 346347
too_low_aQual 0
not_aligned 0
alignment_not_unique 9024735

The sum of HTSeq-count is 25841032 counts. Shouldn't the sum of counts be equal to the total number of mapped reads (35607293)? Please help me explain this discrepancy.

Thanks in advance for the answer!

Last edited by M_staats; 03-21-2013 at 06:16 AM.
M_staats is offline   Reply With Quote
Old 03-21-2013, 07:16 AM   #2
Simon Anders
Senior Member
Location: Heidelberg, Germany

Join Date: Feb 2010
Posts: 993

htseq-count counts read pairs, not reads. Maybe this explains it.
Simon Anders is offline   Reply With Quote


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 02:34 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2018, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO