SEQanswers

Old 01-29-2014, 08:34 AM   #1
roll
Member
 
Location: UK

Join Date: Aug 2009
Posts: 38
htseq-count error message

I keep getting the following error message when I use htseq-count:

Error occured when processing SAM input (line 9786565):
'pair_alignments' needs a sequence of paired-end alignments
[Exception type: ValueError, raised in __init__.py:612]

I sorted my SAM files using samtools sort -n.

On top of this, I also get many warnings asking whether I am sure the SAM file is properly sorted.

Do you know what the error message is about?
Old 01-29-2014, 08:45 AM   #2
dpryan
Devon Ryan
 
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,479

Did you merge BAM files from both paired-end and single-end alignments?
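One quick way to check is to look at the FLAG column of each record: in the SAM format, bit 0x1 is set when the read is part of a pair. Below is a minimal sketch, not taken from htseq-count itself, that tallies paired versus single-end records. The function name and the sample records are made up for illustration.

```python
from collections import Counter

PAIRED_FLAG = 0x1  # SAM FLAG bit 0x1: read is part of a pair


def count_library_types(sam_lines):
    """Tally paired-end vs single-end records in an iterable of SAM text lines."""
    counts = Counter()
    for line in sam_lines:
        # Skip blank lines and header lines (which start with '@').
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        flag = int(fields[1])  # second column is the bitwise FLAG
        counts["paired" if flag & PAIRED_FLAG else "single"] += 1
    return counts


# Hypothetical records, truncated to the first few columns for illustration.
sample = [
    "@HD\tVN:1.0\tSO:queryname",
    "r1\t99\tchr1\t100\t60\t50M",
    "r1\t147\tchr1\t300\t60\t50M",
    "r2\t0\tchr1\t500\t60\t50M",
]
print(dict(count_library_types(sample)))
```

On a real file, the same function could be fed `sys.stdin` with the output of `samtools view merged.bam` piped in (`merged.bam` being a placeholder name). If both categories come back non-zero, the file mixes paired-end and single-end alignments.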
Old 01-30-2014, 02:54 AM   #3
roll
Member
 
Location: UK

Join Date: Aug 2009
Posts: 38

Quote:
Originally Posted by dpryan View Post
Did you merge BAM files from both paired-end and single-end alignments?
Yes, they are all mapped properly. How can I check whether there are any other mistakes in the data?
Old 01-30-2014, 03:30 AM   #4
dpryan
Devon Ryan
 
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,479

Just browsing through the htseq-count code, it looks like this error will happen if the program thinks you have paired-end reads and you then hand it a single-end read. You might just feed the unmerged BAM files separately into htseq-count, possibly summing the resulting files in R. Whether you should sum the counts or treat them separately will depend a bit on how they were created. If the paired-end and single-end data were from different library preps of the same sample and you have equivalent data for other samples in your comparison, then I would recommend simply adding "library_type" (i.e., single or paired-end) as a factor in your statistical model (i.e., keep the counts separate). There's an example of that in the DESeq(2) vignette with the pasilla dataset.
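If summing does turn out to be appropriate, the step itself is simple and need not be done in R. Here is a rough Python sketch, assuming each htseq-count output is the usual two-column, tab-separated table of feature ID and count. The function names and the example rows are hypothetical.

```python
def parse_counts(lines):
    """Parse 'feature<TAB>count' lines into a dict of feature -> int."""
    counts = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        feature, count = line.split("\t")
        counts[feature] = int(count)
    return counts


def sum_counts(*tables):
    """Add up counts feature by feature across any number of tables."""
    total = {}
    for table in tables:
        for feature, count in table.items():
            total[feature] = total.get(feature, 0) + count
    return total


# Hypothetical rows standing in for the paired-end and single-end outputs.
paired = parse_counts(["geneA\t10", "geneB\t0", "geneC\t5"])
single = parse_counts(["geneA\t3", "geneB\t2", "geneC\t0"])
combined = sum_counts(paired, single)
for feature in sorted(combined):
    print(feature, combined[feature], sep="\t")
```

For real files, `parse_counts(open(path))` would work the same way. Note that the summary rows htseq-count appends at the end of its output (no_feature, ambiguous and so on) would be summed like any other row, so drop them first if you only want gene counts.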
Tags
error, htseq-count, sorting rna_seq
