![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
HTSeq install | fatakias | Bioinformatics | 12 | 11-12-2015 08:22 AM |
HTSeq Help | Klroop | Bioinformatics | 0 | 07-09-2012 12:36 PM |
HTSeq errors | wangli | RNA Sequencing | 4 | 06-09-2012 03:55 PM |
help with HTSeq installation | madhuk11 | Bioinformatics | 4 | 06-07-2012 11:04 AM |
in HTSeq example, pyplot | hibachings2013 | RNA Sequencing | 0 | 06-29-2011 06:38 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: Finland Join Date: Aug 2012
Posts: 29
|
![]()
Hi all,
I have a BAM file from paired end tophat output and and I wanted to apply HTseq to count the mapped reads, because it was paired mate and also BAM, I did following steps but still I receive error: samtools sort accepted_hits.bam accepted_hits.sorted samtools view accepted_hits.sorted.bam | htseq-count - /hg19/Annotation/Genes/genes.gtf > count.txt and this is the error : Warning: Read ERR009097.6031922 claims to have an aligned mate which could not be found. (Is the SAM file properly sorted?) Actually it was the same error I got before sorting so I did sorting but it is still the case. And according to fastqc analysis for the left and right fastq files I have the same number of sequences. Thanks for the help Last edited by narges; 09-06-2012 at 05:39 AM. |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Baltimore Join Date: Mar 2012
Posts: 120
|
![]()
You didn't actually convert it to .sam. You should have wrote
samtools view accepted_hits.bam accepted_hits.sam And then sort the acccepted_hits.sam file |
![]() |
![]() |
![]() |
#3 | |
Member
Location: Finland Join Date: Aug 2012
Posts: 29
|
![]() Quote:
samtools view accepted_hits.bam | htseq-count - /hg19/Annotation/Genes/genes.gtf > count.txt for the single end dataset which did not ask for sorting and it worked. Althigh I tried your advice and again there were more errors like : [bam_index_load] fail to load BAM index. [main_samview] random alignment retrieval only works for indexed BAM files. And then I indexed the file : samtools index accepted_hits.bam And then : $ samtools view accepted_hits.bam.bai accepted_hits.sam [bam_header_read] EOF marker is absent. [bam_header_read] invalid BAM binary header (this is not a BAM file). [main_samview] fail to read the header from "accepted_hits.bam.bai". The bam file I am using is the tophat output and I did not approach this problem of indexing and so on with this kind of result ![]() |
|
![]() |
![]() |
![]() |
#4 |
Devon Ryan
Location: Freiburg, Germany Join Date: Jul 2011
Posts: 3,480
|
![]()
@narges: the problem was that you sorted by coordinate rather than read name, which is required for pair-end reads.
Try instead: Code:
samtools sort -n accepted_hits.bam accepted_hist.sorted samtools view accepted_hits.sorted.bam | htseq-count - /hg19/Annotation/Genes/genes.gtf > count.txt |
![]() |
![]() |
![]() |
#5 | |
Member
Location: Finland Join Date: Aug 2012
Posts: 29
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|