Hi all.
I have a question about tophat (v > 1.3.1). I am running tophat with 72 single end reads from RNA dataset (Illumina Genome Analyzer was used); I have both a gff3 file and a reference genome. Reading logs/bowtie.left_kept_reads.fixmap.log I have:
# reads processed: 7333307
# reads with at least one reported alignment: 4403978 (60.05%)
# reads that failed to align: 2928767 (39.94%)
# reads with alignments suppressed due to -m: 562 (0.01%)
Reported 4826602 alignments to 1 output stream(s)
So I expected to find no more than 4826602 alignments in accepted_hits.bam.
That is not right: after converting accepted_hits.bam to sam format I have:
wc -l accepted_hits.sam
6121159 accepted_hits.sam
Each line should correspond to an alignment but bowtie report talks about 4826602 alignments. Can anyone tell me, please, what am I missing?
Thanks a lot.
I have a question about tophat (v > 1.3.1). I am running tophat with 72 single end reads from RNA dataset (Illumina Genome Analyzer was used); I have both a gff3 file and a reference genome. Reading logs/bowtie.left_kept_reads.fixmap.log I have:
# reads processed: 7333307
# reads with at least one reported alignment: 4403978 (60.05%)
# reads that failed to align: 2928767 (39.94%)
# reads with alignments suppressed due to -m: 562 (0.01%)
Reported 4826602 alignments to 1 output stream(s)
So I expected to find no more than 4826602 alignments in accepted_hits.bam.
That is not right: after converting accepted_hits.bam to sam format I have:
wc -l accepted_hits.sam
6121159 accepted_hits.sam
Each line should correspond to an alignment but bowtie report talks about 4826602 alignments. Can anyone tell me, please, what am I missing?
Thanks a lot.
Comment