I encountered a problem dealing the RNA-seq by Tophat.
The data is 36bp(single-end) by Illumina's Genome analyzer, and during the first process of Tophat, that is Bowtie,
the result is bad:
# reads processed: 26732967
# reads with at least one reported alignment: 134706 (0.50%)
# reads that failed to align: 26586496 (99.45%)
# reads with alignments suppressed due to -m: 11765 (0.04%)
Reported 400680 alignments to 1 output stream(s)
The reads that failed to align: 26586496 (99.45%) is too high and I really dont know how to deal with that.
I tried a lot of different parameters of Tophat, the parameter is as follows:
tophat --bowtie1 --library-type fr-unstranded --segment-length 18 -p 8 -o /sjn/rep1/try/tophat_821 /sjn/gencode/bowtie1_index/hg19 /rep1/RawDataRep1.fastq
1. Among them, --bowtie1 is try to use Bowtie1 instead of Bowtie2.
I tried bowtie2 before, the result are even worse.
For reads longer than about 50 bp Bowtie 2 is generally faster, more sensitive, and uses less memory than Bowtie 1. For relatively short reads (e.g. less than 50 bp) Bowtie 1 is sometimes faster and/or more sensitive.(http://bowtie-bio.sourceforge.net/index.shtml)
2. --segment-length 18
because the length of the reads is 36bp.
I really do not know what to do.
The data is 36bp(single-end) by Illumina's Genome analyzer, and during the first process of Tophat, that is Bowtie,
the result is bad:
# reads processed: 26732967
# reads with at least one reported alignment: 134706 (0.50%)
# reads that failed to align: 26586496 (99.45%)
# reads with alignments suppressed due to -m: 11765 (0.04%)
Reported 400680 alignments to 1 output stream(s)
The reads that failed to align: 26586496 (99.45%) is too high and I really dont know how to deal with that.
I tried a lot of different parameters of Tophat, the parameter is as follows:
tophat --bowtie1 --library-type fr-unstranded --segment-length 18 -p 8 -o /sjn/rep1/try/tophat_821 /sjn/gencode/bowtie1_index/hg19 /rep1/RawDataRep1.fastq
1. Among them, --bowtie1 is try to use Bowtie1 instead of Bowtie2.
I tried bowtie2 before, the result are even worse.
For reads longer than about 50 bp Bowtie 2 is generally faster, more sensitive, and uses less memory than Bowtie 1. For relatively short reads (e.g. less than 50 bp) Bowtie 1 is sometimes faster and/or more sensitive.(http://bowtie-bio.sourceforge.net/index.shtml)
2. --segment-length 18
because the length of the reads is 36bp.
I really do not know what to do.
Comment