Dear, SEQanswer community,
I try to get read distribution statistics after TopHat alignment using RseqQC. Specifically, I am interested in percentage of reads, that aligned to different regions (like CDS exons, UTR exons, intrond and intergenic regions). I used the following command line
However, I got the error message:
Does it something wrong with my bam or bed file? I used hg19 for aligning to the reference genome. Also I tried different bam files and come up with the same mistake.
Thank You very much for help!
I try to get read distribution statistics after TopHat alignment using RseqQC. Specifically, I am interested in percentage of reads, that aligned to different regions (like CDS exons, UTR exons, intrond and intergenic regions). I used the following command line
Code:
/home/olha/RSeQC-2.6.4/build/scripts-2.7/read_distribution.py -i /home/olha/A15.bam -r /home/olha/hg19_RefSeq.bed
Code:
processing /home/olha/hg19_RefSeq.bed ...Traceback (most recent call last): File "/home/olha/RSeQC-2.6.4/build/scripts-2.7/read_distribution.py", line 303, in <module> main() File "/home/olha/RSeQC-2.6.4/build/scripts-2.7/read_distribution.py", line 192, in main intergenic_down1kb_base,intergenic_down5kb_base,intergenic_down10kb_base) = process_gene_model(options.ref_gene_model) File "/home/olha/RSeQC-2.6.4/build/scripts-2.7/read_distribution.py", line 76, in process_gene_model utr_3 = obj.getUTR(utr=3) File "/usr/local/lib/python2.7/dist-packages/RSeQC-2.3.7-py2.7-linux-x86_64.egg/qcmodule/BED.py", line 425, in getUTR txStart=int(fields[1]) ValueError: invalid literal for int() with base 10: 'NM_032291'
Thank You very much for help!