Couple questions, I have a fasta file of hg18 which is about 3Gb large and I'm afraid that samtools pileup can't read it correctly because there are lower and upper case letters, is there a tool that can open such a large file so I can edit it or it there something that can convert all lower to upper case?
Second question is about samtools pileup , after generating a raw.pileup file and filtering snps with R bioconductor package (Rsamtools) using the following commands :
system("samtools pileup -f genome.fasta output.sorted.bam > raw.pileup")
snps <- readPileup("raw.pileup", variant="SNP")
write.table(as.data.frame(snps), "snps.xls")
I get a xls file in which the column of "referenceBase" is all filled with "*" instead of A/C/T/G....
someone knows what might be the problem?
Thanks ahead!
Second question is about samtools pileup , after generating a raw.pileup file and filtering snps with R bioconductor package (Rsamtools) using the following commands :
system("samtools pileup -f genome.fasta output.sorted.bam > raw.pileup")
snps <- readPileup("raw.pileup", variant="SNP")
write.table(as.data.frame(snps), "snps.xls")
I get a xls file in which the column of "referenceBase" is all filled with "*" instead of A/C/T/G....
someone knows what might be the problem?
Thanks ahead!
Comment