Hello SeqAnswers community,
I am trying to use Samtools to call SNPs from my RNAseq data. I already have bam files, so I tried to use the following commands.
#Method 1 -
samtools sort file.bam file_sorted
samtools mpileup -d8000 -uf genome.fasta bamfile1.bam | bcftools view -vcg - > s.vcf
#When I opened the s.vcf file, it was a very large file (MB), it was showing a varient in every position, "N" was the reference for every position, and almost every "ALT" was multiple bases (A, T, G)
#Method 2 -
samtool sort file.bam file_sorted
samtools mpileup -d8000 -uf sequence_genome.fasta file_sorted.bam | bcftools view -bvcg - > file_samtools.raw.bcf
bcftools view file_samtools.raw.bcf | vcfutils.pl varFilter -D100 > file_samtools.vcf
#When I opened this vcf file, there were no SNPs. Below are the last four lines.
##FORMAT=<ID=DV,Number=1,Type=Integer,Description="# high-quality non-reference bases">
##FORMAT=<ID=SP,Number=1,Type=Integer,Description="Phred-scaled strand bias P-value">
##FORMAT=<ID=PL,Number=G,Type=Integer,Description="List of Phred-scaled genotype likelihoods">
##INFO=<ID=PR,Number=1,Type=Integer,Description="# permutations yielding a smaller PCHI2.">
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT file_sorted.bam
Does anyone know what is happening? I also had trouble using GATK, so please don't stear me in that direction.
Thanks and God bless,
Jason
I am trying to use Samtools to call SNPs from my RNAseq data. I already have bam files, so I tried to use the following commands.
#Method 1 -
samtools sort file.bam file_sorted
samtools mpileup -d8000 -uf genome.fasta bamfile1.bam | bcftools view -vcg - > s.vcf
#When I opened the s.vcf file, it was a very large file (MB), it was showing a varient in every position, "N" was the reference for every position, and almost every "ALT" was multiple bases (A, T, G)
#Method 2 -
samtool sort file.bam file_sorted
samtools mpileup -d8000 -uf sequence_genome.fasta file_sorted.bam | bcftools view -bvcg - > file_samtools.raw.bcf
bcftools view file_samtools.raw.bcf | vcfutils.pl varFilter -D100 > file_samtools.vcf
#When I opened this vcf file, there were no SNPs. Below are the last four lines.
##FORMAT=<ID=DV,Number=1,Type=Integer,Description="# high-quality non-reference bases">
##FORMAT=<ID=SP,Number=1,Type=Integer,Description="Phred-scaled strand bias P-value">
##FORMAT=<ID=PL,Number=G,Type=Integer,Description="List of Phred-scaled genotype likelihoods">
##INFO=<ID=PR,Number=1,Type=Integer,Description="# permutations yielding a smaller PCHI2.">
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT file_sorted.bam
Does anyone know what is happening? I also had trouble using GATK, so please don't stear me in that direction.
Thanks and God bless,
Jason
Comment