I am tearing my hair out trying to run an application that requires all reads be properly mapped 1 time in the correct orientation.
I have applied various GATK filters to the data during indel realignment including:
I think a lot of that was unnecessary because I think some of them are applied automatically but better safe than sorry.
I have also filtered the data using samtools with the following command:
I have also tried it as
I have sorted the output by queryname using picard so that all read pairs should be adjacent.
Still, the application (custom script from a separate research group) I am running tells me I have reads that are missing their mate.
I am going nuts over this. I am at the point where I am removing the problematic reads individually using picard FilterSamReads and the read names.
Any insights into what monumentally dumb thing I am doing?
I have applied various GATK filters to the data during indel realignment including:
# -rf NotPrimaryAlignment \
# -rf DuplicateRead \
# -rf UnmappedRead
# -rf DuplicateRead \
# -rf UnmappedRead
I have also filtered the data using samtools with the following command:
samtools view -b -q 30 -f 2 -o out.filtered.bam in.bam
samtools view -b -q 30 -f 0x02 -o out.filtered.bam in.bam
Still, the application (custom script from a separate research group) I am running tells me I have reads that are missing their mate.
I am going nuts over this. I am at the point where I am removing the problematic reads individually using picard FilterSamReads and the read names.
Any insights into what monumentally dumb thing I am doing?
Comment