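After running BBMap with the command in the first code block below, I get BAM files with duplication in the SN fields of the @SQ header lines: each SN value carries a second token after the sequence name (for example "SN:chr1 1"), as the header dump in the second code block shows.
I can get rid of the extra tokens using samtools reheader, but I'm wondering what causes the duplication. Any ideas welcome.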
Code:
java -ea -Xmx30g align2.BBMap build=1 overwrite=true fastareadlen=500 -Xmx30g in=8601_RG15-S1_IndexD706-D508_GAATTCGT+GTACTGAC_L007_R1_001.fastq.gz out=8601_RG15-S1_IndexD706-D508_GAATTCGT+GTACTGAC_L007_R1_001.bam qtrim=t usequality=t minaveragequality=0 local=f strictmaxindel=f xstag=us maxindel=100000 intronlen=10 ambig=toss threads=8
Code:
mac00179:fastq lecka48$ samtools view -H 8601_RG15-S1_IndexD706-D508_GAATTCGT+GTACTGAC_L007_R1_001.bam | head -30
@HD VN:1.4 SO:unsorted
@SQ SN:chr1 1 LN:195471971
@SQ SN:chr2 2 LN:182113224
@SQ SN:chr3 3 LN:160039680
@SQ SN:chr4 4 LN:156508116
@SQ SN:chr5 5 LN:151834684
@SQ SN:chr6 6 LN:149736546
@SQ SN:chr7 7 LN:145441459
@SQ SN:chr8 8 LN:129401213
@SQ SN:chr9 9 LN:124595110
@SQ SN:chr10 10 LN:130694993
@SQ SN:chr11 11 LN:122082543
@SQ SN:chr12 12 LN:120129022
@SQ SN:chr13 13 LN:120421639
@SQ SN:chr14 14 LN:124902244
@SQ SN:chr15 15 LN:104043685
@SQ SN:chr16 16 LN:98207768
@SQ SN:chr17 17 LN:94987271
@SQ SN:chr18 18 LN:90702639
@SQ SN:chr19 19 LN:61431566
@SQ SN:chrX X LN:171031299
@SQ SN:chrY Y LN:91744698
@SQ SN:chrM MT LN:16299
@SQ SN:GL456210.1 GL456210.1 LN:169725
@SQ SN:GL456211.1 GL456211.1 LN:241735
@SQ SN:GL456212.1 GL456212.1 LN:153618
@SQ SN:GL456213.1 GL456213.1 LN:39340
@SQ SN:GL456216.1 GL456216.1 LN:66673
@SQ SN:GL456219.1 GL456219.1 LN:175968
@SQ SN:GL456221.1 GL456221.1 LN:206961
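For reference, the samtools reheader workaround mentioned above can be sketched roughly like this. It is only a sketch under two assumptions: that the header fields are tab-separated (as the SAM format requires), and that the unwanted part of each SN value is everything after the first space. The helper name clean_sn and the file names in the comments are hypothetical placeholders, not something from my actual run.

```shell
# Hypothetical helper: reads a SAM header on stdin and, on @SQ lines only,
# truncates each SN: field at its first space (e.g. "SN:chr1 1" -> "SN:chr1").
# Assumes tab-separated header fields; all other lines pass through unchanged.
clean_sn() {
  awk 'BEGIN { FS = OFS = "\t" }
       /^@SQ/ {
         for (i = 2; i <= NF; i++)
           if ($i ~ /^SN:/) sub(/ .*/, "", $i)
       }
       { print }'
}

# Intended usage (in.bam, header.sam and fixed.bam are placeholder names):
#   samtools view -H in.bam | clean_sn > header.sam
#   samtools reheader header.sam in.bam > fixed.bam
```

This only patches the header after the fact; it does not explain why the extra token ends up in SN in the first place.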