Hello. I was wondering if I could get some help.
I'm trying to use bowtie to align a fasta file using an index and output it in sam format. Later on, I'll need to process the output sam file using another script. Anyway, I called bowtie with these options
bowtie -f -t -p 8 -n 3 -l 32 -k 1 -m 100 -S -y --chunkmbs 1024 --max FASTA_FILE.mm.fasta --best
using input of the form
>38-1
TGGAACGGAACGGAATGGAAGGGAATGGAATGGAAT
and got output of the form
38-1 0 chrY:28807964-28808132 275 255 36M * 0 0 TGGAACGGAACGGAATGGAAGGGAATGGAATGGAAT IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII XA:i:2 MD:Z:5T14A15 NM:i:2
Maybe I'm misunderstanding something, but this doesn't exactly appear to be in sam format. What I would like is for the coordinate of the leftmost position of the sequence to be in the fourth field (instead of 275--I'm not sure what that number represents, but given the label in field three, I don't think it's the coordinate) and for only the chromosome to be in the third field. I could manually modify the output, but I'm afraid to throw away that 275 because I have no idea what it is.
Does anyone know what I'm doing wrong? Any help is appreciated. If you need more information, I'll do my best to provide it.
Thanks,
David
I'm trying to use bowtie to align a fasta file using an index and output it in sam format. Later on, I'll need to process the output sam file using another script. Anyway, I called bowtie with these options
bowtie -f -t -p 8 -n 3 -l 32 -k 1 -m 100 -S -y --chunkmbs 1024 --max FASTA_FILE.mm.fasta --best
using input of the form
>38-1
TGGAACGGAACGGAATGGAAGGGAATGGAATGGAAT
and got output of the form
38-1 0 chrY:28807964-28808132 275 255 36M * 0 0 TGGAACGGAACGGAATGGAAGGGAATGGAATGGAAT IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII XA:i:2 MD:Z:5T14A15 NM:i:2
Maybe I'm misunderstanding something, but this doesn't exactly appear to be in sam format. What I would like is for the coordinate of the leftmost position of the sequence to be in the fourth field (instead of 275--I'm not sure what that number represents, but given the label in field three, I don't think it's the coordinate) and for only the chromosome to be in the third field. I could manually modify the output, but I'm afraid to throw away that 275 because I have no idea what it is.
Does anyone know what I'm doing wrong? Any help is appreciated. If you need more information, I'll do my best to provide it.
Thanks,
David
Comment