Hi all,
For anyone familiar with the SOAP package...
SOAPsnp takes the SOAPaligner output as its input. The only issue here is that the SOAPaligner output has to be slightly modified. The output has to be sorted first by chromosome name (alphabetically) then by chromosome coordinate (numerically). The output contains 13 tab delimited columns. Chromosome name is the 8th column and coordinate is the 9th.
My perl skills are still infantile and I'm having a tough time formatting my data.
Does anyone have a script they wouldn't mind sharing or a solution to this?
Here is an example line of the output:
SRR003674.68 GATTAAATAAATATATAGATACCTTTTCCTACTTAT ,)4E8)*;/,.914+-+,&+&)+(+$%($"($#$&# 1 a 36 + Scer_gi|93117368|ref|NC_001136.8| 1272821 2 T->31C-28 T->28C-28 36M 28T2T4
SRR003674.113 GATATGCTTGAGGATGAACGAGAAGCTAATATAGTC '%&%%%"&$&#)*)(++(+-0+1+'++(104-2-)0 1 a 36 - Scer_gi|93117368|ref|NC_001136.8| 1277159 2 G->6C-30 A->7T-26 36M 6GA28
Thanks in advance...
For anyone familiar with the SOAP package...
SOAPsnp takes the SOAPaligner output as its input. The only issue here is that the SOAPaligner output has to be slightly modified. The output has to be sorted first by chromosome name (alphabetically) then by chromosome coordinate (numerically). The output contains 13 tab delimited columns. Chromosome name is the 8th column and coordinate is the 9th.
My perl skills are still infantile and I'm having a tough time formatting my data.
Does anyone have a script they wouldn't mind sharing or a solution to this?
Here is an example line of the output:
SRR003674.68 GATTAAATAAATATATAGATACCTTTTCCTACTTAT ,)4E8)*;/,.914+-+,&+&)+(+$%($"($#$&# 1 a 36 + Scer_gi|93117368|ref|NC_001136.8| 1272821 2 T->31C-28 T->28C-28 36M 28T2T4
SRR003674.113 GATATGCTTGAGGATGAACGAGAAGCTAATATAGTC '%&%%%"&$&#)*)(++(+-0+1+'++(104-2-)0 1 a 36 - Scer_gi|93117368|ref|NC_001136.8| 1277159 2 G->6C-30 A->7T-26 36M 6GA28
Thanks in advance...
Comment