Hi,
I have sequence reads in a FASTA file with a matching FASTA quality file. I am converting these into .frg input for the Celera Assembler (CA) using AMOS, first with toAmos:
Code:
toAmos -s test.fasta -q test.qual -o test.afg
and then with amos2frg:
Code:
amos2frg -i test.afg
However, the above strategy does not recognize the 'mate pairs' in my data (paired-end reads), which are 'linked' using the St Louis naming convention (basically read0012.b is paired with read0012.g, the .b and the .g denoting the forward and the reverse read, respectively).
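To make the naming convention concrete, here is a minimal Python sketch of the pairing step only. It assumes every mated read name ends in .b or .g and that the part before the suffix identifies the clone; the function names and the tab-separated output are my own illustration, not an AMOS or CA format, so the result would still need converting into whatever linkage format the assembler expects.

```python
import re
from collections import defaultdict

# Assumption: mated read names end in ".b" (forward) or ".g" (reverse),
# and everything before that suffix is the shared template name.
MATE_SUFFIX = re.compile(r"^(?P<template>.+)\.(?P<end>[bg])$")


def read_names(fasta_lines):
    """Yield the read name (first word after '>') of each FASTA header line."""
    for line in fasta_lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                yield fields[0]


def pair_mates(names):
    """Group read names by template and return (pairs, unpaired).

    pairs is a list of (forward, reverse) name tuples sorted by template;
    unpaired is a sorted list of names with no mate or no .b/.g suffix.
    """
    names = list(names)
    ends = defaultdict(dict)
    for name in names:
        match = MATE_SUFFIX.match(name)
        if match:
            ends[match.group("template")][match.group("end")] = name

    pairs = [
        (found["b"], found["g"])
        for _, found in sorted(ends.items())
        if "b" in found and "g" in found
    ]
    paired = {name for pair in pairs for name in pair}
    unpaired = sorted(set(names) - paired)
    return pairs, unpaired


def format_pairs(pairs):
    """Render pairs as tab-separated lines (a hypothetical intermediate format)."""
    return "\n".join(f"{fwd}\t{rev}" for fwd, rev in pairs)
```

Calling pair_mates(read_names(open("test.fasta"))) would give the list of mates plus any reads left without a partner, which is a useful sanity check before building the real linkage file.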
On the AMOS mailing list I got a reply to a similar question from Sven Klages, who told me that CA is usually passed a separate "linkage information" file in addition to the .frg input created above. His suggestion was to create the linkage data in 'Trace Archive' format*, and then convert that into input for CA, again using AMOS.
Has anyone tackled this problem before? Is there any 'off the shelf' code to create the mate pair data?
* http://www.ncbi.nlm.nih.gov/Traces/t...#header-global
Cheers,
Dan.
P.S. Any other good mailing lists for sequence assembly?
If you know any, I'll add them here: