Hello all,
Does anyone know a method to create a consensus sequence from an .ace file? I am assembling Solexa data against a reference using MOSAIK. I can convert the .ace file to .fasta using a script; however, this does not produce a consensus. Instead, it writes the reference followed by a separate gap-padded sequence for each read:
>.MosaikReference
AAGGACTTATGAGGACAGAGAGCGGGATGACTCTTCATATGCCACATGCAGCCCACATCCCAACACCTCT
CCCATGTCTCAAGTTGAAGGATGCGAGACCATTTCGCAGAGAAGCCGTACCAAACGCAAACACGACCATA
TCACCTGTTAGTGCCGAAGTTGGAAGCTTCCCCACCCTTGACTGCAGCACCTGGGAAGAAAGCACTTAAG
AAACCCAGGCATCCTACTGTGGATGACGATGGGGTTCATAGAGGCCATCGACTGACTAGGGCAGTGACGC
AAAGACAAGCCACTGGCGCTCGAAGTGGCCACATTTCAAGACCCAAGCGGCTCTACCCCAGACTGGACTG
ATGTGGATATTCCATTGCTGCCTAGCCCTTGGAGACGATGCGATTCGCTCTCCCTGGAGGGTAATTTGTC
AATAGATCCCATGAGACATCGAGATTGATTGAGGGCCGGTAAGGCTGAAACGGACAATGATTCAGTCATA
AAGTGTAGTGGTCACTTGATGTTGAGATGCGACTTGCCTGCAAATCGCTGTGTGCTTCGTGTACAGTCAC
TATTTGGATAACATCGAGTCTGGAGTTTTCTTTCAAGCGCCAGTTATCACAGTACTTTGAGTTTTTCTGT
TTATTGGTTGACCACCCAAAGCATCCGATTTCACGGAAGGGACACGATGGCGGTCTGCACTTATTTCCGT
GATTGAACCGGTTCAAATTAGAAAACGGGCGACAAAGTGCCACGTGCTATGCCATCGAGTATCACTTACG
AGGTTCTACCATGCTGGATGTAGGGAGCGGGGCAAAAGCCATAATTGCTGCTTTTCGGTGCCAGGTCGTA
CAAGATACGGAAAGGCATGTGTACCATACATGCACCAAGATTTCAACCTGCGGTGTAGTTGTGGCCCCCT
TCTTTGACAACACATCCCTAGATTACCCCTAAACGCTTTCCCTCAGCTTACCAGACATAATCTGCCTCTT
ATCTTATTCAGTCAGCCACG
>HWI-EAS240_0001:2:70:591:1131#0/1
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
-------------------------------------------------GCTTTTCGGTGCCAGGTCGTA
CAAGATACGGAAAGG
>HWI-EAS240_0001:2:22:830:70#0/1
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
-----------------------------------------------------TTCGGTGCCAGGTCGTA
CAAGATACGGAAAGGCATG
>HWI-EAS240_0001:2:59:1080:696#0/1
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
-------CGGAAAGGCATGTGCACCATACATGCACCAAGATTT
>HWI-EAS240_0001:2:16:926:1755#0/1
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
----------------------------------------------------------------------
---------GAAAGGCATGTGCACCATACATGCACCAAGATTTCA
Ideally, I would like a single consensus sequence built from the reads as they are aligned to the reference. Any ideas?
Thanks!
John
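
For what it's worth, here is a rough sketch of what I mean by a consensus, working from the padded FASTA shown above rather than from the .ace file itself. It takes a simple majority vote per column. Everything in it is an assumption for illustration: the function names `read_fasta` and `consensus`, the `min_depth` parameter, treating the first record as the reference, treating every `-` as "no data" (so real deletions inside a read are not called), and falling back to the lowercase reference base where no read covers a position. This is not something MOSAIK provides, just plain Python.

```python
from collections import Counter


def read_fasta(text):
    """Parse FASTA text into a list of (name, sequence) tuples."""
    records, name, chunks = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def consensus(records, min_depth=1):
    """Majority-vote consensus over gap-padded reads.

    Assumes records[0] is the reference and every later record is a read
    padded with '-' to its aligned position. Columns covered by fewer than
    min_depth reads fall back to the reference base in lowercase.
    """
    ref = records[0][1].upper()
    columns = [Counter() for _ in ref]

    # Tally the base each read contributes to each reference column.
    for _, seq in records[1:]:
        for i, base in enumerate(seq.upper()[:len(ref)]):
            if base != "-":  # '-' is treated as padding, not as a deletion
                columns[i][base] += 1

    out = []
    for ref_base, counts in zip(ref, columns):
        depth = sum(counts.values())
        if counts and depth >= min_depth:
            # Ties go to whichever base was tallied first.
            out.append(counts.most_common(1)[0][0])
        else:
            out.append(ref_base.lower())
    return "".join(out)
```

To try it, read the converted file into a string, pass it through `read_fasta`, and call `consensus` on the result. Lowercase letters in the output mark positions taken from the reference because coverage was below `min_depth`. It loops over every padded position of every read, so it will be slow on a full lane of data.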