Hi all,
I'm working on a large genome assembly (~1Gbp) with Illumina paired-end reads, and currently I'm down to ~90 000 scaffolds (N50=26kb). Now I've got some additional 454 data (single end), and would like to use that for improving my assembly. I've heard about people assembling the two sets separately, and then try to merge them into one, and also people trying to do one big assembly with all reads.
I would instead like to map the 454 reads onto my Illumina assembly, and
see if I can get rid of NNNs in the scaffolds, or even link some scaffolds to each other. I tried the Roche GSReferenceMapper, and most reads mapped fully within scaffolds, but some are marked as "Chimeric". It seems like these reads map to more then one scaffold - possibly exactly what I'm looking for! But there seems to be no way to get the information on what scaffolds they map to (and to what positions) - I guess the software discards them as wrongly mapped?
Does anyone more familiar to this software know if this information can be retrieved? Or is there a better software for this purpose?
Any input would be appreciated!
I'm working on a large genome assembly (~1Gbp) with Illumina paired-end reads, and currently I'm down to ~90 000 scaffolds (N50=26kb). Now I've got some additional 454 data (single end), and would like to use that for improving my assembly. I've heard about people assembling the two sets separately, and then try to merge them into one, and also people trying to do one big assembly with all reads.
I would instead like to map the 454 reads onto my Illumina assembly, and
see if I can get rid of NNNs in the scaffolds, or even link some scaffolds to each other. I tried the Roche GSReferenceMapper, and most reads mapped fully within scaffolds, but some are marked as "Chimeric". It seems like these reads map to more then one scaffold - possibly exactly what I'm looking for! But there seems to be no way to get the information on what scaffolds they map to (and to what positions) - I guess the software discards them as wrongly mapped?
Does anyone more familiar to this software know if this information can be retrieved? Or is there a better software for this purpose?
Any input would be appreciated!
Comment