Hi,
We are working on finding new transposon insertion sites using NGS data. Our candidate reads should contain part of transposon sequence and part of genome sequence around its insertion site. In another words, the reads that we are interested are the reads that can not perfectly aligned to genome, therefore the duplicate removing tools based on alignment to reference are not suitable for our project.
I'm just wondering if anyone know about any tools that can remove duplicated sequence as well as condense shorter reads into the longer ones for the reads that can't aligned to reference?
Thanks in advance!
bin
We are working on finding new transposon insertion sites using NGS data. Our candidate reads should contain part of transposon sequence and part of genome sequence around its insertion site. In another words, the reads that we are interested are the reads that can not perfectly aligned to genome, therefore the duplicate removing tools based on alignment to reference are not suitable for our project.
I'm just wondering if anyone know about any tools that can remove duplicated sequence as well as condense shorter reads into the longer ones for the reads that can't aligned to reference?
Thanks in advance!
bin
Comment