09-18-2012, 09:08 AM   #1
dan
wiki wiki
 
Location: Cambridge, England

Join Date: Jul 2008
Posts: 266
Challenging read mapping problem

Hi,

I have about 70 Gbp of paired-end data from potato [1] that I'd like to map to one of the chromosomes (chr04). I only want to keep pairs in which at least one end (either one, or both) maps to chr04, and I want the whole pair returned in that case.
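To pin down the selection rule, here is a rough sketch of the test I have in mind, assuming the reads have already been aligned by some mapper and written out as SAM text. The function name `record_in_target_pair` is made up for illustration, and I'm assuming the reference sequence is literally named "chr04" in the SAM output. It only expresses the criterion; it says nothing about how to do the mapping efficiently.

```python
def record_in_target_pair(sam_line, target="chr04"):
    """Return True if this read or its mate is aligned to `target`.

    Hypothetical helper: expects one tab-separated SAM alignment line
    (not a header line starting with '@').
    """
    fields = sam_line.rstrip("\n").split("\t")
    flag = int(fields[1])                 # FLAG column
    rname, rnext = fields[2], fields[6]   # RNAME and RNEXT columns
    if rnext == "=":                      # "=" means the mate is on the same reference as RNAME
        rnext = rname
    read_hit = not (flag & 0x4) and rname == target  # 0x4: this read is unmapped
    mate_hit = not (flag & 0x8) and rnext == target  # 0x8: the mate is unmapped
    return read_hit or mate_hit
```

Applying this to every alignment line keeps both records of a pair whenever either end hits chr04, which is the behaviour I'm after.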

Aside from the scale of the problem, the format of the files is a pain, because the two ends of each pair are in separate files.

Any tips on how to do this (efficiently)?

A prize goes to the best answer ;-)


Cheers,
Dan.

[1] http://www.ebi.ac.uk/ena/data/view/SRA029323
__________________
Homepage: Dan Bolser
MetaBase the database of biological databases.

Last edited by dan; 09-18-2012 at 09:09 AM. Reason: Either end or both ends can match, and I want to return the pair.