Hi all,
We are doing some whole genome on a solid 5500 machine and using all 12 lanes for each sample. My current plan is to align each lane individually, then merge the aligned files together to use in my downstream analysis. My question is regarding the individual sequence read Ids given by the machine. I know that on each lane of the machine, each read has a unique ID, but is that true also for each lane. So for example, if I have read ID of 100_1000_1011, will the be on all lanes of the machine? If so, after I merge everything together, will downstream programs mark these duplicated read names in some way and cause problems for me?
Thanks
We are doing some whole genome on a solid 5500 machine and using all 12 lanes for each sample. My current plan is to align each lane individually, then merge the aligned files together to use in my downstream analysis. My question is regarding the individual sequence read Ids given by the machine. I know that on each lane of the machine, each read has a unique ID, but is that true also for each lane. So for example, if I have read ID of 100_1000_1011, will the be on all lanes of the machine? If so, after I merge everything together, will downstream programs mark these duplicated read names in some way and cause problems for me?
Thanks
Comment