Recently ran a single HiScan lane of shotgun DNA size-selected at ~700-800bp insert size (user wanted large inserts to avoid repeats as much as possible without mate-pair). We got lower cluster density than usual but lane still returned ~14Gbp sequence from a 2x101bp run. However, the user has reported a very high level (~80%) of broken pairs when mapping reads to references or de novo assembly. I was wondering if anyone here has come across this issue and knows a way around it?
Some possible causes I can think of:
1) insert size - largest we've tried, but don't think this is the cause as I've seen people here post results from 1.5Kbp MiSeq runs.
2) this was a single indexed sample (only sample in lane) but the rest of the run employed indexing, so the sequencer had trouble with this lane due to the lack of bases in both laser channels during the index read. Only 20% of reads successfully found the index. I merged the FASTQ files from the indexed folder and the unaligned folder for this lane to give to the user, with the caveat that he would need to filter out the PhiX reads from the unaligned portion.
Any help appreciated.
Some possible causes I can think of:
1) insert size - largest we've tried, but don't think this is the cause as I've seen people here post results from 1.5Kbp MiSeq runs.
2) this was a single indexed sample (only sample in lane) but the rest of the run employed indexing, so the sequencer had trouble with this lane due to the lack of bases in both laser channels during the index read. Only 20% of reads successfully found the index. I merged the FASTQ files from the indexed folder and the unaligned folder for this lane to give to the user, with the caveat that he would need to filter out the PhiX reads from the unaligned portion.
Any help appreciated.
Comment