When looking at paired-end sequencing:
I noticed that during ChIP-seq, the adapters bind to both ends of a polished double-stranded DNA molecule. If we focus on a particular fragment of DNA, after PCR amplifications, you will have two sets of double-stranded DNA (one of which was originally made from the + strand, and one made from the - strand).
When this get sequenced, you will have two sets of reads. One that was made from the original + strand and one made from the original - strand. Read 1 from the sample obtained originally from the + strand will be the reverse complement of Read 2 from the sample obtained from the - strand, while Read 2 from the sample obtained from the + strand will be the reverse complement of Read 2 from the sample obtained from the - strand.
My question is do programs like BWA take this into account and simply map everything to the + strand? i.e. does BWA take all the reads that map 5'->3' on the - strand, take the reverse complement, and map them to the + strand?
I noticed that during ChIP-seq, the adapters bind to both ends of a polished double-stranded DNA molecule. If we focus on a particular fragment of DNA, after PCR amplifications, you will have two sets of double-stranded DNA (one of which was originally made from the + strand, and one made from the - strand).
When this get sequenced, you will have two sets of reads. One that was made from the original + strand and one made from the original - strand. Read 1 from the sample obtained originally from the + strand will be the reverse complement of Read 2 from the sample obtained from the - strand, while Read 2 from the sample obtained from the + strand will be the reverse complement of Read 2 from the sample obtained from the - strand.
My question is do programs like BWA take this into account and simply map everything to the + strand? i.e. does BWA take all the reads that map 5'->3' on the - strand, take the reverse complement, and map them to the + strand?
Comment