Hi all,
I have a set of fastq files. Some of the fastq files have reads which are missing the 3rd line (which begins with +).
And also sequence line contains the + at the end. I guess 3rd line has been concatenated to the end of 2nd line.
Any thoughts on how to proceed with this kind of data?? Any scripts to change it into proper format ??
I have a set of fastq files. Some of the fastq files have reads which are missing the 3rd line (which begins with +).
Code:
@HWI-ST750:151:C1C6AACXX:5:2316:17997:100881 1:N:0: AGCGGTNCGCAATATTTTAGTAGCTCGTTACAGTCCGGTGCGTTTTTGGTTTTTTGAAAGTGCGTCTTCAGAGCGCTTTTGGTTTTCAAAAGCGCTCTGAAGTT+ 1:+B+0#2<DDDDIIIIIIFIIIIIIIIIIIIIEEIID?DDDBBADIII@DDDDD@@AAAAAAA??A?ADAAAA?????A>?8<>?>AAAAA8>>?><AAAA:4 @HWI-ST750:151:C1C6AACXX:5:2316:19129:100793 1:N:0: AGCGCTNCTTGATATCAATCAACTGCTAGACAAATCCAATAGTAAATTGGGTAAACCAAATCTCGATATCGACAGCAAAGTATCACAATATGCCTATAACTACA+ ;1=DD?#2ACDDDIDEEIIIEIEIEIEEIIIIDEIEIIIDEDDCEDDDEID?BBDIDIIIIEECDIDA@DD=A?D@DAAAA@DDDA>AAABE>AAAA>A>A>AA @HWI-ST750:151:C1C6AACXX:5:2316:19695:100854 1:N:0: AGCGCTNCACCGCGGTAAGCTTTAGCAGATCTCACTTTGTCTAGCGTTTGAACCATGTTTTCAAGGATATTGGCTCTAAGTTGTGGGTATTTTTCGATCACTTC+ @<1DDD#2<DFDDGI@CEEGHIIIIIIIEGIIHCGHIIIHGGIGIIAFFHFHAHHIG?CCHFHEEBBC@CDCCCCACCCC5>CCBBB'>ACDECCBBDB7?CC>
Any thoughts on how to proceed with this kind of data?? Any scripts to change it into proper format ??
Comment