Hi all,
I just start to use sed and awk. I would like to change the following fastq file
@SRR391606.9.1 1 length=50
AGCACCGCGAGGGCGGAGCTGCGTTCTCCTCTGCACAGCTTTCGGTGGTA
+SRR391606.9.1 1 length=50
@9@?<<AA:3>7A/;@?B57@6;++4=4<,=6=+++17/)1?%%%%%%%%
@SRR391606.10.1 2 length=50
GTTGCGTTCTCCTCAGCACAGACCCGGAGAGCACCGCGAGGGCGGAGCTG
+SRR391606.10.1 2 length=50
?BB?BA?AABAB>->ABB=99<AAB++35)137<>:37=<735(2-9492
into following fastq file:
@SRR391606.9 1 length=50
AGCACCGCGAGGGCGGAGCTGCGTTCTCCTCTGCACAGCTTTCGGTGGTA
+SRR391606.9 1 length=50
@9@?<<AA:3>7A/;@?B57@6;++4=4<,=6=+++17/)1?%%%%%%%%
@SRR391606.10 2 length=50
GTTGCGTTCTCCTCAGCACAGACCCGGAGAGCACCGCGAGGGCGGAGCTG
+SRR391606.10 2 length=50
?BB?BA?AABAB>->ABB=99<AAB++35)137<>:37=<735(2-9492
Basically, it is a task to remove the ".1" on every other line. As I have a huge fastq file around 30 Gb, I only put the 9th and 10th reads out of it.
Does anyone know how to do it using sed or awk? Please also explain the script in detail as I am a beginner.
Thanks a lot!
Yao
I just start to use sed and awk. I would like to change the following fastq file
@SRR391606.9.1 1 length=50
AGCACCGCGAGGGCGGAGCTGCGTTCTCCTCTGCACAGCTTTCGGTGGTA
+SRR391606.9.1 1 length=50
@9@?<<AA:3>7A/;@?B57@6;++4=4<,=6=+++17/)1?%%%%%%%%
@SRR391606.10.1 2 length=50
GTTGCGTTCTCCTCAGCACAGACCCGGAGAGCACCGCGAGGGCGGAGCTG
+SRR391606.10.1 2 length=50
?BB?BA?AABAB>->ABB=99<AAB++35)137<>:37=<735(2-9492
into following fastq file:
@SRR391606.9 1 length=50
AGCACCGCGAGGGCGGAGCTGCGTTCTCCTCTGCACAGCTTTCGGTGGTA
+SRR391606.9 1 length=50
@9@?<<AA:3>7A/;@?B57@6;++4=4<,=6=+++17/)1?%%%%%%%%
@SRR391606.10 2 length=50
GTTGCGTTCTCCTCAGCACAGACCCGGAGAGCACCGCGAGGGCGGAGCTG
+SRR391606.10 2 length=50
?BB?BA?AABAB>->ABB=99<AAB++35)137<>:37=<735(2-9492
Basically, it is a task to remove the ".1" on every other line. As I have a huge fastq file around 30 Gb, I only put the 9th and 10th reads out of it.
Does anyone know how to do it using sed or awk? Please also explain the script in detail as I am a beginner.
Thanks a lot!
Yao
Comment