Seqanswers Leaderboard Ad

**swbarnes2** · 11-04-2010, 10:58 AM

You might be able to grep it out. Something like:

grep -A 3 pattern_that_is_only_in_read_1_sample_name combined_file.fq > read1.fq

**drio** · 11-04-2010, 11:40 AM

Assuming single.

Code:

$ cat ./single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[1234]/; @i = 0 if @i == 8' > one.fq && cat single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[5678]/; @i = 0 if @i == 8' > two.fq

Also, you may want to check the fastx package. It should include that feature.

**JayM** · 11-05-2010, 12:13 AM

Originally posted by drio View Post

Assuming single.

Code:

$ cat ./single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[1234]/; @i = 0 if @i == 8' > one.fq && cat single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[5678]/; @i = 0 if @i == 8' > two.fq

Also, you may want to check the fastx package. It should include that feature.

I take it 'assume single' here refers to assume single [input] file with read1 and read2.

**JayM** · 11-05-2010, 02:34 AM

Originally posted by swbarnes2 View Post

You might be able to grep it out. Something like:

grep -A 3 pattern_that_is_only_in_read_1_sample_name combined_file.fq > read1.fq

But how do you grep for read1 and not read2 from a paired end fastq given that essentially the whole name is identical except one character at the end and there are millions of such scenarios in the file...?
I'm just thinking about which pattern that could be.

**JayM** · 11-05-2010, 02:58 AM

Originally posted by drio View Post

Assuming single.

Code:

$ cat ./single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[1234]/; @i = 0 if @i == 8' > one.fq && cat single.fq | ruby -ne 'BEGIN{@i=0} ; @i+=1; puts $_  if @i.to_s =~ /[5678]/; @i = 0 if @i == 8' > two.fq

Also, you may want to check the fastx package. It should include that feature.

Wow! Thanks, it worked and an arbitrary inspection of the respective reads seems to confirm a perfect split into read1 and read2.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 27 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 26 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Splitting concatenated PE fastq to two files for respect reads

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News