**Michael.Ante** · 09-22-2015, 12:03 AM

BBMap can do that: I haven't used it, but it includes a tool to reorder reads while keeping pairs together.

Code:

shuffle.sh in=R1.fastq in2=R2.fastq out=R1_rand.fastq out2=R2_rand.fastq
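For anyone curious what "reorder reads while keeping pairs together" amounts to, here is a minimal Python sketch of the idea. It is not how BBMap does it internally; the function names `read_fastq` and `shuffle_pairs` are made up for illustration, and it assumes plain uncompressed 4-line FASTQ records, both files listing mates in the same order, and enough memory to hold all reads at once.

```python
import random
from itertools import islice


def read_fastq(path):
    """Yield one 4-line FASTQ record at a time as a tuple of lines."""
    with open(path) as handle:
        while True:
            record = tuple(islice(handle, 4))
            if not record:
                break
            if len(record) != 4:
                raise ValueError(f"truncated FASTQ record in {path}")
            yield record


def shuffle_pairs(r1_in, r2_in, r1_out, r2_out, seed=None):
    """Shuffle paired reads in memory; mates stay at matching positions.

    Assumption: R1 and R2 hold the same number of records in the same order.
    """
    # Pair up mate 1 and mate 2 first, then shuffle the pairs as units.
    pairs = list(zip(read_fastq(r1_in), read_fastq(r2_in)))
    random.Random(seed).shuffle(pairs)
    with open(r1_out, "w") as out1, open(r2_out, "w") as out2:
        for rec1, rec2 in pairs:
            out1.writelines(rec1)
            out2.writelines(rec2)
    return len(pairs)
```

Calling `shuffle_pairs("R1.fastq", "R2.fastq", "R1_rand.fastq", "R2_rand.fastq", seed=42)` would mirror the command above on small files; for full-size data a dedicated tool is the more sensible route.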

**fanli** · 09-22-2015, 07:06 AM

Out of curiosity, in what context(s) does the order of reads in a FASTQ file matter? I could imagine in clustering or something like that... but most mappers treat reads independently, right?

**maubp** · 09-22-2015, 08:29 AM

Read order does matter for many de novo assemblers, and also for things like k-mer based normalisation.

**methylnick** · 09-22-2015, 01:35 PM

Originally posted by fanli:

Out of curiosity, in what context(s) does the order of reads in a FASTQ file matter? I could imagine in clustering or something like that...but most mappers treat reads independently right?

I was also hoping to perform some saturation analysis, i.e. downsampling the FASTQ files to see what happens to the gene transcripts, and also to the unmapped stuff.

Good to know it does have an effect on de novo assembly, which is also something I would like to try out.

cheers

Nick
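A rough sketch of the downsampling step for that kind of saturation analysis, in Python. This is only one way to do it: the function names `fastq_records` and `downsample_pairs` are hypothetical, and it assumes uncompressed 4-line FASTQ with R1 and R2 in the same order. Each pair is kept with probability `fraction`, and one decision is made per pair so mates never get separated.

```python
import random
from itertools import islice


def fastq_records(handle):
    """Yield 4-line FASTQ records from an open file handle."""
    while True:
        record = list(islice(handle, 4))
        if len(record) < 4:
            return
        yield record


def downsample_pairs(r1_in, r2_in, r1_out, r2_out, fraction, seed=0):
    """Keep each read pair with probability `fraction`; return pairs kept.

    Assumption: R1 and R2 list mates in the same order, so reading the
    two files in lockstep keeps pairs together.
    """
    rng = random.Random(seed)  # fixed seed makes the subsample reproducible
    kept = 0
    with open(r1_in) as in1, open(r2_in) as in2, \
            open(r1_out, "w") as out1, open(r2_out, "w") as out2:
        for rec1, rec2 in zip(fastq_records(in1), fastq_records(in2)):
            # One random draw per pair, applied to both mates.
            if rng.random() < fraction:
                out1.writelines(rec1)
                out2.writelines(rec2)
                kept += 1
    return kept
```

Running it at a few fractions (say 0.1, 0.25, 0.5, 0.75) and mapping each subset gives the points for a saturation curve. Note the number of pairs kept is only approximately `fraction` times the input, since each pair is decided independently.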

**methylnick** · 09-22-2015, 01:43 PM

Originally posted by Michael.Ante:

BBMap can do that: I haven't used it, but it includes a tool to reorder reads while keeping pairs together.

Code:

shuffle.sh in=R1.fastq in2=R2.fastq out=R1_rand.fastq out2=R2_rand.fastq

Thanks for that, Michael.Ante, this looks to be what I am looking for. Fun times playing with it! Cheers.

Nick

Recreating TCGA FASTQ file from BAM and unaligned files - How to randomize reads?

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News