
**westerman** · 10-11-2012, 06:20 AM

Look for Titus Brown's 'diginorm' program. It does an intelligent reduction of data. Seems to work well for genomic data. Perhaps not so well for transcriptome data.
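For readers who have not seen the idea before, here is a minimal sketch of digital normalization in Python. It is not the diginorm code itself: it assumes the published rule of keeping a read only while the median count of its k-mers is still below a coverage cutoff, and it uses an exact in-memory `Counter` where a real tool would use a memory-efficient counting structure. The function names and the defaults `k=20` and `cutoff=20` are illustrative choices.

```python
from collections import Counter
from statistics import median


def kmers(seq, k):
    """Return all overlapping k-mers of a sequence, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def normalize(reads, k=20, cutoff=20):
    """Keep a read only if its median k-mer count is below the cutoff.

    Assumption: exact counting in a Counter, which is fine for a demo
    but far too memory-hungry for a real Illumina run.
    """
    counts = Counter()
    kept = []
    for read in reads:
        words = kmers(read, k)
        if not words:
            continue  # read shorter than k: skipped in this sketch
        # Median abundance of the read's k-mers among reads kept so far.
        if median(counts[w] for w in words) < cutoff:
            counts.update(words)  # only kept reads add to the counts
            kept.append(read)
    return kept
```

Reads from regions that are already well covered get dropped, while reads from low-coverage regions are all retained, which is what makes the reduction non-random.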

**Mona** · 10-11-2012, 06:25 AM

Thanks for the suggestion. I will try that and report back if I run into further problems.

**nickloman** · 10-11-2012, 07:16 AM

Subsample reads from your files using Heng Li's seqtk program (https://github.com/lh3/seqtk) and the "sample" command.
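If you want to see what random subsampling amounts to, the following Python sketch picks a fixed number of reads from a FASTQ stream using reservoir sampling. It is an illustration only and makes no claim about how seqtk is implemented; `read_fastq`, `subsample`, and the default seed are hypothetical names and values, and the parser assumes strict four-line FASTQ records.

```python
import random
from itertools import islice


def read_fastq(handle):
    """Yield FASTQ records as lists of 4 lines (assumes no wrapped lines)."""
    while True:
        record = list(islice(handle, 4))
        if len(record) < 4:
            return  # end of input (or a truncated final record)
        yield record


def subsample(records, n, seed=11):
    """Return n records chosen uniformly at random (reservoir sampling)."""
    rng = random.Random(seed)
    reservoir = []
    for i, rec in enumerate(records):
        if i < n:
            reservoir.append(rec)  # fill the reservoir first
        else:
            j = rng.randint(0, i)  # inclusive on both ends
            if j < n:
                reservoir[j] = rec  # replace with probability n / (i + 1)
    return reservoir


# Example use (the file name is a placeholder):
# with open("reads_1.fastq") as fh:
#     picked = subsample(read_fastq(fh), 1_000_000)
```

In this sketch, running both files of a paired-end data set with the same seed and the same `n` selects the same record positions from each, so mates stay together as long as the two files list reads in the same order.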

**westerman** · 10-11-2012, 08:44 AM

Originally posted by nickloman:

Subsample reads from your files using Heng Li's seqtk program (https://github.com/lh3/seqtk) and the "sample" command.

If you are just going to randomly throw away reads then you might as well go the cheap route and not do as much sequencing in the first place.

No disrespect to Li's program, but since diginorm provides an intelligent reduction of reads, I suggest using it instead of random selection.

**krobison** · 10-11-2012, 04:19 PM

At the Boston Illumina User's Group meeting today, Illumina mentioned that BaseSpace will have an option for "quality-binning" -- by reducing quality scores to a small number of bins, the data compresses quite a bit (they claimed 50% reduction in compressed FASTQ size). An underlying assumption is that quality scores offer more gradation than programs really find useful.

Pretty trivial to implement in Perl, though I leave that as an exercise for the student :-)
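For anyone who would rather not do the exercise, a rough Python version of the idea might look like the sketch below. The bin boundaries and representative scores in `BINS` are made up for illustration and are not Illumina's actual scheme; the function name `bin_quality` and the Phred+33 offset are likewise assumptions.

```python
# Hypothetical bins: (lowest score, highest score, representative score).
# These values are illustrative only, not Illumina's published mapping.
BINS = [(0, 9, 6), (10, 19, 15), (20, 29, 25), (30, 39, 35), (40, 93, 40)]


def bin_quality(qual, offset=33):
    """Map each quality character onto its bin's representative score.

    Assumes Phred+33 encoding; scores outside every bin are left unchanged.
    """
    out = []
    for ch in qual:
        q = ord(ch) - offset
        for lo, hi, rep in BINS:
            if lo <= q <= hi:
                q = rep
                break
        out.append(chr(q + offset))
    return "".join(out)
```

Applied to the fourth line of every FASTQ record, this leaves the sequences and read lengths untouched and only shrinks the alphabet of quality characters, which is what lets gzip or bzip2 compress the file further.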

Filtering Illumina data to reduce file size
