Seqanswers Leaderboard Ad

**Richard Finney** · 01-29-2012, 09:24 AM

It appears that razip uses gzip compression (or other kinds of compression) but is not gzip. The output contains chunks of (gzipped) compressed data but the entire file is not a gzipped file. razip output cannot be gUNzipped (but can be uncompressed using razip). zgrep tries to UNgzip the file, succeeds party, but then runs into the uncompressed part and fails.

**splaisan** · 01-30-2012, 12:11 AM

Thanks Richard

I wrongly assumed this was a bonafide archive. Never mind, I will keep a second copy compressed with bgzip to reduce storage as compared to the plain 3GB fasta and use one or the other depending on the needs.

Great help, thanks
Stephane

Originally posted by Richard Finney View Post

It appears that razip uses gzip compression (or other kinds of compression) but is not gzip. The output contains chunks of (gzipped) compressed data but the entire file is not a gzipped file. razip output cannot be gUNzipped (but can be uncompressed using razip). zgrep tries to UNgzip the file, succeeds party, but then runs into the uncompressed part and fails.

**lh3** · 01-30-2012, 12:32 PM

It is part of the razip problem and part of gzip. If you run "gzip -dc | grep", you get normal output. But if you run "gzip -dcf | grep", which is what zgrep is actually calling, you get those rubbish.

**splaisan** · 01-31-2012, 01:10 AM

thanks Heng

This helps a lot and I will alias it for regular use!
You just saved me several gigabites of disk space.
Cool
Stephane

Originally posted by lh3 View Post

It is part of the razip problem and part of gzip. If you run:

Code:

gzip -dc | grep

, you get normal output. But if you run "gzip -dcf | grep", which is what zgrep is actually calling, you get those rubbish.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

compressing reference genome and indexing

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News