maybe I should ask this in a compression forum (too) ...
but the problem only happened here, when I downloaded 1000 genome files.
Apparently they don't decompress correctly on my system, the filelengths
are strange.
downloading from :
E.g. chromosome 11 has 52335487 bytes as .gz , decompressing
gives a file of 107085824 bytes, which is a very bad compression rate
when e.g. compared to chromosome 1 which has 80MB as gz
and ~1.5GB when expanded.
Now, maybe my gzip is the wrong one ?
Although I never had problems and I downloaded and ungzipped
lots of big files recently without problem.
OK, I went to gzip-homepage, read about a recent bug
with big files > 2GB (chr11 is only 50MB) , downloaded
the recent version 1.2.4. Win32 , downloaded chromosome 11
again and decompressed it.
347996160 bytes ! More, but still not enough, e.g. much
fewer than chromosome 17.
There are similar problems with other chromosomes too,
although #17, which I had tried first seems to be correct.
(64160 lines)
Anyone else had similar problems ?
Any idea how to resolve it ?
----------------------------------------------
see also this thread:
new keyword for search engines:
README_omni_2123_samples_b37_SHAPEIT_haplotypes
but the problem only happened here, when I downloaded 1000 genome files.
Apparently they don't decompress correctly on my system, the filelengths
are strange.
downloading from :
E.g. chromosome 11 has 52335487 bytes as .gz , decompressing
gives a file of 107085824 bytes, which is a very bad compression rate
when e.g. compared to chromosome 1 which has 80MB as gz
and ~1.5GB when expanded.
Now, maybe my gzip is the wrong one ?
Although I never had problems and I downloaded and ungzipped
lots of big files recently without problem.
OK, I went to gzip-homepage, read about a recent bug
with big files > 2GB (chr11 is only 50MB) , downloaded
the recent version 1.2.4. Win32 , downloaded chromosome 11
again and decompressed it.
347996160 bytes ! More, but still not enough, e.g. much
fewer than chromosome 17.
There are similar problems with other chromosomes too,
although #17, which I had tried first seems to be correct.
(64160 lines)
Anyone else had similar problems ?
Any idea how to resolve it ?
----------------------------------------------
see also this thread:
new keyword for search engines:
README_omni_2123_samples_b37_SHAPEIT_haplotypes
Comment