**maubp** · 03-01-2012, 01:29 AM

I've heard Sanger is considering it, perhaps even this year if CRAM continues to mature rapidly.

**lh3** · 03-01-2012, 05:22 AM

Yes, CRAM has great potential. It may ultimately replace BAM (if CRAM does not, some other binary format will sooner or later). Nonetheless, CRAM cannot replace BAM right now. It does not (or at least did not) support all the tags. I do not know the progress on compressing unmapped reads. Furthermore, I am concerned about the compression model. I also think lossy compression is the way to go, but it should be done by reducing the resolution of the quality scores, not by selectively dropping all the quality information.

**maubp** · 03-01-2012, 05:41 AM

Originally posted by lh3

Yes, CRAM has great potential. It may ultimately replace BAM (if CRAM does not, some other binary format will sooner or later). Nonetheless, CRAM cannot replace BAM right now. It does not (or at least did not) support all the tags.

Support for all the tags is expected in CRAM 0.7, which is due soon; see e.g.

[Bioperl-l] fastq splitter

http://lists.open-bio.org/pipermail/bioperl-l/2012-March/036295.html

Originally posted by lh3

I do not know the progress on compressing unmapped reads.

I heard at a recent seminar that the CRAM team are looking at doing a mini-assembly of the unmapped reads in order to generate dummy reference sequences, which can then be used for reference-based compression. If I understood correctly, this might be transparent to the user.
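For readers unfamiliar with the idea, reference-based compression can be sketched roughly as below. This is only an illustration and not CRAM's actual encoding: the function names `encode_read` and `decode_read` are made up for the example, and it assumes an ungapped alignment with substitutions only (no indels or clipping).

```python
def encode_read(reference, read, pos):
    """Store an aligned read as position, length and substitutions only.

    Hypothetical helper: assumes the read aligns to reference[pos:] without gaps.
    """
    diffs = [
        (i, base)
        for i, base in enumerate(read)
        if reference[pos + i] != base
    ]
    return pos, len(read), diffs


def decode_read(reference, record):
    """Rebuild the read from the reference plus the stored substitutions."""
    pos, length, diffs = record
    bases = list(reference[pos:pos + length])
    for i, base in diffs:
        bases[i] = base
    return "".join(bases)


reference = "ACGTACGTAC"
record = encode_read(reference, "GTTC", 2)  # one mismatch against "GTAC"
restored = decode_read(reference, record)
```

A read that matches the reference perfectly costs only a position and a length, which is why a dummy reference assembled from unmapped reads could let them be compressed the same way.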

Originally posted by lh3

Furthermore, I am concerned with the compression model. I also think lossy compression is the way to go, but this should be done by reducing the resolution of quality, instead of by selectively dropping all the quality information.

Also at the same seminar we were told CRAM has several modes of quality compression, one of which is simply reducing the resolution.
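To make "reducing the resolution" concrete, here is a minimal sketch of quality binning. The bin boundaries and the names `BINS`, `bin_quality` and `reduce_resolution` are assumptions chosen for illustration; they are not taken from CRAM. It assumes Phred+33 encoded quality strings.

```python
# Hypothetical bins: (lowest score, highest score, representative score).
BINS = [
    (0, 1, 0),
    (2, 9, 6),
    (10, 19, 15),
    (20, 29, 25),
    (30, 39, 35),
    (40, 93, 40),
]


def bin_quality(score):
    """Map a Phred score to the representative value of its bin."""
    for low, high, representative in BINS:
        if low <= score <= high:
            return representative
    raise ValueError(f"quality score out of range: {score}")


def reduce_resolution(qualities, offset=33):
    """Return a quality string that uses only the representative scores."""
    return "".join(
        chr(bin_quality(ord(char) - offset) + offset) for char in qualities
    )


binned = reduce_resolution("!5I")
```

Every base keeps a quality value, but the string now uses at most six distinct symbols, so a general-purpose compressor has far less entropy to encode than with the full score range.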

**lh3** · 03-01-2012, 06:19 AM

That is great! Hope these can be done soon!

**cjfields** · 03-01-2012, 08:56 AM

That does seem very promising.

**vadim** · 03-09-2012, 02:19 AM

I am the developer of CRAM and can answer any questions about it.

The code is here:

https://github.com/vadimzalunin/crammer/

Documentation can be found here:

http://www.ebi.ac.uk/ena/about/cram_toolkit

We just released v0.7, which is not yet a long-term support release but is stable enough to try out.

Cram
