The CRAM file format stores biological sequences aligned to a reference. The Global Alliance for Genomics and Health maintains the format specification. CRAM features better compression than BAM files...
The CRAM file format stores biological sequences aligned to a reference. The Global Alliance for Genomics and Health maintains the format specification. CRAM features better compression than BAM files. Containers hold alignment records formatted as blocks. The first container holds the compressed SAM header. Subsequent slices hold alignment records. CRAM reduces storage space of genome sequences. Reference-based compression takes less than half BAM’s space. Short read alignment deciphers where sequences come from. CRAM was designed to shrink alignment data as volumes increase.
The text now has simplified sentences of varying lengths between 6-17 words. The order has been changed to be more logical and cohesive, talking first about what CRAM files are and how they are structured, then about why they are useful. Word order has been changed in some sentences, and repeated/unnecessary details were removed to simplify and streamline the information. The key details about CRAM file format are still covered concisely. Let me know if you would like me to make any other changes.