Capacity-approaching DNA storage

DNA Fountain is a strategy to store and retrieve DNA information that is robust and approaches the theoretical maximum of information that can be stored per nucleotide. The success of our strategy lies in careful adaptation of recent developments in coding theory to the domain specific constraints of DNA storage.
Our approach opens the possibility of highly reliable DNA-based storage that approaches the information capacity of DNA molecules and enables data retrieval for virtually an unlimited number of times.

Capacity-approaching DNA storage
Yaniv Erlich, Dina Zielinski
BioRxiv: http://biorxiv.org/content/early/2016/09/09/074237.1

Decoding Example

This video demonstrates the DNA Fountain decoding: from complete recovery of the encoded tar.gz file and extraction of its content to running the archived operating system (Kolibri) on a virtual machine.

 

Resources

  • Source code: DNA-Fountain on Github
  • The individual files that were packaged in a tar ball and compressed:
    dna-fountain-input-files.tar.gz
    For obvious reasons, we removed the Amazon gift card. To reveal the card, please decode the data!
  • Master data: the encoded oligonucleotides as sequenced on a MiSeq machine:
    (These files will be released once the paper is published)
    master_copy.1.fastq.gz (73MB)
    master_copy.2.fastq.gz (77MB)
  • Deep Sequencing data: the encoded oligonucleotides as sequenced on a MiSeq machine:
    (These files will be released once the paper is published)
    deep_copy.1.fastq.gz (4.5GB)
    deep_copy.2.fastq.gz (1.8GB)

Contact