10x Genomics
Chromium De Novo Assembly

Supernova2.0, printed on 07/14/2025

Overview of De Novo Assembly Software

Before generating Supernova data, please carefully read [Achieving Success with De Novo Assembly](/de-novo-assembly/guidance/doc/achieving-success-with-de-novo-assembly). Please also review [Supernova performance on twenty human and nonhuman datasets](/de-novo-assembly/software/overview/2.0/performance).

Supernova is a software package for de novo assembly from Chromium Linked-Reads that are made from a single whole-genome library from an individual DNA source. A key feature of Supernova is that it creates diploid assemblies, thus separately representing maternal and paternal chromosomes over very long distances. Almost all other methods instead merge homologous chromosomes into single incorrect 'consensus' sequences. Supernova is the only practical method for creating diploid assemblies of large genomes.

The Supernova software package includes two processing pipelines and one for post-processing:

supernova mkfastq wraps Illumina's bcl2fastq to correctly demultiplex Chromium-prepared sequencing samples and to convert barcode and read data to FASTQ files.
supernova run takes FASTQ files containing barcoded reads from supernova mkfastq and builds a graph-based assembly. The approach is to first build an assembly using read kmers (K = 48), then resolve this assembly using read pairs (to K = 200), then use barcodes to effectively resolve this assembly to K ≈ 100,000. The final step pulls apart homologous chromosomes into phase blocks, which are often several megabases in length.
supernova mkoutput takes Supernova's graph-based assemblies and produces several styles of FASTA suitable for downstream processing and analysis.

How to cite Supernova

Please refer to our 2017 paper "Direct determination of diploid genome sequences" for broad algorithmic details and assessment of computational performance and assembly quality for Supernova 1.2. There have been changes to algorithms and results since then.

For the Linked-Read laboratory technology that Supernova exploits, please refer to

Zheng GXY, et al. 2016. Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nat Biotechnol 34, 303-311. The originating paper for Linked-Reads.

Marks P, et al. 2017. Resolving the Full Spectrum of Human Genome Variation using Linked-Reads. bioRxiv. Second generation of the technology, on which Supernova is based.

10x Genomics
Chromium De Novo Assembly

Overview of De Novo Assembly Software

How to cite Supernova

About

Legal Notices

Resources

Headquarters

Social

10x GenomicsChromium De Novo Assembly

Overview of De Novo Assembly Software

How to cite Supernova

10x Genomics
Chromium De Novo Assembly