10x Genomics
Chromium Single Cell CNV

Cell Ranger DNA1.1 (latest), printed on 06/22/2025

Reference Support

Analysis software for the 10x Genomics single cell DNA product is no longer supported. Raw data processing pipelines and visualization tools are available for download and can be used for analyzing legacy data from 10x Genomics kits in accordance with our end user licensing agreement without support.

Cell Ranger DNA provides pre-built GRCh38 (human), GRCh37 (human) and GRCm38 (mouse) reference packages for use with the pipeline. These references come packaged with GENCODE annotations. Following current conventions, these references have the following properties:

GRCh38 (human): Chromosome names begin with "chr". The FASTA is obtained from this location, and the annotations are GENCODE V28 obtained from here. This build has no alt loci and does not have a decoy, but does have a Epstein-Barr virus decoy.
GRCh37 (human): Chromosome names do not begin with "chr". The FASTA is obtained from this location, and the annotations are GENCODE V25 (Ensembl V87) obtained from here. This version of GRCh37 has a decoy sequence (hs37d5).
GRCm38 (mouse): The FASTA is obtained from this location, and the annotations are GENCODE M17 obtained from here. This build has no alt loci.

At this time, only these human and mouse references have seen extensive testing.

Making a Reference Package

Cell Ranger DNA comes packaged with the command mkref which will construct a reference. mkref requires a single FASTA file containing the reference genome sequence and contig_defs.json file. Any alternate haplotype sequence records should be omitted from the FASTA file since these will result in a greater fraction of the genome being unmappable. An optional GTF may also be provided, which will be used solely to provide gene annotations for visualization using Loupe scDNA Browser:

$ cellranger-dna mkref <fasta_file> <contig_defs_file> [--gtf=<file.gtf.gz>]

After this process has completed, there should be a new folder called refdata-$GENOME (where $GENOME is the FASTA filename without the filetype suffix) in the current directory with the following structure:

$ tree refdata-$GENOME
├── fasta
│   ├── genome.fa
│   ├── genome.fa.amb
│   ├── genome.fa.ann
│   ├── genome.fa.bwt
│   ├── genome.fa.fai
│   ├── genome.fa.flat
│   ├── genome.fa.gdx
│   ├── genome.fa.pac
│   └── genome.fa.sa
│   └── contig-defs.json
├── genes
├── genome
├── regions
└── snps

Contig Defs file

A contig_defs.json file must be provided. This file is necessary to determine which contigs in the input FASTA are considered primary contigs, as well as keeping track of sex-chromosomes, and non-nuclear sequences.

The contig_defs.json file has the following keys:

species_prefixes: This field is not currently used and can be omitted. If this field is present, the contig names must match the species prefixes in the pattern ${prefix}_${contig}. As an example, if species_prefixes is ["GRCh38"], then primary_contigs would have the names ["GRCh38_chr1", ...].
primary_contigs: A list of primary contigs. Copy number variants will only be called on primary contigs. Primary contigs should be at least 10 megabases in length.
sex_chromosomes: A key-value list defining expected copy number for sex chromosomes in the male and female case. This field is not currently used and may be omitted.
non_nuclear_contigs: A list of non-nuclear contigs such as the mitochondrial sequence. This field is not currently used and may be omitted.

As an example, here is the contig_defs.json that comes packaged with the GRCh38 reference:

{
        "species_prefixes": [""],
        "primary_contigs": [
        "chr1", "chr2", "chr3","chr4", "chr5", "chr6", "chr7", "chr8", "chr9", "chr10", "chr11", "chr12", "chr13", "chr14", "chr15", "chr16", "chr17", "chr18", "chr19", "chr20", "chr21", "chr22", "chrX", "chrY"
        ],
        "sex_chromosomes": {
                "_male": {
                        "chrX": 1,
                        "chrY": 1
                 },
                 "_female": {
                        "chrX": 2,
                        "chrY": 0
                 }
        },
        "non_nuclear_contigs": ["chrM"]
}

GTF Gene Annotations file

A GTF file may be provided to mkref to facilitate visualization in Loupe scDNA Browser. This file may be provided to mkref gzipped or not, and only annotations with gene_type or gene_biotype attribute of protein_coding or pseudogene are considered.

10x Genomics
Chromium Single Cell CNV

Reference Support

Making a Reference Package

Contig Defs file

GTF Gene Annotations file

About

Legal Notices

Resources

Headquarters

Social

10x GenomicsChromium Single Cell CNV

Reference Support

Making a Reference Package

Contig Defs file

GTF Gene Annotations file

10x Genomics
Chromium Single Cell CNV