10x Genomics
Chromium Single Cell ATAC

Cell Ranger ATAC1.0, printed on 07/29/2025

Cell Ranger ATAC Reference packages

Overview

The reference data consists of the reference genome sequence and its associated genome annotation, which includes gene and transcript coordinates, regulatory regions and transcription factor motifs. Both the genome sequence and annotation packaged with the software are derived from reputable, well-established consortia such as NCBI, GENCODE, Ensembl and ENCODE. The exact files in the reference directory have undergone minimal processing from the source files directly downloaded from each consortium (details below).

Versions

The provided single species reference packages are:

Human GRCh37 build in two variants:
- hg19/UCSC-style chromosome naming convention ("chr1", "chrM")
- b37/1000 Genomes-style chromosome naming convention ("1", "MT")
Human GRCh38 build.
Mouse mm10 build.

Please note that Cell Ranger ATAC 1.0.0 does not currently support the ability to build custom references.

Note that for GRCh38, we do not use the decoy and alternate contigs in any analysis steps in the pipeline.

For mutli-species experiments, we provide the following reference packages that are combinations of some of the single species builds above. These are made by taking the union of reference sequences and annotations.

hg19_and_mm10
grch38_and_mm10

Note that the contigs names are prefixed by species build. Eg. chr1 from hg19 is labelled as hg19_chr1 inside the hg19_and_mm10 build.

Genome sequences

All genome sequences are in the "fasta" directory, in which the raw fasta data is downloaded from NCBI. Genome index files are also created by samtool faidx, bwa and pysam. Finally, a contig definition json file is created to be read by the pipeline to parse the contents of the reference package.

Gene annotation

Gene annotations are downloaded from GENCODE and the version of "basic" annotation (instead of the "comprehensive") is used (links are at hg19, GRCh38 and mm10).

Regulatory regions

Regulatory regions are downloaded from the following sources:

The annotations for promoter and enhancer regions are from Ensembl regulatory build.
The DNase hypersensitivity regions are from ENCODE (for hg19, b37 and mm10) or Anshul Kundaje's published pipeline (GRCh38).
The blacklist regions are from ENCODE (hg19, b37, GRCh38 and mm10).
Transcription start sites (TSS) are generated by extracting the first nucleotide position of each transcript from Gencode annotation.
Transcription factor motifs are from JASPAR vertebrate non-redundant collection. Motifs are renamed to contain the species name regardless of the original species of the motif annotated in JASPAR database.

10x Genomics
Chromium Single Cell ATAC

Cell Ranger ATAC Reference packages

Overview

Versions

Genome sequences

Gene annotation

Regulatory regions

About

Legal Notices

Resources

Headquarters

Social

10x GenomicsChromium Single Cell ATAC

Cell Ranger ATAC Reference packages

Overview

Versions

Genome sequences

Gene annotation

Regulatory regions

10x Genomics
Chromium Single Cell ATAC