Cell Ranger DNA 1.1 (latest), printed on 11/12/2024
Analysis software for the 10x Genomics single cell DNA product is no longer supported. Raw data processing pipelines and visualization tools are available for download and can be used for analyzing legacy data from 10x Genomics kits in accordance with our end user licensing agreement without support.
Cell Ranger DNA is a set of analysis pipelines that process Chromium single cell DNA sequencing output to align reads, identify copy number variation (CNV), and compare heterogeneity among cells.
Cell Ranger DNA includes five main pipelines:
cellranger-dna mkfastq wraps Illumina's bcl2fastq to correctly demultiplex Chromium-prepared sequencing samples and to convert barcode and read data to FASTQ files.
cellranger-dna cnv takes FASTQ files from cellranger-dna mkfastq and performs reference alignment, cell calling, copy number estimation and hierarchical clustering.
cellranger-dna bamslice takes a BAM file from cellranger-dna cnv and subsets it to specified cells of interest.
cellranger-dna aggr aggregates outputs from multiple runs of cellranger-dna cnv, aggr, or reanalyze, and reperforms secondary analysis, including copy number estimation and hierarchical clustering.
cellranger-dna reanalyze takes the HDF5 output of an existing cellranger-dna cnv, aggr, or reanalyze run, restricted to the selected barcodes or group of interest, and reperforms copy number estimation and hierarchical clustering.
Output is delivered in standard HTML, CSV, BED, BAM and HDF5 formats.
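The way the five pipelines feed into one another can be summarized as a small lookup table. The Python sketch below is illustrative only and is not part of Cell Ranger DNA: the dictionary name ACCEPTS_OUTPUT_OF and the helper is_valid_chain are hypothetical, and the model treats a workflow as a simple linear chain, which ignores that aggr combines several runs at once.

```python
# Hypothetical model of the pipeline dependencies described above.
# Keys are pipeline names from the text; each value is the set of
# pipelines whose output that pipeline accepts as input.
ACCEPTS_OUTPUT_OF = {
    "mkfastq": set(),                        # starts from raw sequencer output
    "cnv": {"mkfastq"},                      # consumes FASTQ files
    "bamslice": {"cnv"},                     # consumes the BAM file from cnv
    "aggr": {"cnv", "aggr", "reanalyze"},
    "reanalyze": {"cnv", "aggr", "reanalyze"},
}


def is_valid_chain(stages):
    """Return True if every stage accepts the output of the stage before it."""
    if not stages or any(stage not in ACCEPTS_OUTPUT_OF for stage in stages):
        return False
    return all(
        previous in ACCEPTS_OUTPUT_OF[current]
        for previous, current in zip(stages, stages[1:])
    )


if __name__ == "__main__":
    print(is_valid_chain(["mkfastq", "cnv", "aggr", "reanalyze"]))
    print(is_valid_chain(["mkfastq", "bamslice"]))
```

Under this model, bamslice is only reachable directly from cnv, while aggr and reanalyze can be chained repeatedly on each other's output.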
Throughout the documentation, you will see references to samples, libraries, sequencing runs, Martian and pipestance. We define these as follows:
Note that a single library may be sequenced across multiple flowcells, and then combined as if all the reads came from one sequencing run. Additionally, a single flowcell may contain multiple libraries, separated using different lanes and/or sample indices.
Cell Ranger DNA allows for the analysis of a single 10x-barcoded sequencing library prepared from a sample that is sequenced across one or more flowcells. The analysis of multiple libraries generated from the same sample, for example using multiple channels on a chip, is not currently supported.
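The relationship between libraries and flowcells, and the single-library restriction, can be made concrete with a short Python sketch. Everything here is an assumption made for illustration: the ReadSet record, its field names, the identifiers in the example data, and the helper flowcells_for_single_library are hypothetical and do not correspond to any Cell Ranger DNA file format or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReadSet:
    """One batch of reads, identified by where it was sequenced (hypothetical)."""
    library: str
    flowcell: str
    lane: int
    sample_index: str


def flowcells_for_single_library(read_sets):
    """Return the sorted flowcells contributing to one library.

    Raises ValueError if the reads come from zero or several libraries,
    mirroring the restriction that one analysis covers one library.
    """
    libraries = {rs.library for rs in read_sets}
    if len(libraries) != 1:
        raise ValueError(
            f"expected reads from exactly one library, got {sorted(libraries)}"
        )
    return sorted({rs.flowcell for rs in read_sets})


if __name__ == "__main__":
    # Made-up example: library "lib1" is spread over two flowcells, and
    # flowcell "FC_A" also carries a second library on another lane.
    reads = [
        ReadSet("lib1", "FC_A", 1, "idx-1"),
        ReadSet("lib1", "FC_B", 1, "idx-1"),
        ReadSet("lib2", "FC_A", 2, "idx-2"),
    ]
    lib1_only = [rs for rs in reads if rs.library == "lib1"]
    print(flowcells_for_single_library(lib1_only))
```

Filtering to one library before the check reflects the supported case: reads from several flowcells are combined, but reads from different libraries are not.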