Cell Ranger ATAC1.0, printed on 12/23/2024
Entries are ordered alphabetically.
Adapters: Customized strands of base pairs created to bind with specific sequences of DNA.
ATAC: Assay for Transposase Accessible Chromatin.
Barcode: Each GEM contains a Gel Bead which carries many DNA oligos with the same barcode. Different GEMs have different barcodes.
Cell Barcode: Any barcodes that have been determined by the 'cell-calling' step of the pipeline to be associated with cells cells
Chromatin: Macromolecular complex formed by DNA, nucleosomes and other proteins that bind DNA (for example transcription factors).
Cut-site: Genome location where transposase cuts the DNA and inserts adapters.
Enhancer: an enhancer is a short (50–1500 bp) region of DNA that can be bound by transcription factors to increase the likelihood that transcription of a particular gene will occur.
Fragment: A piece of genomic DNA, bounded by two adjacent cut sites, that has been converted into a sequencer-compatible molecule with an attached cell-barcode. The alignment interval of the fragment is obtained by correcting the alignment interval of the sequenced fragment by +4 bp on the left end of the fragment, and -5bp on the right end (where left and right are relative to genomic coordinates). This is to account for the 9 bp of DNA that the tranposase occupies when it cuts the DNA (accessibility is recorded around the center of this 9 bp stretch; see figure in Algorithms). Most fragment-based metrics computed by the pipeline are based on fragments that passed various quality filters.
GEM: Gelbead-in-emulsion. A droplet containing some sample volume and barcoded Gel Bead, forming an isolated reaction volume. When referring to the subset of the sample contained in the droplet, the term 'partition' may also be used.
Histone: Protein found in eukaryotic cell nuclei that forms nucleosomes.
Library: A 10x-barcoded sequencing library prepared from a single sample, corresponding to a single chip channel of a 10x Chromium Controller run.
Nucleosome: Structural units formed by histones that help package the eukaryotic DNA into well organized chromosomes.
PCR Duplicates: During PCR amplification of the fragments, each unique fragment that is created may result in multiple read-pairs sequenced with near identical barcodes and sequence data. These duplicate reads are identified computationally, and collapsed into a single fragment record for downstream analysis.
Peak: a compact region of the genome identified as having 'open chromatin' due to an enrichment of cut-sites inside the region.
Promoter: a promoter is a region of DNA that initiates transcription of a particular gene. Promoters are located near the transcription start sites of genes, on the same strand and upstream on the DNA
Read Data: Raw genomic data from sequenced DNA.
Read-pair: Read data sequenced from one molecule. This includes read1, read2, and the barcode sequence read.
Sample: A cell or nuclei suspension extracted from a single biological source (blood, tissue, etc).
Sequencing Run (or Flowcell): A flowcell containing data from one sequencing instrument run. The sequencing data can be further addressed by lane and by one or more sample indices.
Targeted region: Any known, annotated, epigenetically relevant regions in the genome such as transcription start sites (TSS), enhancers, promoters or DNase hypersensitive sites. The pipeline metrics often refer to these targeted regions.
Transcription Factor (TF): A protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequences (like promoter or enhancers) that are commonly located in the vicinity of the gene they control.
Transposase Enzyme: Cuts open chromatin and ligates adapters to the 3' end of each strand.
Transposition: Reaction carried out by the transposase enzyme.
TSS: The transcription start site is the location where transcription starts at the 5'-end of a gene sequence