Understanding Space Ranger Outputs

The spaceranger count and spaceranger aggr pipelines run in a directory named after the --id argument (the "pipestance" directory). Output files appear in the outs/ subdirectory within the pipestance directory.

The exact output files produced for a given pipestance depend on:

the Space Ranger version used
which pipeline was used (spaceranger count or spaceranger aggr)
the parameters specified to the pipeline
whether or not the CytAssist instrument was used
whether brightfield or fluoresence images were provided
which Visium assay was used

As an example, a spaceranger count analysis will display a message similar to the following after completion:


Outputs:
- Run summary HTML:                 /home/jdoe/runs/sample345/outs/web_summary.html
- Outputs of spatial pipeline:
    aligned_fiducials:              /home/jdoe/runs/sample345/outs/spatial/aligned_fiducials.jpg
    detected_tissue_image:          /home/jdoe/runs/sample345/outs/spatial/detected_tissue_image.jpg
    scalefactors_json:              /home/jdoe/runs/sample345/outs/spatial/scalefactors_json.json
    tissue_hires_image:             /home/jdoe/runs/sample345/outs/spatial/tissue_hires_image.png
    tissue_lowres_image:            /home/jdoe/runs/sample345/outs/spatial/tissue_lowres_image.png
    cytassist_image:                null
    aligned_tissue_image:           null
    tissue_positions:               /home/jdoe/runs/sample345/outs/spatial/tissue_positions.csv
    spatial_enrichment:             /home/jdoe/runs/sample345/outs/spatial/spatial_enrichment.csv
    barcode_fluorescence_intensity: null
- Run summary CSV:                                           /home/jdoe/runs/sample345/outs/metrics_summary.csv
- Correlation values between isotypes and Antibody features: null
- BAM:                                                       /home/jdoe/runs/sample345/outs/possorted_genome_bam.bam
- BAM BAI index:                                             /home/jdoe/runs/sample345/outs/possorted_genome_bam.bam.bai
- BAM CSI index:                                             null
- Filtered feature-barcode matrices MEX:                     /home/jdoe/runs/sample345/outs/filtered_feature_bc_matrix
- Filtered feature-barcode matrices HDF5:                    /home/jdoe/runs/sample345/outs/filtered_feature_bc_matrix.h5
- Unfiltered feature-barcode matrices MEX:                   /home/jdoe/runs/sample345/outs/raw_feature_bc_matrix
- Unfiltered feature-barcode matrices HDF5:                  /home/jdoe/runs/sample345/outs/raw_feature_bc_matrix.h5
- Secondary analysis output CSV:                             /home/jdoe/runs/sample345/outs/analysis
- Per-molecule read information:                             /home/jdoe/runs/sample345/outs/molecule_info.h5
- Loupe Browser file:                                        /home/jdoe/runs/sample345/outs/cloupe.cloupe
- Feature Reference:                                         null
- Target Panel file:                                         null
- Probe Set file:                                            null
Pipestance completed successfully!

In this case:

/home/jdoe/runs/ is where the pipeline was run
/home/jdoe/runs/sample345/ is the pipestance directory containing pipeline metadata
/home/jdoe/runs/sample345/outs/ contains the final pipeline output files
null means this particular file type was not generated for this run

When running spaceranger count on Visium v1/v2 Spatial Gene Expression libraries, the following files can be found within the outs/ subfolder:

File or Directory Name	Description
`web_summary.html`	Run summary metrics and plots in HTML format
`cloupe.cloupe`	Loupe Browser visualization and analysis file
`spatial/`	Folder containing outputs that capture the spatiality of the data.
`analysis/`	Folder containing secondary analysis data including graph-based clustering and K-means clustering (K = 2-10); differential gene expression between clusters; PCA, t-SNE, and UMAP dimensionality reduction.
`metrics_summary.csv`	Run summary metrics in CSV format
`probe_set.csv`	Copy of the input probe set reference CSV file. Present for Visium FFPE and CytAssist workflow
`possorted_genome_bam.bam`	Indexed BAM file containing position-sorted reads aligned to the genome and transcriptome, annotated with barcode information
`possorted_genome_bam.bam.bai`	Index for `possorted_genome_bam.bam`. In cases where the reference transcriptome is generated from a genome with very long chromosomes (>512 Mbp), Space Ranger v2.0+ generates a `possorted_genome_bam.bam.csi` index file instead.
`filtered_feature_bc_matrix/`	Contains only tissue-associated barcodes in MEX format. Each element of the matrix is the number of UMIs associated with a feature (row) and a barcode (column). This file can be input into third-party packages and allows users to wrangle the barcode-feature matrix (e.g. to filter outlier spots, run dimensionality reduction, normalize gene expression).
`filtered_feature_bc_matrix.h5`	Same information as `filtered_feature_bc_matrix/` but in HDF5 format.
`raw_feature_bc_matrices/`	Contains all detected barcodes in MEX format. Each element of the matrix is the number of UMIs associated with a feature (row) and a barcode (column).
`raw_feature_bc_matrix.h5`	Same information as `raw_feature_bc_matrices/` in HDF5 format.
`raw_probe_bc_matrix.h5`	Contains UMI counts of each probe for all detected barcodes in HDF5 format. Only produced when running pipelines for probe-based assays.
`molecule_info.h5`	Contains per-molecule information for all molecules that contain a valid barcode, valid UMI, and were assigned with high confidence to a gene or protein barcode. This file is required for additional analysis `spaceranger` pipelines including `aggr`, `targeted-compare` and `targeted-depth`.

Space Ranger v2.1 introduced support for Gene and Protein Expression (v2 slides only), and there are a few variations from the list above. For more information, see the Protein Expression outputs.

When spaceranger count is run on Visium HD data, outputs follow a similar format as described above. The biggest difference is that Visium HD data are binned by default. Therefore, the outputs have a hierarchical structure to match different bin levels.

File or Directory Name	Description
`binned_outputs`	By default, this directory has three subdirectories: `square_002um`, `square_008um`, and `square_016um`. Each directory contains `filtered_feature_bc_matrix`, `raw_feature_bc_matrix`, `spatial`, `filtered_feature_bc_matrix.h5`, and `raw_feature_bc_matrix.h5`. The `analysis` directory is only provided at 8 and 16 µm bin size. The `cloupe.cloupe` is only provided at 8 µm bin size. The `raw_probe_bc_matrix.h5` is only provided at 2 µm resolution.
`cloupe_008um.cloupe`	Symlink to the .cloupe file at 8 µm bin size
`feature_slice.h5`	A new file type, specific to Visium HD, to support efficient fetching of 2 µm resolution image slices for a single gene or multiple genes. See this page for more details.
`metrics_summary.csv`	Run summary metrics in CSV format
`molecule_info.h5`	Contains per-molecule information for all molecules that contain a valid barcode, valid UMI, and were assigned with high confidence to a gene barcode or bin.
`probe_set.csv`	Copy of the input probe set reference CSV file.
`spatial`	Folder containing outputs that capture the spatiality of the data. See the Spatial Outputs page for more details.
`web_summary.html`	Run summary metrics and plots in HTML format

For convenient multi-sample analysis, the spaceranger aggr pipeline generates output files that contain all the data from the individual input jobs, aggregated into single output files. The capture area suffix of each barcode is updated to prevent barcode collisions.

Visium HD is not supported with spaceranger aggr.

A successful spaceranger aggr run should conclude with a message similar to this:


Outputs:
- Aggregation metrics summary HTML:                          /opt/runs/AGG123/outs/web_summary.html
- Aggregation metrics summary JSON:                          /opt/runs/AGG123/outs/summary.json
- Secondary analysis output CSV:                             /opt/runs/AGG123/outs/analysis
- Filtered feature-barcode matrices MEX:                     /opt/runs/AGG123/outs/filtered_feature_bc_matrix
- Filtered feature-barcode matrices HDF5:                    /opt/runs/AGG123/outs/filtered_feature_bc_matrix.h5
- Copy of the input aggregation CSV:                         /opt/runs/AGG123/outs/aggregation.csv
- Loupe Browser file:                                        /opt/runs/AGG123/outs/cloupe.cloupe
- Aggregated tissue positions list:                          /opt/runs/AGG123/outs/aggr_tissue_positions.csv
- Spatial folder containing spatial images and scalefactors: /opt/runs/AGG123/outs/spatial
&nbsp;
Pipestance completed successfully!

Upon completion, all the main pipeline outputs can be found in the outs/ subfolder which has the following structure:


outs
├── aggregation.csv
├── aggr_tissue_positions.csv
├── analysis
│   ├── clustering
│   ├── diffexp
│   ├── pca
│   ├── tsne
│   └── umap
├── cloupe.cloupe
├── filtered_feature_bc_matrix
│   ├── barcodes.tsv.gz
│   ├── features.tsv.gz
│   └── matrix.mtx.gz
├── filtered_feature_bc_matrix.h5
├── spatial
│   ├── LV123
│   │   ├── scalefactors_json.json
│   │   ├── tissue_hires_image.png
│   │   └── tissue_lowres_image.png
│   ├── LB456
│   │   ├── scalefactors_json.json
│   │   ├── tissue_hires_image.png
│   │   └── tissue_lowres_image.png
│   └── LP789
│       ├── scalefactors_json.json
│       ├── tissue_hires_image.png
│       └── tissue_lowres_image.png
├── summary.json
└── web_summary.html

The files are similar to the spaceranger count pipeline outputs. The aggr pipeline also outputs a copy of the input aggregation.csv.

Overview of output structure

Visium v1/v2 Spatial Gene Expression

Visium HD Spatial Gene Expression

Aggr outputs