Download single-cell protein and RNA data
SCoPE2 data processed to ASCII text matrices
single cellsat 1% FDR. The first 2 columns list the corresponding protein identifiers and peptide sequences and each subsequent column corresponds to a single cell. Peptide identification is based on spectra analyzed by MaxQuant and is enhanced by using DART-ID to incorporate retention time information. See Specht et al., 2019 for details.
single cellsat 1% FDR, imputed and batch corrected.
single cells. Each column corresponds to a single cell and the rows include relevant metadata, such as, cell type if known, measurements from the isolation of the cell, and derivative quantities, i.e., rRI, CVs, reliability.
- Meta data following the Sample to Data file format (SDRF) for Proteomics project guidelines for for all single cells used in analysis constituting all figures.
- Joint protein-RNA data
single cells. Both sets imputed and batch-corrected separately then combined, taking only genes common to both data sets. Uniprot accession numbers used to denote gene.
- Signal-to-noise data
single cellsat 1% FDR. The first 2 columns list the corresponding protein identifiers and peptide sequences and each subsequent column corresponds to a single cell. The quantitation is the Signal-to-noise (S/N) ratio for each single cell’s corresponding reporter ion extracted from the RAW file. The single cell identification numbers are mapped to cell type and RAW file. Complete extracted S/N for each RAW file can be found here.
Minimal data files necessary for generating Peptides-raw.csv and Proteins-processed.csv
- Single cell proteomics data processing: The analysis of the data described here has been replicated by Christophe Vanderaa and Laurent Gatto with the scp Bioconductor package: The scp package is used to process and analyze mass spectrometry-based single cell proteomics data and is freely available from their Github repository. The scp package and the replication are described in this video.
SCoPE2 RAW data and search results from MaxQuant
The repositories below contain RAW mass-spectrometry data files generated by a Q exactive instrument as well as the search results from analyzing the RAW files by MaxQuant and by DART-ID. The files in Repository 1 were generated in the spring of 2019 and described in the first version of Specht et al., 2019. The files in Repository 2 were generated in the fall of 2019 from biological replicates and are described in the third version of Specht et al., 2019. The data in Repository 1 were generated with 11-plex TMT while data in Repository 2 were generated with 16-plex TMT pro.
- MassIVE Repository 1:
- MassIVE Repository 2:
scRNA-seq 10x Genomics RAW and processed data
A cellular mixture identical to that used for the single-cell proteomics was assessed with scRNA-seq using 10x Genomics Chromium platform and the Single Cell 3’ Library & Gel Bead Kit (v2). Two biological replicates of the cell suspension (stock concentration: 1200 cells/μl) were loaded into independent lanes of the device. An average number of about 10,000 cells/lane were recovered. Following the library preparation and sample QC by Agilent BioAnalyzer High Sensitivity chip, the two libraries were pooled together, quantified by KAPA Library Quantification kit and sequenced using the Illumina Novaseq 6000 system (Nova S1 100 flow cell) with the following run parameters: Read1: 26 cycles, i7 index: 8 cycles, Read2: 93 cycles.
- GEO Repository
- Processed data matrices:
single cellsin UMI counts.