If you have questions or issues to report please email email@example.com.
The HCA is a collaborative effort by an international group of scientists to create a comprehensive reference map of all the cells in the human body. The HCA Data Portal and Data Coordination Platform are built by scientists and technologists at EMBL-EBI, the Broad Institute, UC Santa Cruz, and the Chan Zuckerberg Initiative. Learn more about the HCA project here.
The HCA data will eventually accept all types of single-cell data, but it currently contains single-cell RNA-sequencing data from Smart-seq2 and 10X v2 assays. While we are focused on curating a collection of data from healthy human specimens, as we grow we are accepting data from a number of sources, including model organisms and organoids, and both healthy and diseased samples. Stay tuned, the HCA is constantly evolving!
Currently, both Smart-seq2 and 10X v2 processed data can be downloaded as expression matrices from individual cell data using the CLI. The matrices for 10x data are per channel.
In addition, an expression matrix containing the combined Smart-Seq2 data from all the cells of a particular project can also be obtained by requesting an expression matrix in the Data Browser.
Note that these matrices contain Gencode IDs, rather than gene name symbols. Please contact us if you need help converting Gencode IDs to gene names.
You can learn more about each project by visiting the Projects tab in the Explore section of the Data Portal. We encourage all contributing projects to provide a detailed description of their protocol and post their protocol details at protocols.io. Publication information, where available, is also visible on the Projects page.
Yes, data from the HCA can be used for publication. Learn more about our data use agreement.
The URL for the HCA DCP Data Portal, https://data.humancellatlas.org/, can be used as the citation for data obtained from the HCA. Alternatively, you can cite: Regev A., et al. 2017. The Human Cell Atlas. Elife. Dec 5;6. pii: e27041. doi: 10.7554/eLife.27041.
The user guides in the Intro section of the data portal contain detailed information about how to access the data. We have also created a series of data consumer vignettes, which give examples of how others have accessed and used the data from the HCA.
Single-cell sequencing analysis methods are constantly changing, and there are many tools for visualizing or analyzing the data in the HCA. Popular tools for analyzing single-cell RNA-seq data include Seurat (R) and ScanPy (Python),though many other excellent tools exist. We offer a list of community-built analysis applications in the Analyze section on the Data Portal.
We encourage scientists and developers to contribute their own analysis and visualization applications and data processing pipelines to the HCA. You can learn more about how to do this in the Intro section of the Data Portal.
Visit the Contact page to learn how to collaborate with us and to reach us with your feedback.
One of our goals is to make the HCA data coordination platform code open and reusable to the community. For now, contact us at firstname.lastname@example.org with questions about reusing our code. Stay tuned for more information about reusing code in the Intro section of the Data Portal.