OmniCellTOSG is the first dataset of cell text-omic signaling graphs (TOSGs). Each TOSG represents the signaling network of an individual cell or meta-cell and is labeled with information such as organ, disease, sex, age, and cell subtype.
The dataset integrates human-readable annotations and quantitative gene and protein abundance data, enabling graph reasoning to decode cell signaling.
It is built from single-cell RNA sequencing data of approximately 120 million cells from diverse tissues and conditions, and is compatible with PyTorch.
By pairing labeled cell-level signaling graphs with textual and quantitative features, OmniCellTOSG has the potential to advance research in life sciences, healthcare, and precision medicine.
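
To make the structure described above concrete, the following is a minimal sketch of how a single TOSG might be held in memory: nodes that carry both a text annotation and a numeric abundance, directed signaling edges, and graph-level labels. The class names (`Node`, `TOSG`), the helper `build_toy_graph`, and all entity names and values are hypothetical placeholders chosen for illustration. They are not the dataset's actual schema or loading API.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    """One entity in the signaling graph (hypothetical layout)."""
    name: str          # identifier, e.g. a gene or protein symbol
    description: str   # human-readable annotation (text modality)
    abundance: float   # quantitative measurement (omic modality)


@dataclass
class TOSG:
    """A single text-omic signaling graph with graph-level labels."""
    nodes: list[Node]
    edges: list[tuple[int, int]]  # directed (source index, target index)
    labels: dict[str, str] = field(default_factory=dict)

    def edge_index(self) -> list[list[int]]:
        # COO layout: first row holds sources, second row holds targets.
        sources = [s for s, _ in self.edges]
        targets = [t for _, t in self.edges]
        return [sources, targets]

    def feature_matrix(self) -> list[list[float]]:
        # One row per node, one numeric feature (abundance) per row.
        return [[n.abundance] for n in self.nodes]


def build_toy_graph() -> TOSG:
    # All names, descriptions, and values below are made-up placeholders.
    nodes = [
        Node("GENE_A", "placeholder annotation for a receptor gene", 2.5),
        Node("GENE_B", "placeholder annotation for a kinase gene", 0.0),
        Node("PROTEIN_B", "placeholder annotation for a protein", 1.25),
    ]
    edges = [(0, 1), (1, 2)]
    labels = {
        "organ": "lung",
        "disease": "normal",
        "sex": "female",
        "age": "adult",
        "cell_subtype": "placeholder",
    }
    return TOSG(nodes=nodes, edges=edges, labels=labels)


if __name__ == "__main__":
    graph = build_toy_graph()
    print(graph.edge_index())
    print(graph.feature_matrix())
    print(graph.labels)
```

The nested lists returned by `edge_index` and `feature_matrix` are plain Python, so under this assumed layout they could be passed to `torch.tensor` to obtain inputs for a graph learning model, while the `description` strings would be encoded separately by a text model.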