Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
Meehan, S., Kolyagin, G.A., Parks, D. et al.When examining datasets of any dimensionality, researchers frequently aim to identify individual subsets (clusters) of objects within the dataset. The ubiquity of multidimensional data has motivated the replacement of user-guided clustering with fully automated clustering. The fully automated methods are designed to make clustering more accurate, standardized and faster. However, the adoption of these methods is still limited by the lack of intuitive visualization and cluster matching methods that would allow users to readily interpret fully automatically generated clusters. To address these issues, we developed a fully automated subset identification and characterization (SIC) pipeline providing robust cluster matching and data visualization tools for high-dimensional flow/mass cytometry (and other) data. This pipeline automatically (and intuitively) generates two-dimensional representations of high-dimensional datasets that are safe from the curse of dimensionality. This new approach allows more robust and reproducible data analysis,+ facilitating the development of new gold standard practices across laboratories and institutions.
Meehan, S., Kolyagin, G.A., Parks, D. et al. "Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization" Nature Communications Biology (2019): 229