A single repo of all available/potential tabular normalization methods for downstream analysis of spatial proteomics datasets
Normalization is a critical step in the process of cell type classification in highly multiplexed imaging, a technique that enables the simultaneous measurement of multiple biomarkers within a single tissue section. In this context, normalization refers to the process of adjusting the data to correct for various sources of technical variation that can obscure true biological differences. These variations can arise from differences in staining intensity, imaging conditions, and instrument calibration, which can significantly affect the accuracy and reliability of the results.
Highly multiplexed imaging generates complex and high-dimensional data, making it imperative to employ robust normalization techniques to ensure that the data is comparable across different samples and conditions. Proper normalization helps in mitigating batch effects and other artifacts that can lead to misclassification or inaccurate identification of cell types. By standardizing the data, normalization enhances the ability to distinguish subtle biological signals from noise, thereby improving the sensitivity and specificity of cell type classification.
Furthermore, normalization facilitates the integration and comparison of data from different experiments, which is essential for large-scale studies and meta-analyses. It enables researchers to draw more reliable and reproducible conclusions, ultimately advancing our understanding of cellular heterogeneity and the intricate mechanisms underlying various biological processes and diseases.
Method | Paper/Resource | Previously Applied to | Example Code |
---|---|---|---|
arcsinh transformation | Flow cytometry Paper IMC Paper MIBI Paper testing co-factors: Option 1 |
CODEX, Flow cytometry, IMC, MIBI | Code |
Centered log-ratio based normalization | CODEX, CITE-seq | Seurat code | |
manual background identification in MCD viewer | Paper | IMC | |
semi-automated background identification in ilastik | Paper | IMC | |
Z Normalization | Paper | CODEX | Code |
log(double Z) normalization | Paper | CODEX | Code |
min_max normalization | Paper | CODEX | |
functional data analysis (FDA) or ComBAT (z-score based) | Paper mxnorm package |
MxIF | mxnorm package |
normalizing to a reference channel (example: nucleus) | Paper | MIBI, CODEX | Code |
flow-cytometry based overview: https://www.nature.com/articles/nri.2016.56
scverse scRNA seq normalization overview: https://www.sc-best-practices.org/preprocessing_visualization/normalization.html
Computational solutions for spatial transcriptomics: https://www.sciencedirect.com/science/article/pii/S2001037022003786
IMC end to end workflow: https://www.nature.com/articles/s41596-023-00881-0
Feel free to explore the provided resources and example codes to implement these normalization methods in your projects.
If you want to add more methods, please open an issue in this repository.