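Let's start with the introduction: "The ingest function assumes an annotated reference dataset that captures the biological variability of interest." (This sentence matters: you first need a well-annotated reference dataset, which is then used to "capture" the biological variation in the disease samples.) "The rationale is to fit a model on the reference data and use it to project new data." (In other words, fit a model on the reference dataset, then use that model to map new datasets onto it.) "For the time being, this model is a PCA combined with a neighbor lookup search tree, for which we use UMAP's implementation."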