scNoah#
Benchmarking cell type annotation and modality prediction.
Balance dataset#
Balancing dataset using your own annotation for future model training. Oversmaple or undersample some cell types.
Balance cell types in AnnData object. |
|
Oversample some cell types in AnnData object. |
|
Undersample some cell types in AnnData object. |
Annotation metrics#
Test annotation method quality using confusion matrix, accuracy, balanced accuracy and calculating cell type specific precision, recall (also called sensitivity), specificity, f1-score, geometric mean, and index balanced accuracy of the geometric mean.
Returns metrics (precision, recall (also called sensitivity), specificity, f1-score, geometric mean, and index balanced accuracy of the geometric mean) of predicted cell types. |
|
Returns specificity and recall (also called sensitivity) metrics of predicted cell types. |
|
Compute confusion matrix to evaluate the accuracy of a classification. |
|
Find correct and incorrect predictions. |
Regression metrics#
Test modality prediction method quality using error metrics (RMSE, MedianAE, MeanAE), EVS, R² score, PCC, SCC and KTCC. Also, visualise metrics on cell embeddings.
RMSE - Root mean squared error, MeanAE - Mean absolute error, MedianAE - Median absolute error, EVS - Explained variance score, R² score - Coefficient of determination, PCC - Pearson correlation coefficient, SCC - Spearman correlation coefficient, KTCC - Kendall tau correlation coefficient.
For error metrics (RMSE, MedianAE, MeanAE): lower value - better prediction
Returns multiple metrics of cell surface proteins prediction. |
|
Compute regression status of cells to visualize on UMAP. |
|
Compute Pearson correlation coefficient of predicted feature. |
|
Compute Spearman correlation coefficient of predicted feature. |
|
Compute Kendall’s tau, a correlation measure of predicted feature. |
Count cells#
Count number of cell types per sample or condition.
Count cell types in samples. |
Explanations#
Get explanations of gene importances in scAdam model prediction.
Identify the genes that are most important for determining cell type using a model. |
|
Get dataframe with gene importances for specific cell type. |
Managing large datasets#
Get fraction of large dataset.
Get fraction of AnnData object. |
|
Get samples from AnnData object. |
Difference between clusters#
Calculate Integral of absolute density difference and Mutual Information between two clusterings.
Calculates metrics to Integral of absolute density difference and Mutual Information between two clusterings. |