Given an \(n x m\) matrix of centroids, where \(m\) are the prototypic centroids with \(n\) features, classify new samples according to the distance to the centroids.

classify(data, centroid, method = "pearson")

Arguments

data

a data.frame of dimensions \(n x p\) with the samples to classify, were \(n\) are the same set of features as in the centroids

centroid

a data.frame of dimensions \(n x m\), where each column is a prototypic centroid to classify the samples

method

Character string indicating which method to use to calculate distance to centroid. Options are "pearson" (default), "kendall", or "spearman"

Value

Returns a numeric vector of length \(p\) with the class assigned to each sample according to the shortest distance to centroid

Examples

if (FALSE) { rna_luad<-use_rna_luad() #The expression of the toy datasets are already scaled prm <- rna_luad$TCGA$expression_matrix #We change the rownames to be gene Symbol insted of Gene Id. rownames(prm)<- rna_luad$TCGA$feature_data$gene #Wilkerson's centroids centroids<- rna_luad$WilkCentroids #Extract features from both data.frames inBoth<- Reduce(intersect, list(rownames(prm),rownames(centroids))) #Classify samples Wilk.Class<- classify(prm[inBoth,],centroids[inBoth,]) table(Wilk.Class) }