Reduces dimensionality to 2D using principal component analysis (PCA) and displays a dynamic visualization of two principal components (PC).
pcplot(data, clusters, barColor = "steelblue", colorScheme = "schemeCategory10", width = NULL, height = NULL, labelSizes = NULL, dotSize = NULL, pcGridlines = FALSE, barGridlines = FALSE)
data | A dataframe of numeric columns. Scaled data is preferred as PCA does not work the same with non-scaled data. |
---|---|
clusters | A named integer matrix of clusters where names are the row names
of the above dataframe and integers are the integer value of the row's associated cluster.
This can be obtained from a function such as |
barColor | The color to use for the bar-chart fill. May be any html color (hex or named). |
colorScheme | The color scheme of the PCA plot. May be a pre-configured D3 ordinal color scheme or a vector of html colors (hex or named) of the same length as the number of clusters. |
width | The width of the plot window. |
height | The height of the plot window. |
labelSizes | A number or list of any combination of parameters shown that define the label sizes. |
dotSize | A number to adjust the size of the dots. |
pcGridlines |
|
barGridlines |
|
Clicking on axis labels will display a bar-chart of PC contribution
Clicking on legend colors will fade out all points but the points in the cluster selected
Hover over points to see the label and point coordinates
# Barebones scaled_df <- scale(state.x77) clus <- kmeans(scaled_df, 5)$cluster pcplot(data = scaled_df, clusters = clus) # With Options scaled_df <- scale(state.x77) clus <- kmeans(scaled_df, 5)$cluster pcplot(data = scaled_df, clusters = clus, barColor = "red", colorScheme = c("red", "green", "orange", "blue", "yellow"), labelSizes = list(yaxis = 20, yticks = 15, tooltip = 25), pcGridlines = TRUE, barGridlines = TRUE)