Multidimensional Scaling

In general, Multidimensional Scaling (MDS) refers to techniques that transform samples into a lower-dimensional space while preserving the inter-sample distances as well as possible.

Example

Performing MDS on the Iris data set:

using MultivariateStats, RDatasets, Plots

# load iris dataset
iris = dataset("datasets", "iris")

# take half of the dataset
X = Matrix(iris[1:2:end,1:4])'
X_labels = Vector(iris[1:2:end,5])

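Suppose X is our data matrix, with each observation in a column. We train an MDS model, allowing up to 3 dimensions. Setting distances=false indicates that X holds the observations themselves rather than a precomputed distance matrix: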

M = fit(MDS, X; maxoutdim=3, distances=false)

Classical MDS(indim = 4, outdim = 3)

Then, apply the MDS model to get an embedding of our data in 3D space:

Y = predict(M)

3×75 Matrix{Float64}:
 2.71359     2.90321     2.75875   …  -2.39001   -1.51972   -1.87717
 0.238246   -0.233575    0.228345      0.333917  -0.297498   0.0985705
 0.0140596   0.0221454  -0.104549     -0.520729   0.181055  -0.717537

Now, we group the results by species label for color coding and visualize the three embedding coordinates in a 3D plot:

setosa = Y[:,X_labels.=="setosa"]
versicolor = Y[:,X_labels.=="versicolor"]
virginica = Y[:,X_labels.=="virginica"]

p = scatter(setosa[1,:],setosa[2,:],setosa[3,:],marker=:circle,linewidth=0)
scatter!(versicolor[1,:],versicolor[2,:],versicolor[3,:],marker=:circle,linewidth=0)
scatter!(virginica[1,:],virginica[2,:],virginica[3,:],marker=:circle,linewidth=0)

Classical Multidimensional Scaling

This package defines an MDS type to represent a classical MDS model^[1], and provides a set of methods to access its properties.

MultivariateStats.MDS — Type

Classical Multidimensional Scaling (MDS), also known as Principal Coordinates Analysis (PCoA), is a specific technique in this family that accomplishes the embedding in two steps:

Convert the distance matrix to a Gram matrix. This conversion is based on the following relation between a distance matrix $\mathbf{D}$ and a Gram matrix $\mathbf{G}$:

\[\mathrm{sqr}(\mathbf{D}) = \mathbf{g} \mathbf{1}^T + \mathbf{1} \mathbf{g}^T - 2 \mathbf{G}\]

Here, $\mathrm{sqr}(\mathbf{D})$ indicates the element-wise square of $\mathbf{D}$, and $\mathbf{g}$ is the vector of diagonal elements of $\mathbf{G}$. This relation is itself based on the following decomposition of squared Euclidean distance:

\[\| \mathbf{x} - \mathbf{y} \|^2 = \| \mathbf{x} \|^2 + \| \mathbf{y} \|^2 - 2 \mathbf{x}^T \mathbf{y}\]

Perform eigenvalue decomposition of the Gram matrix to derive the coordinates.
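The two steps above can be sketched with Julia's LinearAlgebra standard library alone. This is a minimal illustration under stated assumptions, not the package's implementation: the helper names `gram_from_distances` and `classical_mds_sketch` are hypothetical, and the Gram matrix is recovered by double centering, which assumes the embedded points are centered at the origin.

```julia
using LinearAlgebra

# Step 1: recover a Gram matrix from a distance matrix by double centering.
# Assumption: the configuration is centered, so G = -1/2 * J * sqr(D) * J,
# where J = I - (1/n) * 1 * 1' is the centering matrix.
function gram_from_distances(D::AbstractMatrix{<:Real})
    n = size(D, 1)
    J = I - fill(1 / n, n, n)
    return Symmetric(-0.5 * J * (D .^ 2) * J)
end

# Step 2: eigendecompose the Gram matrix and scale the leading eigenvectors.
function classical_mds_sketch(D::AbstractMatrix{<:Real}, p::Integer)
    G = gram_from_distances(D)
    F = eigen(G)                               # eigenvalues in ascending order
    idx = sortperm(F.values; rev=true)[1:p]    # indices of the p largest eigenvalues
    λ = max.(F.values[idx], 0.0)               # clamp round-off negatives to zero
    return Diagonal(sqrt.(λ)) * F.vectors[:, idx]'   # p × n, one column per point
end

# Four corners of a unit square, one observation per column.
X = [0.0 1.0 1.0 0.0;
     0.0 0.0 1.0 1.0]
D = [norm(X[:, i] - X[:, j]) for i in 1:4, j in 1:4]

# Check the relation sqr(D) = g 1' + 1 g' - 2G for the recovered Gram matrix.
G = gram_from_distances(D)
g = diag(G)
relation_holds = D .^ 2 ≈ g * ones(4)' + ones(4) * g' - 2 * G

# Embed in 2D and recompute the pairwise distances of the embedding.
Y = classical_mds_sketch(D, 2)
D_embedded = [norm(Y[:, i] - Y[:, j]) for i in 1:4, j in 1:4]
```

Because the input distances here are exactly Euclidean in two dimensions, `D_embedded` should agree with `D` up to round-off, even though `Y` itself may be a rotated or reflected copy of the centered `X`.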

Note: The Gramian derived from $\mathbf{D}$ may have non-positive or degenerate eigenvalues. The subspace of non-positive eigenvalues is projected out of the MDS solution so that the strain function is minimized in a least-squares sense. If the smallest remaining eigenvalue that is used for the MDS is degenerate, then the solution is not unique, as any linear combination of degenerate eigenvectors will also yield an MDS solution with the same strain value.
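To make the note concrete, the following sketch builds a small dissimilarity matrix that violates the triangle inequality, so no Euclidean configuration reproduces it exactly. The matrix values are made up for illustration, the `1e-10` threshold is an arbitrary choice, and the Gram matrix is obtained by double centering with plain LinearAlgebra calls rather than through the package.

```julia
using LinearAlgebra

# Hypothetical dissimilarities: d(1,3) = 3 exceeds d(1,2) + d(2,3) = 2,
# so they cannot be Euclidean distances.
D = [0.0 1.0 3.0;
     1.0 0.0 1.0;
     3.0 1.0 0.0]

n = size(D, 1)
J = I - fill(1 / n, n, n)                 # centering matrix
G = Symmetric(-0.5 * J * (D .^ 2) * J)    # Gram matrix by double centering

λ = eigvals(G)                            # eigenvalues in ascending order
keep = λ .> 1e-10                         # retain only strictly positive eigenvalues

println("eigenvalues:          ", λ)
println("retained for the MDS: ", λ[keep])
```

For this matrix the spectrum contains one negative eigenvalue and one that is zero up to round-off, so only a single coordinate axis would remain after the non-positive subspace is projected out.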
