FEA Implement classical MDS #31322

dkobak · 2025-05-06T13:50:42Z

Fixes #15272. Supersedes #22330.

This PR implements classical MDS, also known as principal coordinates analysis (PCoA) or Torgerson's scaling, see https://en.wikipedia.org/wiki/Multidimensional_scaling#Classical_multidimensional_scaling. As discussed in #22330, it is implemented as a new class ClassicalMDS.
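
For readers unfamiliar with the method, here is a minimal NumPy sketch of the textbook algorithm (double centering followed by an eigendecomposition). It is not the code from this PR; the function name `classical_mds_sketch` is hypothetical and used only for illustration.

```python
import numpy as np


def classical_mds_sketch(D, n_components=2):
    """Textbook classical MDS on a symmetric dissimilarity matrix D (n x n)."""
    n = D.shape[0]
    # Double-center the squared dissimilarities: B = -1/2 * J @ D**2 @ J
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ (D ** 2) @ J
    # eigh returns eigenvalues in ascending order; keep the largest ones
    eigvals, eigvecs = np.linalg.eigh(B)
    order = np.argsort(eigvals)[::-1][:n_components]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Scale eigenvectors by the square roots of the (clipped) eigenvalues
    return eigvecs * np.sqrt(np.clip(eigvals, 0, None))


rng = np.random.default_rng(0)
X = rng.normal(size=(20, 5))
# Euclidean distance matrix of the toy data
D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
Z = classical_mds_sketch(D, n_components=2)
```

With Euclidean distances, the embedding should agree with PCA up to the sign of each component, which is why the first two panels of the demonstration below look alike.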

Simple demonstration:

import matplotlib.pyplot as plt

from sklearn.datasets import load_iris
from sklearn.manifold import ClassicalMDS
from sklearn.decomposition import PCA

X, y = load_iris(return_X_y=True)

Z1 = PCA(n_components=2).fit_transform(X)
Z2 = ClassicalMDS(n_components=2, metric="euclidean").fit_transform(X)
Z3 = ClassicalMDS(n_components=2, metric="cosine"   ).fit_transform(X)
Z4 = ClassicalMDS(n_components=2, metric="manhattan").fit_transform(X)

fig, axs = plt.subplots(nrows=2, ncols=2, figsize=(6, 6), layout="constrained")

axs.flat[0].scatter(Z1[:,0], Z1[:,1], c=y)
axs.flat[0].set_title("PCA")

axs.flat[1].scatter(Z2[:,0], Z2[:,1], c=y)
axs.flat[1].set_title("Classical MDS, Euclidean dist.")

axs.flat[2].scatter(-Z3[:,0], Z3[:,1], c=y)
axs.flat[2].set_title("Classical MDS, cosine dist.")

axs.flat[3].scatter(Z4[:,0], Z4[:,1], c=y)
axs.flat[3].set_title("Classical MDS, Manhattan dist.")

~~Classical MDS is also set as default initialization for metric/non-metric MDS in the MDS() class.~~

~~For consistency, this PR also adds support for non-Euclidean metrics to the MDS class.~~

github-actions · 2025-05-06T13:51:41Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: fe76707.

dkobak · 2025-05-08T09:25:34Z

Pinging @antoinebaker for a review :-) I have been doing some updates to this PR but am now finished with it.

antoinebaker · 2025-05-12T08:32:18Z

Thanks for the PR @dkobak !

Could you maybe use this PR to implement ClassicalMDS only, and do the enhancements of MDS in a separate follow-up PR? I think it will ease the reviewing and merging process.

dkobak · 2025-05-12T08:50:02Z

> Could you maybe use this PR to implement ClassicalMDS only, and do the enhancements of MDS in a separate follow-up PR? I think it will ease the reviewing and merging process.

All right. I took all changes to the MDS class out of this PR now. I will wait until this PR is merged and then do a follow-up PR to change the MDS class.

antoinebaker · 2025-05-28T13:02:05Z

Could you please add ClassicalMDS to the common test suite for estimators? This can be done by adding an entry to the INIT_PARAMS dict:

scikit-learn/sklearn/utils/_test_common/instance_generator.py

Lines 185 to 187 in 4493f86

    
           INIT_PARAMS = { 
        
               AdaBoostClassifier: dict(n_estimators=5), 
        
               AdaBoostRegressor: dict(n_estimators=5),

Then it will be tested for common checks on estimators in sklearn/tests/test_common.py::test_estimators

dkobak · 2025-05-28T13:14:32Z

Hi @antoinebaker. I think the INIT_PARAMS dictionary is only needed to provide non-default params (like a small number of iterations), which for ClassicalMDS is not needed. An estimator does not have to be in INIT_PARAMS in order to be tested, it will simply be tested with default params in that case. Let me know if I am wrong.
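
The fallback described above can be sketched as a plain dictionary lookup. This is a hypothetical stand-in (`INIT_PARAMS_SKETCH` and `params_for_testing` are made-up names), not scikit-learn's actual test machinery.

```python
# Hypothetical stand-in for the INIT_PARAMS lookup; not scikit-learn's real code.
INIT_PARAMS_SKETCH = {
    "AdaBoostClassifier": dict(n_estimators=5),
    "AdaBoostRegressor": dict(n_estimators=5),
}


def params_for_testing(estimator_name):
    """Return registered overrides, or an empty dict meaning 'use defaults'."""
    return INIT_PARAMS_SKETCH.get(estimator_name, {})


overrides = params_for_testing("AdaBoostClassifier")
defaults = params_for_testing("ClassicalMDS")
```

Under this reading, an estimator without an entry is still instantiated and checked, just with its default constructor arguments.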

antoinebaker

Here is a first round of review comments.

sklearn/manifold/_classical_mds.py

antoinebaker · 2025-05-28T14:07:16Z

> Hi @antoinebaker. I think the INIT_PARAMS dictionary is only needed to provide non-default params (like a small number of iterations), which for ClassicalMDS is not needed. An estimator does not have to be in INIT_PARAMS in order to be tested, it will simply be tested with default params in that case. Let me know if I am wrong.

Ah my bad, I think you're right. If ClassicalMDS appears in the sklearn/tests/test_common.py::test_estimators suite that's all good.

doc/whats_new/upcoming_changes/sklearn.manifold/31322.feature.rst

adrinjalali · 2025-06-03T13:27:34Z

I'm not sure if we discussed this, but I favor this comment (#22330 (comment)) (except the default value) as an overall API, instead of introducing a new class. Any blockers for doing so?

dkobak · 2025-06-03T13:38:41Z

@adrinjalali: Yes, we did discuss it. Just below the comment you linked to, @antoinebaker gave detailed reasons for why he prefers a separate class, please see here: #22330 (comment). He convinced me, and you wrote "I'm happy with the suggestions here" (#22330 (comment)), which is why I implemented a separate class...

adrinjalali · 2025-06-03T13:58:42Z

Hmm. Yeah fair enough. Just to avoid future surprises, maybe @lorentzenchr @GaelVaroquaux could also give their opinion?

antoinebaker · 2025-06-04T08:35:14Z

The TLDR of #22330 (comment) is that ClassicalMDS (Principal Coordinates Analysis) implemented in this PR and the current MDS (SMACOF) implemented in sklearn don't have much in common except their names (different algorithms, objectives, arguments, attributes).
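
To make the difference in objectives concrete, the standard textbook formulations (not taken from this PR) can be written as follows, where $D^{(2)}$ denotes the matrix of squared dissimilarities $d_{ij}^2$ and $Z \in \mathbb{R}^{n \times k}$ is the embedding with rows $z_i$:

```latex
% Classical MDS (PCoA): minimize "strain"; solved by an eigendecomposition of B
B = -\tfrac{1}{2}\, J D^{(2)} J, \qquad J = I - \tfrac{1}{n}\mathbf{1}\mathbf{1}^\top, \qquad
\min_{Z} \; \lVert B - Z Z^\top \rVert_F^2

% Metric MDS (SMACOF): minimize "stress"; solved by iterative majorization
\min_{Z} \; \sum_{i<j} \bigl( d_{ij} - \lVert z_i - z_j \rVert \bigr)^2
```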

EDIT: removed outdated comment.

antoinebaker · 2025-06-23T13:59:51Z

We briefly discussed the API in the sklearn dev meeting and approved the current state of this PR:

- adding a new ClassicalMDS class instead of an additional parameter to the current MDS class
- using the metric and metric_params parameters for the dissimilarity computation

I'll try to do a final review this week.
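
As an illustration of the second point, here is a sketch of how a `metric` and `metric_params` pair is commonly turned into a dissimilarity matrix in scikit-learn, using `sklearn.metrics.pairwise_distances`. Whether ClassicalMDS forwards the parameters in exactly this way is an assumption, not something stated in this thread.

```python
import numpy as np
from sklearn.metrics import pairwise_distances

rng = np.random.default_rng(0)
X = rng.normal(size=(10, 4))

metric = "minkowski"
metric_params = {"p": 1}  # extra keyword arguments passed on to the metric

# Assumed forwarding pattern: metric_params are unpacked as keyword arguments
D = pairwise_distances(X, metric=metric, **metric_params)

# Minkowski distance with p=1 is the Manhattan distance
D_manhattan = pairwise_distances(X, metric="manhattan")
```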

antoinebaker

A couple of nitpicks. Otherwise LGTM!

However, I couldn't see (and check) the rendered doc of the ClassicalMDS class in the CI. Do you see it when building the doc locally, @dkobak?

doc/whats_new/upcoming_changes/sklearn.manifold/31322.feature.rst

doc/modules/manifold.rst

sklearn/manifold/_classical_mds.py

dkobak · 2025-06-26T07:31:40Z

@antoinebaker Thanks. I committed all your suggestions.

> However, I couldn't see (and check) the rendered doc of the ClassicalMDS class in the CI. Do you see it when building the doc locally, @dkobak?

Hmm. I have not tried that (not sure how to do it, don't have experience with that).

Regarding the docstring for metric, I simply copied it from somewhere, maybe from the NearestNeighbors class (not sure anymore). I think your suggestion was fine.

antoinebaker · 2025-06-26T08:35:49Z

@dkobak I think the ClassicalMDS doc isn't rendered because we need to add it to API_REFERENCE:

scikit-learn/doc/api_reference.py

Lines 693 to 696 in 4daff41

    
           "autosummary": [ 
        
               "Isomap", 
        
               "LocallyLinearEmbedding", 
        
               "MDS",

dkobak · 2025-06-26T08:38:41Z

I guess you are right! Added.

codecov · 2025-06-27T13:52:48Z

❌ Unsupported file format

Upload processing failed due to unsupported file format. Please review the parser error message:

Error deserializing json

Caused by: expected value at line 1 column 1

For more help, visit our troubleshooting guide.

adrinjalali

This also needs some sort of an example and a user guide. As for the example, it would be nice if you could improve plot_mds and include this one there as well, and link from the relevant places.

doc/api_reference.py

doc/whats_new/upcoming_changes/sklearn.manifold/31322.feature.rst

dkobak · 2025-07-03T09:38:01Z

Regarding the user guide, it is already there.

Regarding examples, good point, I have now added classical MDS (and also non-metric MDS) to plot_compare_methods.py, plot_lle_digits.py, plot_manifold_sphere.py, plot_mds.py.

(Edit: checked the rendered docs and examples are fine.)

Implement Classical MDS

33b9600

github-actions bot added the module:manifold label May 6, 2025

Rename what's new file

5184552

dkobak mentioned this pull request May 6, 2025

ENH Add eigh as a solver in MDS #22330

Open

dkobak added 2 commits May 6, 2025 16:22

validate input data

c215d0a

describe attributes

d6773c7

dkobak mentioned this pull request May 6, 2025

sklearn MDS vs skbio PCoA #15272

Open

dkobak added 4 commits May 6, 2025 22:06

Merge branch 'main' into classical-mds

b4f49d5

Make classical MDS default init for MDS

f018564

Fix tests

99854db

Merge branch 'main' into classical-mds

8134b9f

Take all changes to the MDS class out of this PR

bafac04

dkobak changed the title ~~Implement classical MDS~~ FEA Implement classical MDS May 14, 2025

antoinebaker reviewed May 28, 2025

antoinebaker reviewed May 28, 2025

dkobak added 3 commits May 28, 2025 17:00

Add metric_params

c408bd3

Fix kwargs

77732f6

Rename kwargs

804c524

Rename dissimilarity into metric

0999113

Add a test for metric_params

da28a2d

antoinebaker approved these changes Jun 25, 2025

Apply suggestions from code review

509c6d1

Co-authored-by: antoinebaker <antoinebaker@users.noreply.github.com>

lint

dbaccd0

Add new class to the API reference

2f66df3

dkobak added 2 commits June 27, 2025 09:25

Improve tests

737aaea

Merge branch 'main' into classical-mds

eb37e39

adrinjalali reviewed Jul 2, 2025

dkobak and others added 3 commits July 3, 2025 11:02

Update doc/api_reference.py

a088ac4

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Rename feature into major-feature

5b17eb8

Add Classical MDS to examples

f3fdf51

dkobak added 3 commits July 3, 2025 12:00

Fix one example

ab479c6

Fix examples

5e6c3fe

More example edits

fe76707
