
FEA Introduce PairwiseDistances, a generic back-end for pairwise_distances #25561


Conversation

Vincent-Maladiere
Contributor

@Vincent-Maladiere Vincent-Maladiere commented Feb 7, 2023

Reference Issues/PRs

Towards #23958

What does this implement/fix? Explain your changes.

This simplifies the original implementation of PairwiseDistance by @jjerphan, with the following differences:

  • PairwiseDistance{32,64} doesn't subclass BaseDistancesReduction{32,64} anymore.
  • This makes it possible to add _parallel_on_{X,Y} methods to PairwiseDistance{32,64}: these methods are decorated with @final in BaseDistancesReduction{32,64} and thus can't be overridden by subclasses.
  • This also removes the chunked computation mechanism by considering only the case chunk_size = 1, as proposed by @ogrisel in this comment.
  • This doesn't implement the Euclidean specialization yet to make benchmarks simpler.

Following this benchmark, we found that this PR yields a significant performance regression when n_jobs = 1 and an improvement when n_jobs > 1, for both euclidean and manhattan distances:

euclidean

manhattan
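
For reference, a minimal timing sketch along the lines of this comparison (not the asv benchmark script actually used here; sizes and dtypes are illustrative) could look like:

import time

import numpy as np
from sklearn.metrics import pairwise_distances

rng = np.random.RandomState(0)
X = rng.rand(2_000, 10)
Y = rng.rand(2_000, 10)

for metric in ("euclidean", "manhattan"):
    for n_jobs in (1, -1):
        tic = time.perf_counter()
        pairwise_distances(X, Y, metric=metric, n_jobs=n_jobs)
        print(f"{metric} n_jobs={n_jobs}: {time.perf_counter() - tic:.3f}s")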

Any other comments?

As suggested by @jjerphan, decorating DistanceMetric subclasses with @final could alleviate some of the overhead and make this implementation competitive with main when n_jobs=1.
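
As a rough illustration of that idea, marking a distance class as final in Cython's pure-Python mode looks like the sketch below (a toy class for illustration only, not the actual DistanceMetric implementation):

import cython


@cython.final   # prevents subclassing, letting Cython devirtualize method calls
@cython.cclass  # compiled as an extension type when built with Cython
class ToyManhattanDistance:
    def dist(self, x, y) -> float:
        # naive reference body; the real implementation works on typed buffers
        return sum(abs(xi - yi) for xi, yi in zip(x, y))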

@Vincent-Maladiere Vincent-Maladiere changed the base branch from main to feature/PairwiseDistances February 7, 2023 12:32
Member

@jjerphan jjerphan left a comment

Hi @Vincent-Maladiere, it looks like you are on the right track. 👍

Do you think you can rerun the benchmarks on other cases, especially:

  • metric=manhattan solely
  • on the dense-dense and csr-csr combination
  • on np.float64 and np.float32 data
  • on 1 thread and on 16 threads

and provide results? 🙂

Also, as discussed in our 1:1, I think we had better skip the n_jobs=1 case for now and work on it as part of a second PR.

Moreover, this PR needs a whatsnew entry for 1.3.

Finally, depending on other maintainers' opinions, we might want to use a feature branch (as recently chosen by @Vincent-Maladiere with feature/PairwiseDistances). In our opinion, this would allow validating parts of the parameter combinations independently from one another, easing review and integration while avoiding partial work being integrated into main.

What do other reviewers think?

@@ -33,79 +29,3 @@ def _chi2_kernel_fast(floating[:, :] X,
if nom != 0:
res += denom * denom / nom
result[i, j] = -res


def _sparse_manhattan(
Member

I think this function is the simplest and the most frugal implementation for computing the pairwise manhattan distances on a pair of CSR datasets. Yet I think supporting all of the combinations (of metrics, {dense, sparse}², float32 and float64, etc.) by having such implementations is hardly maintainable.

The abstractions that we develop with Cython have a cost (that we can estimate against this implementation) but I think they ease future extensions.

What are people thinking of this? Are the Cython abstractions reasonable?

Member

As you said elsewhere, I think we need a few comparative benchmarks (dense and sparse) to answer that question.

I agree the tempita Cython-class oriented code offers more flexibility to support all combinations of memory representation / dtypes / metrics and would lean towards removing special cases if the performance impact is ok.

@ogrisel
Member

ogrisel commented Feb 8, 2023

Also, as discussed in our 1:1, I think we had better skip the n_jobs=1 case for now and work on it as part of a second PR.

I'm not sure how I feel about merging a PR into main with a performance regression in the single-threaded case.

It's true that, on the positive side, most workloads are run with at least 2 (or 4) threads per node nowadays. So maybe nobody would notice the performance regression in practice. But still...

Member

@ogrisel ogrisel left a comment

As explained below, let's focus this PR on the non-Euclidean case so that we can merge it without implementing the GEMM trick / Euclidean specialization, without risking a performance regression in the single-threaded case, and without changing the behavior or handling the deprecation of the precomputed X_norm_squared / Y_norm_squared parameters.

Then we can study the Euclidean case in a follow-up PR.

I am not 100% convinced the Euclidean specialization is needed if we are careful to benchmark equivalent things by correctly controlling the impact of OMP_NUM_THREADS / OPENBLAS_NUM_THREADS both in main (or 1.2.1) and in the Cython PR with the Euclidean case enabled. But let's delay that for later and finalize the non-Euclidean cases first.

)
with pytest.raises(AssertionError):
assert_allclose(wrong_D, D1)

Member

This would no longer pass because X_norm_squared and Y_norm_squared are no longer used as long as we do not implement the Euclidean specialization / GEMM trick, right?

It seems like a potentially problematic silent change. If we really decide not to do the Euclidean specialization, we should deprecate the X_norm_squared and Y_norm_squared parameters everywhere in the public API.

If we plan to re-introduce the Euclidean specialization in the Cython code, then we should just comment out this assertion and explain that it should be re-enabled once we implement the Euclidean specialization in Cython.

Or even better, we could make this PR focus on the non-Euclidean cases only, and make sure that the Cython code is not "usable for" metric="(sq)euclidean" / metric="minkowski" with p=2 for now, so as to keep the existing NumPy-based code with the GEMM trick for the time being.

This way we would not introduce a performance regression for the single-threaded case for now.

@jjerphan
Member

jjerphan commented Feb 8, 2023

To answer this remark in #25561 (review):

I'm not sure how I feel about merging a PR into main with a performance regression in the single-threaded case.

As proposed in #25561 (review), we can simultaneously:

  • use a feature branch to delay the inspection and the resolution
  • fall back on the previous implementation when n_jobs=1

Member

@jjerphan jjerphan left a comment

A few comments to complete @ogrisel's remark.

@Vincent-Maladiere
Contributor Author

Vincent-Maladiere commented Feb 8, 2023

Running this asv benchmark with this script on 1 and 16 cores confirms our intuition that performance improves for n_cores > 1, for both 32-bit and 64-bit data, on dense-dense matrices.

As discussed IRL with @jjerphan, we have performance regressions for the sparse-sparse, dense-sparse, and sparse-dense cases, which could pave the way for the next PRs on this feature branch.

       before           after         ratio
     [7917117e]       [ddc464ba]
     <main>           <pull/25561/head>
-         822+/-0ms          310+/-0ms     0.38  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'dense', 'dense', 16)
-         823+/-0ms          265+/-0ms     0.32  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'dense', 'dense', 16)
       before           after         ratio
     [7917117e]       [ddc464ba]
     <main>           <pull/25561/head>
+         1.07+/-0s          3.39+/-0s     3.17  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'sparse', 'sparse', 1)
+         1.10+/-0s          3.38+/-0s     3.08  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'sparse', 'sparse', 1)
+         1.34+/-0s          3.66+/-0s     2.73  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'sparse', 'dense', 1)
+         185+/-0ms          505+/-0ms     2.73  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'sparse', 'sparse', 16)
+         193+/-0ms          506+/-0ms     2.61  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'sparse', 'sparse', 16)
+         1.61+/-0s          3.61+/-0s     2.24  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'sparse', 'dense', 1)
+         823+/-0ms          1.76+/-0s     2.14  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'dense', 'dense', 1)
+         257+/-0ms          540+/-0ms     2.11  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'sparse', 'dense', 16)
+         822+/-0ms          1.68+/-0s     2.04  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'dense', 'dense', 1)
+         290+/-0ms          566+/-0ms     1.96  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'sparse', 'dense', 16)
+         2.38+/-0s          4.52+/-0s     1.90  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'dense', 'sparse', 1)
+         339+/-0ms          611+/-0ms     1.80  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float64'>, 'dense', 'sparse', 16)
+         2.52+/-0s          4.53+/-0s     1.80  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'dense', 'sparse', 1)
+         365+/-0ms          644+/-0ms     1.76  time_pairwise_distances(10000, 10000, 10, 'manhattan',  <class 'numpy.float32'>, 'dense', 'sparse', 16)

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE DECREASED.

Edit: after an IRL discussion with @jjerphan, we think this PR brings value because:

  • Currently, on the main branch, manhattan is the only distance with sparse-sparse support. It relies on the most efficient Cython implementation, with no dispatch or abstraction cost, which puts the observed performance regression into perspective.
  • Moreover, the remaining distance metrics don't provide sparse-sparse, sparse-dense, and dense-sparse support, nor does any other library in the scientific Python ecosystem.
  • Therefore, this PR not only improves performance in the dense-dense case but also unifies the sparse-sparse, sparse-dense, and dense-sparse support for all distance metrics.

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
Member

@jjerphan jjerphan left a comment

Following the remark given by @Vincent-Maladiere in #25561 (comment) (see the edit), I think this PR has more value than I initially thought, since it enlarges the feature scope for the support of sparse and dense dataset pairs. Moreover, I think we can accept a performance regression for computing Manhattan distances on pairs of CSR datasets.

To me, the following items have to be addressed so that this PR can be merged into feature/PairwiseDistances (or alternatively into main):

What do others think?

@ogrisel
Member

ogrisel commented Feb 10, 2023

Therefore, this PR not only improves performance in the dense-dense case but also unifies the sparse-sparse, sparse-dense, and dense-sparse support for all distance metrics.

This does not seem to always be the case in the benchmark results: there are dense-dense cases with a 2x slowdown. How do you explain this?

Edit: those are the sequential cases.

Could you please re-run the benchmarks with 2 threads?

Edit: actually, this is already visible in the second plot at the beginning of the PR.

@ogrisel
Member

ogrisel commented Feb 10, 2023

+1 for updating the changelog as quickly as possible in PRs. It helps reviewers better understand the user-facing scope of a PR by making it explicit.

@ogrisel
Member

ogrisel commented Feb 10, 2023

modify PairwiseDistances.is_usable_for and PairwiseDistances.valid_metrics so that the following cases aren't yet treated:
n_jobs=1

I suppose you mean the effective_n_threads measured via OpenMP, not the n_jobs passed by the user to a public-facing API that would be meant to control the use of joblib?

If so, +1. This means that, at least in the short term, we keep on using the SciPy metrics implementation in the dense-dense, single-threaded case and use the new Cython infrastructure for all the other cases.
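
A rough sketch of such a guard (illustrative only: _openmp_effective_n_threads is scikit-learn's private helper, and the function below is not actual scikit-learn code) could be:

from sklearn.utils._openmp_helpers import _openmp_effective_n_threads


def use_new_backend(metric):
    # Keep the existing NumPy/GEMM-based path for (squared) Euclidean metrics and
    # for the effectively single-threaded case, to avoid the regressions discussed above.
    if metric in ("euclidean", "sqeuclidean"):
        return False
    return _openmp_effective_n_threads() > 1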

@Vincent-Maladiere
Contributor Author

Vincent-Maladiere commented Feb 11, 2023

Our new benchmark is consistent with what we expected:

euclidean

manhattan

Note that we don't use n_jobs in this benchmark anymore, only limiting threads through threadpoolctl, which alleviates some of the overhead on main that led to the slowdown for n_jobs = 2 observed in the previous benchmark.
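
For instance, capping the thread count with threadpoolctl instead of passing n_jobs looks like this (illustrative sizes, not the actual benchmark script):

import numpy as np
from threadpoolctl import threadpool_limits

from sklearn.metrics import pairwise_distances

rng = np.random.RandomState(0)
X = rng.rand(2_000, 10)

with threadpool_limits(limits=2):  # caps OpenMP/BLAS thread pools at 2 threads
    D = pairwise_distances(X, metric="manhattan")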

@jjerphan jjerphan changed the title FEA Introduce PairwiseDistance FEA Introduce PairwiseDistances, a general back-end for pairwise_distances Feb 13, 2023
@jjerphan jjerphan changed the title FEA Introduce PairwiseDistances, a general back-end for pairwise_distances FEA Introduce PairwiseDistances, a generic back-end for pairwise_distances Feb 13, 2023
Member

@jjerphan jjerphan left a comment

A few last remarks and suggestions to complete #25561 (review), which I corrected after @ogrisel's remark in #25561 (comment).

@Vincent-Maladiere Vincent-Maladiere force-pushed the feat/pairwise_distances-pdr-backend branch from b289d9f to a569758 February 14, 2023 17:22
@Micky774
Contributor

Hey @Micky774 ! I'm not working on it currently but I can resume this PR in the next 2 weeks if that sounds good to you?

Of course -- no rush either. I just wanted to make sure the project wasn't entirely set aside :)

@Micky774 Micky774 self-requested a review May 19, 2023 15:21
@jjerphan
Member

jjerphan commented Jun 4, 2023

Hi @Vincent-Maladiere,

#26471 (which was recently merged into main) introduced a separation between the interface of DistanceMetric and its implementations. This created conflicts with this PR.

Feel free to let us know if you need help resolving them. :)

@Vincent-Maladiere
Contributor Author

On it, let's try to have something working by the end of the week :)

@Micky774
Contributor

It seems that the merge history here got messed up @Vincent-Maladiere

@jjerphan
Member

@Vincent-Maladiere: if you want to get your branch back to its previous state, you can find the commit of the previous state using the reflog and reset the branch to it:

# Get the latest changes from the base branch (which is up-to-date)
git checkout feature/PairwiseDistances
git pull upstream feature/PairwiseDistances

# Checkout to the previous commit
git checkout feat/pairwise_distances-pdr-backend
git reflog # find the commit
git checkout <commit>

# Force the branch of this PR back to this commit
git branch -f feat/pairwise_distances-pdr-backend

# Checkout back to this branch
git checkout feat/pairwise_distances-pdr-backend

# Update the branch, you might have conflicts to solve
git merge feature/PairwiseDistances

# After fixing conflicts, run isort on the files that you
# have modified via pre-commit.
pre-commit install
pre-commit run

# Finish the merge
git merge --continue

# Inspect the diff with the base branch
git diff feature/PairwiseDistances...

# If the diff is clear, force-push the branch 
git push -f fork feat/pairwise_distances-pdr-backend

@Vincent-Maladiere
Contributor Author

Hey! Yes, sorry about that: after rebasing on main, it turned out that feature/PairwiseDistances hadn't been synced for a while. @glemaitre has rebased feature/PairwiseDistances on main. So, after fixing the conflicts, hopefully the large diffs will vanish.

Thanks @jjerphan for the git branch -f trick, I'll try to fix the conflicts and simply push to the target feature branch one more time.

@github-actions

github-actions bot commented Jun 22, 2023

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 54600aa

@Vincent-Maladiere
Contributor Author

I'm running the benchmark again on {Dense, CSR} x {Dense, CSR} -- {manhattan, euclidean} distance -- {1, 2, 4, 6} threads, to make sure our conclusion is the same.

@Vincent-Maladiere
Contributor Author

Blue is main, orange is this branch.

We obtain a slight speed-up for Euclidean distances in the Dense x CSR case (why?) and no difference otherwise, which matches our expectations, since we're not using PairwiseDistances for the Euclidean distance yet.

(benchmark plots: euclidean)

However, the Manhattan distance is significantly worse for Dense x CSR and CSR x CSR, and in the n_threads = 1 case for Dense x Dense.

(benchmark plots: manhattan)

The CSR implementations we use are:

main: metrics._pairwise_fast._sparse_manhattan

branch: metrics._dist_metrics.ManhattanDistance.dist_csr

The main difference between these implementations is where parallelization happens: within the _sparse_manhattan function on main, versus outside dist_csr (in _parallel_on_X) on this branch.

Therefore, this branch makes more function calls to the distance routine, which might explain the slowdown.

Do you have any other ideas to explain the performance decrease? WDYT?

@Vincent-Maladiere
Contributor Author

By the way, our tests are failing because of a connection error to conda-forge.

@Micky774
Contributor

Micky774 commented Jun 23, 2023

@Vincent-Maladiere could you generate a speedscope-compatible profile report for both methods? It'll help us spot any less-than-obvious costs, as well as help us confirm valid assumptions. You can see this comment for instructions: #26316 (comment)

@Vincent-Maladiere
Contributor Author

Okay, let's profile this. TBH, I was hoping that you or @jjerphan knew what might have impaired the benchmark so that I don't need to run py-spy and speedscope 😅

@Micky774
Contributor

Micky774 commented Jul 4, 2023

Okay, after some initial profiling, it looks like we're running into the same problems we had with the refactor attempts for the entire backend. We forfeit a lot of performance due to indirection and function-call overhead with this approach. I didn't expect it to be this dramatic, but alas.

When SparseSparseDataset64.dist is called, despite it being a rather thin wrapper around self.distance_metric.dist_csr, around 50% of the time is spent purely pushing the arguments to the stack, while the rest is the actual computation (a significant portion of which is also taken up by loading the arguments). Overall, only ~32% of time is actually spent on core computation.

This can be partially mitigated by only passing memoryviews by reference rather than value (dropping peripheral information like shape) wherever call overhead is a significant factor (e.g. in tight loops). Specifically here, it would mean changing DistanceMetric.{r}dist_csr to accept a pointer to self.{X, Y}_indices rather than the memoryview itself (#26765). This helps bridge the gap. The remaining performance loss is due to the inability to inline the computations.

I'm not sure that there is a way around this without copy-and-pasting entire swathes of the implementations. Options like code-generation and templating could solve this but at the cost of build time and binary size, along with the obvious maintenance burden.

Abstraction is directly in contest with performance here.

With that being said, this is really only a problem for the manhattan distance, because we have a Cython specialization of it right now. We could keep using the custom specialization as a special case (solely to avoid a performance regression) and defer to the PairwiseDistances backend for all other cases.
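
A minimal sketch of that dispatch idea (hypothetical: new_backend stands in for the PairwiseDistances back-end and is not an existing scikit-learn function):

from scipy.sparse import issparse
from sklearn.metrics.pairwise import manhattan_distances


def compute_manhattan(X, Y, new_backend=None):
    if new_backend is None or (issparse(X) and issparse(Y)):
        # keep the specialized _sparse_manhattan fast path (reached via manhattan_distances)
        return manhattan_distances(X, Y)
    return new_backend(X, Y)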

@Micky774
Contributor

@Vincent-Maladiere would you like to keep working on this PR, or would you prefer I take over efforts here and let you focus on other things? :)

@Micky774
Contributor

Micky774 commented Aug 8, 2023

Superseded by #26983

Please feel free to leave comments/suggestions on the new PR. Thank you @Vincent-Maladiere for all the work you've done so far!

@Micky774 Micky774 closed this Aug 8, 2023
@Micky774 Micky774 added the Superseded PR has been replace by a newer PR label Aug 8, 2023