Add support for: ML-kNN #2606

mchangun · 2013-11-22T03:19:04Z

Add support for the Multi-Label kNN (ML-kNN) algorithm as described in this paper: http://cs.nju.edu.cn/zhouzh/zhouzh.files/publication/pr07.pdf. A brief description of the algorithm from the paper:

"As its name implied, Ml-knn is derived from the popular k-Nearest Neighbor (kNN) algorithm [1]. Firstly, for each test instance, its k nearest neighbors in the training set are identified. Then, according to statistical information gained from the label sets of these neighboring instances, i.e. the number of neighboring instances belonging to each possible class, maximum a posteriori (MAP) principle is utilized to determine the label set for the test instance."
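
For anyone skimming, here is a minimal sketch of the procedure the quote describes, in plain Python with no dependencies. It is not the code offered in this thread and not a scikit-learn API: the function names (`k_nearest`, `mlknn_fit`, `mlknn_predict`), the brute-force Euclidean search, the dense list inputs, and the toy data are all assumptions made for illustration. The smoothing parameter `s` defaults to 1.

```python
from math import dist


def k_nearest(X, query, k, skip=None):
    """Indices of the k training points closest to `query` (brute force)."""
    candidates = (i for i in range(len(X)) if i != skip)
    return sorted(candidates, key=lambda i: dist(X[i], query))[:k]


def mlknn_fit(X, Y, k=3, s=1.0):
    """Estimate smoothed priors and neighbour-count likelihoods per label."""
    m, n_labels = len(Y), len(Y[0])
    # Prior probability that an instance carries label l, smoothed by s.
    prior = [(s + sum(y[l] for y in Y)) / (2 * s + m) for l in range(n_labels)]
    # c1[l][j]: training instances WITH label l whose k neighbours include
    # exactly j instances carrying l; c0[l][j]: same for instances WITHOUT l.
    c1 = [[0] * (k + 1) for _ in range(n_labels)]
    c0 = [[0] * (k + 1) for _ in range(n_labels)]
    for i in range(m):
        neighbours = k_nearest(X, X[i], k, skip=i)  # exclude the point itself
        for l in range(n_labels):
            j = sum(Y[n][l] for n in neighbours)
            table = c1 if Y[i][l] else c0
            table[l][j] += 1

    def smooth(counts):
        total = sum(counts)
        return [(s + c) / (s * (k + 1) + total) for c in counts]

    return {
        "X": X, "Y": Y, "k": k, "prior": prior,
        "like1": [smooth(row) for row in c1],
        "like0": [smooth(row) for row in c0],
    }


def mlknn_predict(model, query):
    """MAP decision per label: compare prior * likelihood for 'has label' vs 'not'."""
    neighbours = k_nearest(model["X"], query, model["k"])
    labels = []
    for l, p1 in enumerate(model["prior"]):
        j = sum(model["Y"][n][l] for n in neighbours)
        score1 = p1 * model["like1"][l][j]
        score0 = (1 - p1) * model["like0"][l][j]
        labels.append(int(score1 > score0))
    return labels


# Toy data (made up): label 0 is on everywhere, label 1 only in the right cluster.
X = [(0.0,), (0.1,), (0.2,), (0.3,), (5.0,), (5.1,), (5.2,), (5.3,)]
Y = [[1, 0], [1, 0], [1, 0], [1, 0], [1, 1], [1, 1], [1, 1], [1, 1]]
model = mlknn_fit(X, Y, k=3)
print(mlknn_predict(model, (0.15,)))  # [1, 0]
print(mlknn_predict(model, (5.15,)))  # [1, 1]
```

Each label is decided independently here; the only things shared across labels are the neighbour set and the smoothing constant. A real implementation would swap the brute-force search for an index and accept sparse input.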

Supporting sparse matrices should be a key requirement: many multi-label tasks are text categorization problems, and these are usually represented as sparse matrices.

ML-kNN is already implemented in Orange (http://orange.biolab.si/docs/latest/reference/rst/Orange.multilabel/#ml-knn-learner), but it would be good to bring it under the scikit-learn framework as well.

arjoly · 2013-11-22T08:13:50Z

Hi @mchangun,

I think this would be a nice addition, and you may be interested in #399 and #970.

mchangun · 2013-12-12T10:15:08Z

I'm just wondering what the next step for this would be? Does someone have to mark it as "feature"? How does it get picked up by one of the developers?

I've coded up my own version of ML-kNN. If someone is interested in taking a look, I will happily send it over.

arjoly · 2013-12-12T13:20:47Z

The best way to add new features is to make a pull request. Please have a look at the contributing guidelines: http://scikit-learn.org/dev/developers/index.html. Be aware that adding a new feature requires more than just writing the plain algorithm: you also need to write tests, code documentation, and narrative documentation.

amueller · 2013-12-14T05:47:11Z

Is this not implemented already? http://scikit-learn.org/dev/modules/multiclass.html lists KNN as being multi-label.

mchangun · 2013-12-14T16:14:29Z

@amueller Where in the link do you see that? I can only find this:

Inherently multiclass: Naive Bayes, sklearn.lda.LDA, Decision Trees, Random Forests, Nearest Neighbors.

I.e. multiclass, not multilabel.

I've been looking through the sklearn docs for multi-label support and, at the moment, it seems to be supported only via OneVsRest / LabelBinarizer.

arjoly · 2013-12-15T10:08:52Z

> Is this not implemented already? http://scikit-learn.org/dev/modules/multiclass.html lists KNN as being multi-label.

Yes, binary relevance (one-vs-rest) is already implemented. However, ML-KNN proposes to perform prediction in a Bayesian fashion, using Bayes' rule, as in #399 and #970.

medhini · 2016-03-20T16:13:13Z

Has this been implemented already? Is the issue still open?

bhaveshoswal · 2016-05-25T07:33:07Z

@mchangun Could you share your ML-KNN code? It would be a great help, as I am working on a multi-label problem. Thanks.
email id = oswal.bhavesh2010@gmail.com

jnothman · 2016-05-25T07:43:04Z

The issue is still open, @medhini; it has not been implemented (the implementation at #970 is not multi-label). And no, we don't have code for you, @bhaveshoswal.

medhini · 2016-05-25T12:12:00Z

I would like to take up this issue and implement ML-kNN. Can it be assigned to me?

jnothman · 2016-05-25T12:42:29Z

As far as I can tell, @medhini, you would be welcome. We've rarely used the 'assignee' feature, but please write some tests, perhaps, open a WIP pull request and show us it's actually going to happen.

amueller · 2016-10-11T02:24:34Z

@jnothman but our KNN is multi-label, right? Even multi-output multi-class. Or is this about something else?
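
To make the question concrete, here is a small sketch of `KNeighborsClassifier` being fit on a 2-D label indicator matrix and predicting every label at once. The toy data is invented for illustration and the example assumes a reasonably recent scikit-learn.

```python
from sklearn.neighbors import KNeighborsClassifier

# Toy data (made up): one feature, two binary labels per sample.
X = [[0.0], [0.1], [0.2], [0.3], [5.0], [5.1], [5.2], [5.3]]
Y = [[1, 0], [1, 0], [1, 0], [1, 0], [0, 1], [1, 1], [1, 1], [1, 1]]

# A 2-D y is treated as multi-output: one neighbour vote per column.
clf = KNeighborsClassifier(n_neighbors=3).fit(X, Y)
pred = clf.predict([[0.12], [5.18]])
print(pred.tolist())  # [[1, 0], [1, 1]]
```

This is an independent majority vote among the neighbours for each label; it does not apply the prior and likelihood weighting that ML-kNN adds on top of the neighbour counts.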

jnothman · 2016-10-11T02:43:55Z

I didn't look through this carefully enough. #2606 (comment) above suggests ML-KNN differs in its use of Bayesian priors, but I suppose the use of Bayesian priors is independent of the multilabelness? I haven't looked into the Bayesian priors issue, but it looks interesting. Presumably, though, this issue adds little.

jnothman · 2016-10-11T02:45:16Z

Or does ML-KNN model covariance between the multiple labels? I think someone will need to read the paper to work out whether this is valuable.

amueller · 2016-10-27T20:38:09Z

No covariance, but it uses global class probabilities. I wouldn't call that Bayesian priors. The paper is really obscurely written. There's a smoothing parameter for the class distribution, which is set to 1.
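
For reference, the smoothed estimates can be written out as below. This is a paraphrase of the paper's estimates with notation chosen here, so treat the symbols as assumptions: $m$ is the number of training instances, $k$ the number of neighbours, $s$ the smoothing parameter, $y_i(l) \in \{0, 1\}$ indicates whether training instance $i$ has label $l$, and $c_l[j]$ (respectively $c'_l[j]$) counts the training instances with (respectively without) label $l$ that have exactly $j$ of their $k$ neighbours carrying $l$.

```latex
% Global (smoothed) probability that an instance has label l
P(H_1^l) = \frac{s + \sum_{i=1}^{m} y_i(l)}{2s + m},
\qquad
P(H_0^l) = 1 - P(H_1^l)

% Smoothed likelihood of seeing exactly j neighbours with label l
P(E_j^l \mid H_1^l) = \frac{s + c_l[j]}{s\,(k + 1) + \sum_{p=0}^{k} c_l[p]},
\qquad
P(E_j^l \mid H_0^l) = \frac{s + c'_l[j]}{s\,(k + 1) + \sum_{p=0}^{k} c'_l[p]}

% Decision for a test instance with j of its k neighbours carrying l
\hat{y}(l) = \arg\max_{b \in \{0, 1\}} \; P(H_b^l)\, P(E_j^l \mid H_b^l)
```

With $s = 1$ this is Laplace smoothing. The first line is the "global class probability" part; nothing in these estimates couples one label to another.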

gerdinard · 2018-01-10T11:10:36Z

Hello guys. Is this still on?

I have finished an implementation of ML-kNN based on sklearn's kNN and the original paper "ML-KNN: A lazy learning approach to multi-label learning".

jnothman · 2018-01-10T11:33:33Z

A PR is welcome, as is a summary of the algorithm, but expect the merge process to be slow.

Oktai15 · 2018-10-07T19:24:09Z

@gerdinard Please share your implementation if possible.

sandeepeecs · 2018-11-03T17:09:40Z

> Hello guys. Is this still on?
>
> I have finished an implementation of ML-kNN based on sklearn's kNN and the original paper "ML-KNN: A lazy learning approach to multi-label learning".

Can you share a link to your implementation?

gerdinard · 2019-07-19T14:15:13Z

Hello. Here is a link to my Python ML-kNN implementation:
https://github.com/gerdinard/MLkNN.git

adrinjalali · 2024-04-17T15:30:23Z

Closing as it's unlikely we'd be adding it to sklearn, and it's better existing in a separate project/repo.

arjoly mentioned this issue May 22, 2015: Classifier chains & Homer algorithm for multilabel classification #4759 (Open)

webber26232 mentioned this issue Aug 24, 2017: [MRG+1] Add predict_proba(X) and outlier handler for RadiusNeighborsClassifier #9597 (Merged)

cmarmo added the help wanted and module:cluster labels Mar 1, 2021

cmarmo added the Low Priority label and removed the help wanted label Aug 9, 2022

adrinjalali closed this as not planned Apr 17, 2024
