ClassifierChain should only accept multilabel-indicator #19853

glemaitre · 2021-04-09T17:34:43Z

By trying to solve #19357 and writing some test, it seems that ClassifierChain is expected to be fitted with multilabel-indicator target (each column should only contain 0/1 classes).

However, there is no check and one can fit a multiclass-multioutput target. The classifier will later fail if calling decision_function that would return an array of shape n_samples, 3 while an array of n_samples, is expected. I assume a similar behaviour for predict_proba.

I think that we should check the type of target to raise a proper error at fit.

The text was updated successfully, but these errors were encountered:

jnothman · 2021-04-10T11:40:57Z

Duplicate of #13339?

jnothman · 2021-04-10T11:41:23Z

Alternatively, #14654 could be completed to support multioutput-multiclass. It's waiting for review

glemaitre · 2021-04-10T11:45:09Z

Thanks @jnothman I did not recall #14654. In case that we want to have support for multiclass-multioutput, what shape of y_pred do we expect?

glemaitre · 2021-04-10T13:01:57Z

I was looking at the original paper: https://www.cs.waikato.ac.nz/~ml/publications/2009/chains.pdf since that I am not really familiar with chains.

I have the impression that supporting multiclass-multioutput will solve a really different problem than multilabel-indicator.
By adding into the feature space, a column of y, you create an interaction. Thus, in multiclass-multioutput, the interaction created is between outputs, while in multilabel, this is between classes, isn't it?

If this is the case, we should rewrite the documentation to explain the difference in modelling.

glemaitre added Bug: triage Bug and removed Bug: triage labels Apr 9, 2021

glemaitre mentioned this issue Apr 10, 2021

[MRG] ENH add support for multiclass-multioutput to ClassifierChain #14654

Closed

cmarmo added the module:multioutput label Sep 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ClassifierChain should only accept multilabel-indicator #19853

ClassifierChain should only accept multilabel-indicator #19853

glemaitre commented Apr 9, 2021

jnothman commented Apr 10, 2021

jnothman commented Apr 10, 2021 •

edited

Loading

glemaitre commented Apr 10, 2021

glemaitre commented Apr 10, 2021

ClassifierChain should only accept multilabel-indicator #19853

ClassifierChain should only accept multilabel-indicator #19853

Comments

glemaitre commented Apr 9, 2021

jnothman commented Apr 10, 2021

jnothman commented Apr 10, 2021 • edited Loading

glemaitre commented Apr 10, 2021

glemaitre commented Apr 10, 2021

jnothman commented Apr 10, 2021 •

edited

Loading