You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
By trying to solve #19357 and writing some test, it seems that ClassifierChain is expected to be fitted with multilabel-indicator target (each column should only contain 0/1 classes).
However, there is no check and one can fit a multiclass-multioutput target. The classifier will later fail if calling decision_function that would return an array of shape n_samples, 3 while an array of n_samples, is expected. I assume a similar behaviour for predict_proba.
I think that we should check the type of target to raise a proper error at fit.
The text was updated successfully, but these errors were encountered:
I have the impression that supporting multiclass-multioutput will solve a really different problem than multilabel-indicator.
By adding into the feature space, a column of y, you create an interaction. Thus, in multiclass-multioutput, the interaction created is between outputs, while in multilabel, this is between classes, isn't it?
If this is the case, we should rewrite the documentation to explain the difference in modelling.
By trying to solve #19357 and writing some test, it seems that
ClassifierChain
is expected to be fitted withmultilabel-indicator
target (each column should only contain 0/1 classes).However, there is no check and one can fit a
multiclass-multioutput
target. The classifier will later fail if callingdecision_function
that would return an array of shapen_samples, 3
while an array ofn_samples,
is expected. I assume a similar behaviour forpredict_proba
.I think that we should check the type of target to raise a proper error at
fit
.The text was updated successfully, but these errors were encountered: