You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When training a GBT with sample weights, the partial dependence plot completely ignores the sample weights.
Steps/Code to Reproduce
Create a dataset with two subpopulations, one subpopulation where y = X[:,1] and the other where y = -X[:,1] so that without sample weights, the partial dependences cancel out.
Add a large sample weight to the first subpopulation compared to the other (e.g. a 100 to 1 ratio) so that the resulting model should reflect the dependence y = X[:,1].
Thanks, I had not caught up with this PR. Indeed, with the "recursion" method, the same underlying _partial_dependence_tree function from ensemble._gradient_boosting.pyx is called, so the issue remains.
Description
When training a GBT with sample weights, the partial dependence plot completely ignores the sample weights.
Steps/Code to Reproduce
Create a dataset with two subpopulations, one subpopulation where
y = X[:,1]
and the other wherey = -X[:,1]
so that without sample weights, the partial dependences cancel out.Add a large sample weight to the first subpopulation compared to the other (e.g. a 100 to 1 ratio) so that the resulting model should reflect the dependence
y = X[:,1]
.Expected Results
Partial dependence with sample weights should mostly reflect the points of the dataset where
y = X[:,1]
.Actual Results
Versions
0.21.dev0
The text was updated successfully, but these errors were encountered: