TST mark test as xfail due to bug fix in pandas-dev #26344

glemaitre · 2023-05-07T11:14:23Z

Partially address #26154

Solving the issue pointed out here: #26154 (comment)

In short, pandas will better infer type during DataFrame concatenation with missing values. Previously, due to the way we read by chunk in the liac-arff parser, we could end up with None and np.nan in the same column. The new version of pandas will identify both values are missing values.

Since the new behaviour is what one would expect but we cannot make a backport, a way is to mark the test as xfail.

adrinjalali · 2023-05-08T09:23:46Z

So this means we won't be really supporting as_frame=True with parser="liac-arff", right? Should we then at least deprecated that usage?

glemaitre · 2023-05-16T14:44:15Z

So this means we won't be really supporting as_frame=True with parser="liac-arff", right?

This does not change. Here, it is just that the number of column detected as numerical or categorical will changed depending if None will be map to a proper missing value (which was not in the passed).

thomasjpfan · 2023-05-16T20:37:19Z

I think we can adjust the implementation to infer better dtypes. I opened #26386 as an alternative to this PR.

lesteve · 2023-05-23T13:02:31Z

Closing since #26386 has been merged

TST mark test as xfail due to bug fix in pandas-dev

df5eb21

github-actions bot added the module:datasets label May 7, 2023

glemaitre added the No Changelog Needed label May 7, 2023

thomasjpfan mentioned this pull request May 16, 2023

TST Fix openml parser implementation for pandas-dev #26386

Merged

lesteve closed this May 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

TST mark test as xfail due to bug fix in pandas-dev #26344

TST mark test as xfail due to bug fix in pandas-dev #26344

Uh oh!

glemaitre commented May 7, 2023

Uh oh!

adrinjalali commented May 8, 2023

Uh oh!

glemaitre commented May 16, 2023

Uh oh!

thomasjpfan commented May 16, 2023

Uh oh!

lesteve commented May 23, 2023

Uh oh!

Uh oh!

Uh oh!

TST mark test as xfail due to bug fix in pandas-dev #26344

TST mark test as xfail due to bug fix in pandas-dev #26344

Uh oh!

Conversation

glemaitre commented May 7, 2023

Uh oh!

adrinjalali commented May 8, 2023

Uh oh!

glemaitre commented May 16, 2023

Uh oh!

thomasjpfan commented May 16, 2023

Uh oh!

lesteve commented May 23, 2023

Uh oh!

Uh oh!