
Expanded ColumnTransformer functionality -- transforming subsets of data #28130

@rebeccaherman1

Describe the workflow you want to enable (edited)

The ability to (inverse_)transform data that corresponds to only a subset of a ColumnTransformer's component transformers.

Describe your proposed solution

Data containing only the relevant columns could be passed in together with a new keyword argument that identifies the relevant component transformers by name.
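To make the request concrete, here is a rough sketch of the behaviour using only the existing fitted attribute `transformers_`. The helper `transform_subset` and its `names` and `inverse` parameters are hypothetical and not part of scikit-learn. The sketch assumes the data is a pandas DataFrame, every component was specified with a list of column names, and every selected component supports the method being called.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import MinMaxScaler, StandardScaler


def transform_subset(ct, X_subset, names, inverse=False):
    """Hypothetical helper: apply only the named, already-fitted components of `ct`.

    Assumes X_subset is a DataFrame that holds the columns of the named
    components and that each component was specified with a list of column names.
    """
    # transformers_ holds (name, fitted transformer, columns) for each component.
    fitted = {name: (trans, cols) for name, trans, cols in ct.transformers_}
    blocks = []
    for name in names:
        trans, cols = fitted[name]
        method = trans.inverse_transform if inverse else trans.transform
        blocks.append(np.asarray(method(X_subset[cols])))
    # Concatenate the per-component outputs in the order the names were given.
    return np.hstack(blocks)


# Fit on the full dataset, with one named component per column group.
df = pd.DataFrame(
    {"age": [20.0, 30.0, 40.0], "dose": [0.0, 5.0, 10.0], "score": [1.0, 2.0, 3.0]}
)
ct = ColumnTransformer(
    [
        ("covariates", StandardScaler(), ["age"]),
        ("intervention", MinMaxScaler(), ["dose"]),
        ("outcome", StandardScaler(), ["score"]),
    ]
)
ct.fit(df)

# Later: transform new data that only contains the intervention column.
new_doses = pd.DataFrame({"dose": [2.5, 7.5]})
scaled = transform_subset(ct, new_doses, ["intervention"])

# Map scaled values back to the original units with the same component.
restored = transform_subset(
    ct, pd.DataFrame(scaled, columns=["dose"]), ["intervention"], inverse=True
)
```

A built-in keyword would presumably do the same lookup internally, so the caller would not have to reach into `transformers_` directly.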

Describe alternatives you've considered, if relevant

A function that subsets a ColumnTransformer object, including renumbering the column indices to match the reduced data.
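A minimal sketch of what this alternative might look like for integer column indices. The helpers `subset_components` and `apply_components` are hypothetical names, not scikit-learn API. The sketch assumes each component was specified with a list of integer indices and that the reduced array keeps the selected columns in their original relative order.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import MinMaxScaler, StandardScaler


def subset_components(ct, names):
    """Hypothetical helper: select fitted components and renumber their columns.

    Assumes integer column indices and that the reduced data keeps the
    selected columns in their original relative order.
    """
    chosen = [
        (name, trans, list(cols))
        for name, trans, cols in ct.transformers_
        if name in names
    ]
    # Map each original column index to its position in the reduced array.
    kept = sorted(col for _, _, cols in chosen for col in cols)
    new_index = {old: new for new, old in enumerate(kept)}
    return [(name, trans, [new_index[c] for c in cols]) for name, trans, cols in chosen]


def apply_components(components, X_small):
    """Apply each fitted component to its renumbered columns and concatenate."""
    return np.hstack([trans.transform(X_small[:, cols]) for _, trans, cols in components])


X = np.array(
    [[1.0, 0.0, 10.0, 5.0], [2.0, 5.0, 20.0, 7.0], [3.0, 10.0, 30.0, 9.0]]
)
ct = ColumnTransformer(
    [
        ("a", StandardScaler(), [0]),
        ("b", MinMaxScaler(), [1, 2]),
        ("c", StandardScaler(), [3]),
    ]
)
ct.fit(X)

# Keep components "b" and "c"; original columns 1, 2, 3 become 0, 1, 2.
components = subset_components(ct, ["b", "c"])
X_small = X[:, [1, 2, 3]]
out = apply_components(components, X_small)
```

Compared with a keyword on `transform`, this pushes the index bookkeeping onto a separate object, which is the part that is easy to get wrong by hand.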

Additional context

In artificial intelligence applications, a researcher may want to transform an entire dataset, organized into column groups, for learning, and then at a later time apply the same fitted transformations to new data that corresponds only to interventions or predictions. Hence the need to be able to subset a ColumnTransformer or to pass in only part of the data.

See #27957 for more discussion.
