ENH use log1p and expm1 in Yeo-Johnson transformation and its inverse #27868


Closed
wants to merge 2 commits into from

Conversation

xuefeng-xu
Contributor

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR is inspired by scipy's Yeo-Johnson transformation, and it also implements the inverse transform.
https://github.com/scipy/scipy/blob/fcf7b652bc27e47d215557bda61c84d19adc3aae/scipy/stats/_morestats.py#L1495-L1516

Specifically, if $\lambda=1$, the transformation reduces to the identity, so we can skip the computation and return $x$ directly.

Any other comments?

The formula of the YJ transformation:

$$
\psi(x, \lambda) =
\begin{cases}
\dfrac{(x+1)^{\lambda} - 1}{\lambda} & \lambda \neq 0,\ x \geq 0 \\
\log(x+1) & \lambda = 0,\ x \geq 0 \\
-\dfrac{(-x+1)^{2-\lambda} - 1}{2-\lambda} & \lambda \neq 2,\ x < 0 \\
-\log(-x+1) & \lambda = 2,\ x < 0
\end{cases}
$$
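A minimal sketch of the `log1p`/`expm1` rewrite this PR describes, for both directions (function names here are illustrative, not scikit-learn's actual `_yeo_johnson_transform`): since $(x+1)^\lambda = e^{\lambda \log(1+x)}$, each power branch becomes `expm1(lmbda * log1p(x)) / lmbda`, which is more accurate near $x = 0$.

```python
import numpy as np

def yeo_johnson(x, lmbda):
    """Yeo-Johnson transform via log1p/expm1 (illustrative sketch)."""
    x = np.asarray(x, dtype=float)
    if lmbda == 1:  # transform reduces to the identity
        return x.copy()
    out = np.empty_like(x)
    pos = x >= 0
    if lmbda == 0:
        out[pos] = np.log1p(x[pos])
    else:  # ((x+1)**lmbda - 1) / lmbda, computed stably
        out[pos] = np.expm1(lmbda * np.log1p(x[pos])) / lmbda
    if lmbda == 2:
        out[~pos] = -np.log1p(-x[~pos])
    else:  # -(((-x+1)**(2-lmbda)) - 1) / (2-lmbda), computed stably
        out[~pos] = -np.expm1((2 - lmbda) * np.log1p(-x[~pos])) / (2 - lmbda)
    return out

def yeo_johnson_inverse(y, lmbda):
    """Inverse transform; relies on the YJ transform preserving sign."""
    y = np.asarray(y, dtype=float)
    if lmbda == 1:
        return y.copy()
    x = np.empty_like(y)
    pos = y >= 0
    if lmbda == 0:
        x[pos] = np.expm1(y[pos])
    else:  # x = (lmbda*y + 1)**(1/lmbda) - 1
        x[pos] = np.expm1(np.log1p(lmbda * y[pos]) / lmbda)
    if lmbda == 2:
        x[~pos] = -np.expm1(-y[~pos])
    else:  # x = -((1 - (2-lmbda)*y)**(1/(2-lmbda)) - 1)
        x[~pos] = -np.expm1(np.log1p(-(2 - lmbda) * y[~pos]) / (2 - lmbda))
    return x
```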


✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 92bba12.

@s-banach
Contributor

If _yeo_johnson_transform accepted an optional out parameter, then _yeo_johnson_optimize could reuse the same out array every time it calls _yeo_johnson_transform and reduce some array allocations.

I do this in my own personal work to speed up Yeo-Johnson; maybe you could add it here along with this optimization?

@xuefeng-xu
Contributor Author

Thanks, @s-banach. But scikit-learn will later switch to scipy.stats.yeojohnson for YJ to resolve another issue, see #26308, so I will probably keep the current implementation as is. Maybe you could open a PR at scipy?

@lorentzenchr
Member

As explained in #26308, we want to (sooner or later) rely on scipy and get rid of our own implementation.
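For reference, the scipy function the maintainers plan to delegate to can be called in two modes; this sketch assumes scipy is installed and uses only the documented `scipy.stats.yeojohnson` signature:

```python
import numpy as np
from scipy import stats

x = np.array([-1.5, 0.0, 2.0, 5.0])

# Fixed lambda: returns just the transformed array.
# lambda = 1 is the identity transform.
y = stats.yeojohnson(x, lmbda=1.0)

# No lambda given: scipy estimates it by maximum likelihood
# and returns (transformed array, fitted lambda).
y_mle, lmbda = stats.yeojohnson(x)
```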

3 participants