better first guess for `fit_yeo_johnson_transform` #492

mathause · 2024-08-08T13:24:22Z

We can speed up `fit_yeo_johnson_transform` by passing a better first guess, which we can obtain by assuming the trend is 0:

from sklearn.preprocessing import PowerTransformer

l = PowerTransformer().fit(tas_stacked_y.tas).lambdas_

# we can calculate xi_0 from lambda as
xi_0 = (2 - l) / l
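The conversion above can be checked with a small round trip. This is only a sketch: it assumes lambda is parameterized as `2 / (1 + xi_0 * exp(xi_1 * T))`, which is my reading of the lambda function and is not stated in this thread. The names `lambda_from_xi` and `xi_0_from_lambda` are hypothetical helpers, and the numbers are made up.

```python
import math


def lambda_from_xi(xi_0, xi_1, t_yearly):
    # ASSUMED logistic form of lambda (not confirmed in this thread);
    # for xi_0 > 0 it keeps lambda inside (0, 2)
    return 2.0 / (1.0 + xi_0 * math.exp(xi_1 * t_yearly))


def xi_0_from_lambda(lmbda):
    # inverse of the form above for xi_1 = 0 (no trend): lambda = 2 / (1 + xi_0)
    return (2.0 - lmbda) / lmbda


lam = 0.8  # illustrative constant lambda, e.g. from PowerTransformer
xi_0 = xi_0_from_lambda(lam)

# with xi_1 = 0 the recovered lambda does not depend on the yearly temperature
recovered = [lambda_from_xi(xi_0, 0.0, t) for t in (-1.0, 0.0, 2.5)]
print(xi_0, recovered)
```

With `xi_1 = 0` the exponential term is 1 for every temperature, so a single constant lambda maps to exactly one `xi_0`.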


veni-vidi-vici-dormivi · 2024-08-09T07:41:23Z

Hm, but instead of `tas_stacked_y.tas` we would use `resids_after_hm.tas[month]`, right? So the assumption would be that the monthly residuals are skewed w.r.t. the yearly values, but that the skew is constant and does not depend on the yearly temperature value. That's a good idea. But we would need to do it 12 times too. Does that pay off?

mathause · 2024-08-09T09:23:14Z

> Hm, but instead of `tas_stacked_y.tas` we would use `resids_after_hm.tas[month]`, right?

Yes

> Does that pay off?

The idea is that there is not much trend, that it's much faster to fit one parameter than two, and that starting at a good point for $\xi_0$ speeds up the minimization. It helps, but only by about 10%, so much less than I would have hoped.

mathause · 2024-08-12T19:03:37Z

I could try again with a much lower precision for the first guess - most of the iterations are spent homing in on the estimate. The fit uses `scipy.optimize.brent` with a tolerance of about 1e-8. For our purpose 1e-2 is probably enough.

Only problem: the `tol` param is not exposed in `PowerTransformer().fit`.

https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.brent.html

Just for clarity: this yields a maximum of another 10% speed gain - so it's still debatable whether it's worth the trouble.
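One way to get around the missing `tol` argument would be to bypass `PowerTransformer` and minimize the Yeo-Johnson negative log-likelihood directly. The following is a rough sketch, not what the package does: `fit_lambda_coarse` is a hypothetical helper, the data are synthetic, and the bracket `(-2, 2)` is an assumption about a sensible search range. `scipy.stats.yeojohnson_llf` and `scipy.optimize.brent` are the only library calls used.

```python
import numpy as np
from scipy import optimize, stats


def fit_lambda_coarse(x, tol=1e-2):
    """Maximum-likelihood Yeo-Johnson lambda with an adjustable tolerance.

    Hypothetical helper: returns the estimate and the number of
    function evaluations that brent needed.
    """

    def neg_llf(lmbda):
        # brent minimizes, so flip the sign of the log-likelihood
        return -stats.yeojohnson_llf(lmbda, x)

    # the bracket (-2, 2) is an assumed starting interval
    lmbda, _, _, funcalls = optimize.brent(
        neg_llf, brack=(-2.0, 2.0), tol=tol, full_output=True
    )
    return lmbda, funcalls


# synthetic, right-skewed stand-in for one month of residuals
rng = np.random.default_rng(0)
x = rng.gamma(shape=2.0, size=500) - 2.0

lam_coarse, n_coarse = fit_lambda_coarse(x, tol=1e-2)
lam_fine, n_fine = fit_lambda_coarse(x, tol=1.48e-8)  # brent's default tol

# first guess for xi_0 under the zero-trend assumption
xi_0_first_guess = (2 - lam_coarse) / lam_coarse
print(lam_coarse, n_coarse, lam_fine, n_fine, xi_0_first_guess)
```

Comparing `n_coarse` and `n_fine` on real residuals would show how many evaluations the looser tolerance actually saves.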

mathause added topic-performance topic-stats labels Aug 8, 2024

mathause mentioned this issue Sep 2, 2024, in "fourier series: more consistent with standard definition" #512 (merged)
