
Add Example for several Scenarios #521

Open · wants to merge 15 commits into main
Conversation

veni-vidi-vici-dormivi
Collaborator

if this works could also add an integration test for this

  • Closes add later
  • Tests added
  • Fully documented, including CHANGELOG.rst

@veni-vidi-vici-dormivi
Collaborator Author

[image: residuals around the transition from the historical to the projected period]

Okay, so when I plot the residuals after removing the global trend (including volcanic forcing), treating the historical members as their own scenario, I get this "mismatch" at the transition from the historical to the projected period. It is not pretty, but for the fitting it should be fine, as long as we keep treating the historical data as its own scenario for the AR processes. For the linear regressions and the variances/covariances it does not matter, since these do not consider time dependency. But I would be happy if a second brain went over this as well, @mathause 🙂

We should point out, however, that in the emulation process one should use a continuous time series to ensure continuity of the realization.
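To illustrate the point above: per-scenario AR fitting never crosses the historical/projection boundary, so the jump at the transition does not enter the estimates. A minimal sketch with synthetic residuals; `ar1_coef` is a hypothetical helper, not MESMER's actual API:

```python
import numpy as np

def ar1_coef(residuals):
    """Estimate the lag-1 autoregression coefficient of a 1D series."""
    x = residuals - residuals.mean()
    return (x[:-1] @ x[1:]) / (x[:-1] @ x[:-1])

rng = np.random.default_rng(0)
# one synthetic residual series per "scenario"
# (the historical period is treated as its own scenario)
segments = {
    "hist": rng.standard_normal(165),
    "ssp126": rng.standard_normal(86),
    "ssp585": rng.standard_normal(86),
}
# each fit only ever sees one segment, so the mismatch at the
# hist/future boundary never enters the AR estimates
coefs = {scen: ar1_coef(res) for scen, res in segments.items()}
```

For the actual emulation, by contrast, one continuous realization per scenario (historical plus future) keeps the emulated series free of an artificial jump.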


codecov bot commented Sep 11, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 49.77%. Comparing base (0e15d4e) to head (ef315d7).
Report is 12 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #521      +/-   ##
==========================================
+ Coverage   49.76%   49.77%   +0.01%     
==========================================
  Files          50       50              
  Lines        3563     3572       +9     
==========================================
+ Hits         1773     1778       +5     
- Misses       1790     1794       +4     
Flag        Coverage Δ
unittests   49.77% <ø> (+0.01%) ⬆️

Flags with carried forward coverage won't be shown.

@veni-vidi-vici-dormivi
Collaborator Author

Okay, I'm actually surprisingly happy with this approach and impressed by what xarray and datatree can do. The data tree approach I went for here (one dataset per scenario, with the members along a dimension) feels nice.

Nevertheless, I want to rewrite the autoregression functions to work on data trees instead of the arg list. For the linear regression and covariance we could think about implementing functions that take care of the stacking and weighting automatically. Actually, I think this would be quite fun.
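As a rough sketch of what such a stacking-and-weighting helper could do (all names, the data layout, and the 1/n_members weighting are assumptions for illustration, not the planned implementation):

```python
import numpy as np

# hypothetical layout: one (predictor, target) sample pair per scenario;
# each sample is weighted by 1 / n_members so every scenario contributes
# equally regardless of ensemble size
samples = {
    "ssp126": (np.linspace(0, 1, 50), np.linspace(0, 2, 50)),
    "ssp585": (np.linspace(0, 3, 50), np.linspace(0, 6, 50)),
}
n_members = {"ssp126": 3, "ssp585": 5}

# stack all scenarios into flat sample arrays
x = np.concatenate([v[0] for v in samples.values()])
y = np.concatenate([v[1] for v in samples.values()])
w = np.concatenate(
    [np.full(v[0].size, 1.0 / n_members[k]) for k, v in samples.items()]
)

# weighted least squares: scale rows by sqrt(weight) and solve
X = np.column_stack([np.ones_like(x), x])
beta = np.linalg.lstsq(X * w[:, None] ** 0.5, y * w**0.5, rcond=None)[0]
# beta[0] is the intercept, beta[1] the slope
```

The same stacking logic would apply to the covariance estimation, where each stacked sample carries its scenario weight.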

But I want to focus on MESMER-X for the rest of the week.

@veni-vidi-vici-dormivi
Collaborator Author

@yquilcaille You can use this now. All functionality should stay the same as it is here; some of the manual data prepping I do will just be moved into functions, which needs more time to implement cleanly.

One thing: if you calibrate on all the ESMs, could you tell me whether you ever run into singular correlation matrices when fitting for the best localization radius? At the moment this aborts the fitting, and we are still debating whether it is worth implementing a version where singular matrices are allowed. Thank you!
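For reference, a minimal sketch of how such a singularity can be detected (an illustration via a Cholesky attempt, not MESMER's actual abort logic):

```python
import numpy as np

def is_singular(corr):
    """Detect a numerically singular (not positive definite)
    correlation matrix via a Cholesky factorization attempt."""
    try:
        np.linalg.cholesky(corr)
        return False
    except np.linalg.LinAlgError:
        return True

# identity: valid correlation matrix of independent samples
well_conditioned = is_singular(np.eye(3))   # False
# all-ones: perfectly correlated samples -> rank 1, singular
degenerate = is_singular(np.ones((3, 3)))   # True
```

A correlation matrix estimated from few members can become singular when the localization radius is large relative to the available samples, which is the situation the fitting currently aborts on.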

@yquilcaille
Collaborator

Thanks @veni-vidi-vici-dormivi! The "surfer" looks good, no problem to add it. I agree that the preparation of the data should be moved into functions with the future cleaning. Also, some users may benefit from easy wrappers, like one for training, one for emulation.

I will now use this surfer to prepare the training of all ESMs and emulations for FASTMIP. I promise, if any issue appears with the singular matrices, I will let you know :)

@mathause
Member

Thanks! Cool that this works, and sorry for the late reply. I would like to see some changes before merging, though:

  • Should we use the example datasets so it's self-contained? Maybe could mention how to use with cmip6-ng.
  • Did you ever double check this is consistent with Lea's results?
  • rename notebook to e.g. example_mesmer_multi_ens_multi_scen.ipynb?
  • There is quite a bit of clean-up possible
    • unused functionality and imports (but see the suggestions above)
    • in Cell 12 you manipulate the data in a function call which is not nice (in a tutorial)
    • there is more but commenting a notebook is annoying - I might go over the notebook later, but would appreciate if you gave it a first pass
  • the emulations have the same variability. Haven't we discussed this?
    • can you gather the emulation part into a function (in the notebook) - maybe needs a function for the variability and one for the trend?
    • should the historical part for two different scenarios have the same variability? I think both should be possible

@mathause
Member

TODO: check the status of datatree in xarray. It would be good if we could enable using it from xarray (DataTree and map_over_subtree are now available from the main xarray namespace), but that needs a relatively new version.

@mathause mathause mentioned this pull request Oct 1, 2024
@veni-vidi-vici-dormivi
Collaborator Author

Should we use the example datasets so it's self-contained? Maybe could mention how to use with cmip6-ng.

Yes absolutely, have done this locally already, will push it soon. I am currently working on implementing the data tree approach in the repo and moving this into the integration tests.

Did you ever double check this is consistent with Lea's results?

It is not, because I treat the historical period as a completely independent scenario, i.e. I smooth the historical and scenario parts separately, which leads to different values around the transition from the historical to the future period. This gives different values than Lea's for the smoothed global mean, and thus for the residuals and everything thereafter, i.e. all the parameters. What do you think about this? I find it more elegant because there is no duplication of the historical period. As I see it, Lea solved this by taking the median over the scenarios' historical parts beforehand:

gt_s, time_s = separate_hist_future(gt, time, cfg)
# compute median LOWESS estimate of historical part across all scenarios
gt_hist_all = gt_s.pop("hist")
gt_hist_median = np.median(gt_hist_all, axis=0)

rename notebook to e.g. example_mesmer_multi_ens_multi_scen.ipynb?

Agree. Will do.

There is quite a bit of clean-up possible

Will get back to this later

the emulations have the same variability. Haven't we discussed this?
can you gather the emulation part into a function (in the notebook) - maybe needs a function for the variability and one for the trend?
should the historical part have for two different scenarios have the same variability? I think both should be possible

Ah right. If we want different historical variability we just need a different seed for each scenario, so both are possible depending on the seed?
Yes, good idea to gather it into a function in the notebook but not in the repo; I like that. I need to think about how to write it so that we can potentially reuse the trend.
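The seed idea in a minimal sketch (stand-in noise instead of the actual AR process; the function name is hypothetical):

```python
import numpy as np

def emulate_variability(seed, n_time=165):
    """Stand-in for drawing one realization of internal variability."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal(n_time)

# same seed for the historical segment of two scenarios
# -> identical historical realizations (shared history)
hist_ssp126 = emulate_variability(seed=42)
hist_ssp585 = emulate_variability(seed=42)

# a different seed -> an independent historical realization
hist_other = emulate_variability(seed=43)
```

So whether two scenarios share their historical variability is controlled entirely by whether their historical segments reuse the same seed.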
