The replication issues with the downscaling task. #37

Tttizi · 2023-12-18T14:04:57Z

I ran the downscaling task with the publicly available code on GitHub but could not reproduce the performance reported in the paper. Specifically, I obtained a Root Mean Squared Error (RMSE) of 6.08 for T2m, whereas the paper reports 2.79. Are there key points I should be aware of that could explain this discrepancy?
I also noticed some differences between the paper's description and the provided code, such as the learning rate setting. I tried various combinations but could not obtain the reported results. I would appreciate your advice on this.
Finally, which pre-trained model should I use: the 1.40625-degree one? I ran into some confusion during my attempts and would appreciate your guidance.
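For anyone comparing numbers against the paper, it can help to confirm how the RMSE is computed before tuning anything else. Below is a minimal sketch of a latitude-weighted RMSE in physical units, assuming the WeatherBench-style convention (cosine-of-latitude weights normalized to mean 1, RMSE per time step, then averaged over time). The function names, array shapes, and the `mean`/`std` arguments are illustrative assumptions, not the repository's API.

```python
import numpy as np

def denormalize(x, mean, std):
    """Map a standardized field back to physical units (e.g. Kelvin for T2m)."""
    return x * std + mean

def lat_weighted_rmse(pred, target, lats_deg):
    """Latitude-weighted RMSE.

    pred, target: arrays of shape (time, lat, lon), in the same units.
    lats_deg:     array of shape (lat,), latitudes in degrees.
    """
    weights = np.cos(np.deg2rad(lats_deg))
    weights = weights / weights.mean()            # normalize weights to mean 1
    sq_err = (pred - target) ** 2                 # (time, lat, lon)
    weighted = sq_err * weights[None, :, None]    # broadcast over time and lon
    per_step = np.sqrt(weighted.mean(axis=(1, 2)))  # RMSE for each time step
    return float(per_step.mean())                 # average over time
```

If the number from a run was computed on normalized outputs, or without latitude weighting, it would not be directly comparable to a value computed this way; whether that applies here is an open question, not a diagnosis.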

tung-nd · 2023-12-21T22:59:46Z

Hi, thank you for your interest in ClimaX. To answer your questions in order:

1. Can you elaborate on what the differences are?
2. Yes, you should use the 1.40625deg model. What issues did you run into when trying to use it?

Tttizi · 2023-12-22T02:41:15Z

Thank you very much for your response. I have noticed three differences between the paper and the code. First, the paper gives a learning rate of 5e-5 for the downscaling task, while the code sets it to 5e-4. Second, the paper does not explicitly mention the warmup setting, but in the code it seems to exceed 5 epochs. Third, the paper states that you trained a separate network for each feature, while the code predicts these features together. I have tried adjusting these settings, but the performance is still unsatisfactory. Could you provide the specific settings used for each feature, or more detailed guidance on how to reproduce the results from the paper?
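To make the first two differences concrete, here is a minimal sketch of how the peak learning rate and the warmup length shape the schedule. It assumes a linear warmup followed by cosine decay, which is one common choice; whether the repository uses exactly this shape is an assumption, and `warmup_start_lr`, `eta_min`, and the epoch counts below are hypothetical values chosen only for illustration.

```python
import math

def lr_at_epoch(epoch, base_lr, warmup_epochs, max_epochs,
                warmup_start_lr=1e-8, eta_min=1e-8):
    """Learning rate at a given epoch: linear warmup, then cosine decay.

    All default values are hypothetical, not taken from the paper or the code.
    """
    if epoch < warmup_epochs:
        # Linear ramp from warmup_start_lr up to base_lr.
        frac = epoch / max(1, warmup_epochs)
        return warmup_start_lr + frac * (base_lr - warmup_start_lr)
    # Cosine decay from base_lr down to eta_min over the remaining epochs.
    progress = (epoch - warmup_epochs) / max(1, max_epochs - warmup_epochs)
    return eta_min + 0.5 * (base_lr - eta_min) * (1.0 + math.cos(math.pi * progress))

if __name__ == "__main__":
    # Compare the two peak learning rates mentioned above over a short run.
    for epoch in (0, 2, 5, 25, 50):
        paper = lr_at_epoch(epoch, 5e-5, warmup_epochs=5, max_epochs=50)
        code = lr_at_epoch(epoch, 5e-4, warmup_epochs=5, max_epochs=50)
        print(f"epoch {epoch:2d}: lr(5e-5)={paper:.2e}  lr(5e-4)={code:.2e}")
```

Under this shape, the two settings differ by a constant factor of roughly ten at every epoch after warmup begins, so the peak value and the warmup length are worth pinning down together when trying to match the paper.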

Tttizi · 2023-12-22T02:53:05Z

I have another question regarding the data. There are two issues with the data provided in the Hugging Face link. First, it lacks data for the features "10_m_u_component_of_wind" and "10_m_v_component_of_wind." Second, the data does not match the WeatherBench dataset. Since there are no timestamps, I extracted data for one day and compared it with the data corresponding to that year in the WeatherBench dataset. Unfortunately, I couldn't find matching data.
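The comparison described above can be sketched as a search over time steps. This minimal example standardizes each field before comparing, so that a match can still be found if one dataset is stored normalized and the other in physical units; that the hosted data is normalized is only an assumption here. The function names and array shapes are illustrative, and both inputs are assumed to be on the same grid with the same latitude ordering (a flipped latitude axis would need to be handled first).

```python
import numpy as np

def standardize(x):
    """Z-score each 2-D field over its last two (lat, lon) axes."""
    mean = x.mean(axis=(-2, -1), keepdims=True)
    std = x.std(axis=(-2, -1), keepdims=True)
    return (x - mean) / std

def best_match(sample, reference):
    """Find the reference time step most correlated with a sample field.

    sample:    array of shape (lat, lon), one unlabelled time step.
    reference: array of shape (time, lat, lon), e.g. one year of one variable.
    Returns (time_index, spatial_correlation).
    """
    s = standardize(sample)
    r = standardize(reference)
    # Mean product of z-scores equals the Pearson correlation per time step.
    corr = (r * s[None]).mean(axis=(1, 2))
    idx = int(corr.argmax())
    return idx, float(corr[idx])
```

A best correlation close to 1 would suggest the same field up to an affine rescaling, while a clearly lower value for every time step would support the observation that the two datasets do not match.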

Tttizi · 2024-01-05T02:31:24Z

Hi, just wanted to check if there have been any updates on this issue.

Tttizi · 2024-01-26T06:40:50Z

Hi, just wanted to check if there have been any updates on this issue.

Tttizi · 2024-02-01T10:12:37Z

I've noticed in the code that during the network initialization, there is a feature called land_sea_mask. Is this feature used in the downscaling task? Where is this data obtained from?

Escape142 · 2024-05-22T07:12:57Z

@tung-nd, is Cli-ViT from the ClimaX paper the same as the ViT in the Climate Learn paper?
