Issues with performance of GraphCast operational model #85

Closed
LeBronQ opened this issue Jun 28, 2024 · 5 comments

Comments

@LeBronQ

LeBronQ commented Jun 28, 2024

Hi! We are using GraphCast for forecasting. We downloaded the “Complete ERA5 global atmospheric reanalysis” data at 0.25 x 0.25 degree resolution from CDS, merged it with the “reanalysis-era5-single-levels” data into one-day files, and then merged multiple one-day files for ten-day forecasting. The model parameters we used are GraphCast_operational - ERA5-HRES 1979-2021 - resolution 0.25 - pressure levels 13 - mesh 2to6 - precipitation output only.npz.
We analyzed the forecasts for the United States and Brazil and found relatively large errors in temperature. On some days the errors suddenly become very large (for example, temperatures above 60 degrees Celsius). The figure below shows our experimental results for the state of Minnesota. In addition, the mean temperature is much lower overall, in both Brazil and the United States.
We also found that the “dataset_source-hres_date-2022-01-01_res-0.25_levels-13_steps-01.nc” file provided on GCS is slightly different from the 2022-01-01 data we downloaded from CDS. Could there be a problem with our input data?
[Image: 20240628-162355, experimental results for Minnesota]
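
For anyone trying to reproduce the temperature spikes described above, a minimal sketch of a sanity check could look like the following. It assumes the temperatures are in kelvin (as ERA5 2 m temperature is). The function name, the plausibility bounds, and the sample series are all hypothetical and only serve to illustrate the idea.

```python
import numpy as np

KELVIN_OFFSET = 273.15


def flag_implausible_temperatures(temps_kelvin, low_c=-90.0, high_c=60.0):
    """Convert kelvin to Celsius and mark values outside [low_c, high_c].

    The default bounds are assumptions chosen for illustration only.
    """
    celsius = np.asarray(temps_kelvin, dtype=float) - KELVIN_OFFSET
    mask = (celsius < low_c) | (celsius > high_c)
    return celsius, mask


# Hypothetical forecast series in kelvin; the last step is a spike.
series = [268.2, 270.5, 271.0, 335.0]
celsius, mask = flag_implausible_temperatures(series)
print("suspicious steps:", np.flatnonzero(mask))
print("mean temperature (C):", celsius.mean())
```

Running a check like this on both the inputs and the forecasts may help tell apart a unit or merging problem in the inputs from an error that only appears in the model output.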

@alvarosg
Collaborator

alvarosg commented Jul 1, 2024

Thanks for your message. It is a bit hard to say from your description. Could you confirm whether the data you are downloading is ERA5 data, HRES analysis data, or HRES forecast input data?

You mention you are downloading ERA5 data. However, the HRES operational model does not take ERA5 as input; it takes HRES forecast inputs, which we usually download from the MARS repository.

So if you are comparing ERA5 data to this file: dataset_source-hres_date-2022-01-01_res-0.25_levels-13_steps-01.nc, then I would not expect it to match.

@alvarosg
Collaborator

alvarosg commented Jul 1, 2024

Could you check whether you get a good match when comparing your data to the ERA5 example data (the files that start with source-era5), and whether the forecast is good if you use the ERA5 weights?

Otherwise, it seems like the main problem is that you may have to download a separate dataset from MARS to initialize the operational model.
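
The comparison suggested above could be sketched roughly as follows. This is not code from the GraphCast repository: `compare_variables` is a hypothetical helper, and it assumes both datasets have already been loaded into plain arrays on the same grid (for example via `ds[name].values` on an xarray dataset). The variable name and the numbers are made up for illustration.

```python
import numpy as np


def compare_variables(mine, reference):
    """Compare two {variable name: array} mappings.

    Returns {name: (max_abs_diff, mean_bias)} for every shared variable,
    or None for a variable whose shapes differ (grids or levels mismatch).
    """
    report = {}
    for name in sorted(set(mine) & set(reference)):
        a = np.asarray(mine[name], dtype=float)
        b = np.asarray(reference[name], dtype=float)
        if a.shape != b.shape:
            report[name] = None
            continue
        diff = a - b
        report[name] = (float(np.max(np.abs(diff))), float(np.mean(diff)))
    return report


# Hypothetical values standing in for a CDS download and the example file.
mine = {"2m_temperature": [[280.0, 281.0]]}
reference = {"2m_temperature": [[280.5, 281.0]]}
report = compare_variables(mine, reference)
print(report)
```

A large mean bias on a single variable would point to a unit or source mismatch, while a `None` entry would point to differing grids or pressure levels.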

@LeBronQ
Author

LeBronQ commented Jul 2, 2024

Thank you very much for answering my question! We downloaded ERA5 reanalysis data at 0.25 x 0.25 degree resolution with 13 pressure levels from the CDS website. Can we use it with the "GraphCast - ERA5 1979-2017 - resolution 0.25 - pressure levels 37 - mesh 2to6 - precipitation input and output.npz" model? I see that there is a "source-era5_date-2022-01-01_res-0.25_levels-13_steps-01.nc" file in the GCS bucket.
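
One rough way to check whether a dataset file and a checkpoint line up is to compare the fields encoded in their names. The sketch below is only an illustration based on the names quoted in this thread. The helper functions are hypothetical, and the rule that a checkpoint with "operational" in its name expects `hres` data (and any other checkpoint expects `era5`) is an assumption drawn from the replies above, not a documented API.

```python
import re


def parse_dataset_name(file_name):
    """Split 'source-era5_date-..._res-0.25_levels-13_steps-01.nc' into a dict."""
    stem = file_name[:-3] if file_name.endswith(".nc") else file_name
    # Pieces without a '-' (such as a leading 'dataset') carry no key and are skipped.
    pairs = (piece.split("-", 1) for piece in stem.split("_") if "-" in piece)
    return {key: value for key, value in pairs}


def parse_checkpoint_name(checkpoint_name):
    """Read resolution, level count and expected source from a checkpoint name."""
    res = re.search(r"resolution ([\d.]+)", checkpoint_name)
    levels = re.search(r"pressure levels (\d+)", checkpoint_name)
    # Assumption: operational checkpoints expect HRES inputs, others ERA5.
    source = "hres" if "operational" in checkpoint_name else "era5"
    return {"res": float(res.group(1)), "levels": int(levels.group(1)), "source": source}


def looks_compatible(file_name, checkpoint_name):
    """Return True if resolution, level count and source all agree."""
    data = parse_dataset_name(file_name)
    model = parse_checkpoint_name(checkpoint_name)
    return (
        float(data["res"]) == model["res"]
        and int(data["levels"]) == model["levels"]
        and data["source"] == model["source"]
    )


era5_file = "source-era5_date-2022-01-01_res-0.25_levels-13_steps-01.nc"
era5_37 = ("GraphCast - ERA5 1979-2017 - resolution 0.25 - pressure levels 37 "
           "- mesh 2to6 - precipitation input and output.npz")
print(looks_compatible(era5_file, era5_37))
```

A name-based check like this only catches obvious mismatches in resolution, level count, or source. It says nothing about whether the variables inside the file are correct.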

@LeBronQ
Author

LeBronQ commented Jul 2, 2024

We have another question. This model has obtained good results in precipitation indicators. Have you tried to use ERA5 data as input for operational models before? Can you get better results in some indicators?

@alvarosg
Collaborator

> Have you tried to use ERA5 data as input for operational models before?

We don't typically initialize the operational models on ERA5 data, because ERA5 data is not available in real time.
