-
-
Notifications
You must be signed in to change notification settings - Fork 66
AIFS Model #81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I would appreciate it if you could share more about your vision and priorities, as this would greatly assist us in aligning our efforts and working more effectively together. I have a few questions:
Thank you! |
One thing to note is AIFS is now public, so it should be easier to see how it works/how it builds its graph and maybe incorporate that here.
My suggestion with this one would be more to make two PRs, one to add icosahedral grids that is also fast to generate, and one for supporting Anemoi graphs, as put in #102. A different PR to add AIFS could be helpful, but as the model is fully open and in PyTorch already, it is less of a priority. |
Thanks for the details! I’ve gone through your comments, and I think breaking this into separate PRs makes the most sense. Here’s how I plan to approach it:
I’ll start by reviewing the AIFS model code to understand how it constructs graphs and how best to integrate it. If you have any preferences on implementation details or benchmarks you'd like to see, let me know! Thanks! |
Great! Yeah, that sounds like a good plan |
While working on the graph generators, I've identified a few enhancements that could significantly improve the scalability and flexibility of our system. Here are my suggestions:
I think this may be better suited for a separate PR. I would greatly appreciate your insights on this matter. |
https://www.ecmwf.int/en/about/media-centre/aifs-blog/2024/first-update-aifs
Detailed Description
AIFS model code isn't public, so not exactly sure how it is working, although its apparently similar to GraphCast. On a new update, it uses a reduced Gaussian octahedral grid for the processor, and the encoder an decoder use attention-based graph neural networks. The processor grid has 40320 grid points, which is processes as a sequence with a sliding attention window.
Context
It is another approach that might be worth including here. Good use for #76, as it could be another internal graph representation to easily slot in and try out.
Possible Implementation
The text was updated successfully, but these errors were encountered: