Skip to content

Add propagate w_mul_xj CUDA sparse support using matrix mul #610

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 16, 2025

Conversation

dferre97
Copy link
Contributor

Added fast w_mul_xj propagate CUDA support for sparse graphs using SpMM.
Added benchmarks to compare with gather/scatter approach, speedup from 40x to 300x on my machine, huge memory allocation benefits (up to 1000x less or more depending on size of graph and sparsity level).
CUDA tests on GraphNeuralNetworks passed.

@CarloLucibello
Copy link
Member

Build kite Cuda issues doesn't seem related to this pr, but we have to fix it at some point.

@CarloLucibello CarloLucibello merged commit 9b327d6 into JuliaGraphs:master Jul 16, 2025
5 of 7 checks passed
@dferre97 dferre97 deleted the df/wmulxj-sparse-cuda branch July 17, 2025 05:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants