CUDA copy_xj propagate sparse support and benchmarks #605

dferre97 · 2025-07-03T15:50:04Z

Added fast copy_xj propagate CUDA support for sparse graphs using SpMM.
Added benchmarks to compare with gather/scatter approach, speedup of 100-200x on average.

GNNlib/ext/GNNlibCUDAExt.jl

CarloLucibello · 2025-07-04T07:40:02Z

GNNlib/src/msgpass.jl

@@ -213,7 +213,7 @@ end
 ## COPY_XJ 

 function propagate(::typeof(copy_xj), g::GNNGraph, ::typeof(+), xi, xj::AbstractMatrix, e)
-    A = adjacency_matrix(g, weighted = false)
+    A = adjacency_matrix(g, eltype(xj); weighted = false)


Is the cast to the xj type necessary?

Yes, for the sparse case, because the SpMM mm! function in CUSPASE expects the type of the adjmat and the feature matrix to be the same, so we need to cast the adjmat before multiplying.

dferre97 added 2 commits July 3, 2025 09:21

Added propagate copy_xj CUDA sparse support using matrix mul

56f9c5e

Add benchmarks for CUDA sparse propagate copy_xj

f6ff8e3

CarloLucibello reviewed Jul 4, 2025

View reviewed changes

Keep old gather/scatter implementation for COO_T

4d9ddac

CarloLucibello merged commit 0221593 into JuliaGraphs:master Jul 6, 2025
5 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA copy_xj propagate sparse support and benchmarks #605

CUDA copy_xj propagate sparse support and benchmarks #605

Uh oh!

dferre97 commented Jul 3, 2025

Uh oh!

Uh oh!

CarloLucibello Jul 4, 2025

Uh oh!

dferre97 Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!

CUDA copy_xj propagate sparse support and benchmarks #605

CUDA copy_xj propagate sparse support and benchmarks #605

Uh oh!

Conversation

dferre97 commented Jul 3, 2025

Uh oh!

Uh oh!

CarloLucibello Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

dferre97 Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!