Understanding the performance implications of using multiple GPUs (zkVM newbie) #2371

ceyhunalp · 2025-07-17T10:20:59Z

ceyhunalp
Jul 17, 2025

Component

sp1-zkvm

Describe the feature you would like

I have recently started exploring the zkVM ecosystem and my main focus has been experimenting with RSP to better understand the performance of SP1. If I understand correctly, the out-of-box proving software does not use multiple GPUs on a single machine, which means it is not possible to quantify the performance improvement in proving due to GPU parallelization without setting up a cluster.

I was wondering if Succinct has any numbers on the parallelizability trade-off? Basically, what is the marginal benefit of increasing the number of discrete GPUs in the cluster, i.e., how much speed-up does SP1 get from doubling/tripling/quadrupling/etc. the number of GPUs in the cluster? I am assuming there will be some communication and coordination overhead due to adding more GPUs to the cluster. I am also wondering if there are any unparallelizable portions of the proving process that would not benefit from the additional GPUs. If you have any measurements, even if they are rough estimates, I would love to hear them.

Additional context

No response

nhtyy · 2025-07-17T17:33:16Z

nhtyy
Jul 17, 2025
Maintainer

Moved from issues to discussion\

0 replies

nhtyy · 2025-07-17T17:37:16Z

nhtyy
Jul 17, 2025
Maintainer

I dont have any numbers off the top of my head but the way to think about it is that the initial execution, and to some degree even doing trace generation is nonparallelizable.

In practice, and you can see this in the executor, we will "checkpoint" a very light weight version of the execution, which we can then use to start the execution at "arbitrary" spot in parallel.

From this each "checkpoint" reexecutes and collects trace data. Pretty much everything after this is parallelizable.

0 replies

quietBlockchain · 2025-07-22T13:12:46Z

quietBlockchain
Jul 22, 2025

Thanks for raising this, I was also curious about SP1’s scalability across multiple GPUs.

It would be really helpful to see any benchmarks or performance scaling results the team might have internally. Especially interested in where the diminishing returns start to kick in as more GPUs are added. If some components of the proof system are inherently sequential, knowing that would help set realistic expectations when designing a prover setup.

Looking forward to hearing the team’s insights!

0 replies

ceyhunalp · 2025-07-23T09:30:14Z

ceyhunalp
Jul 23, 2025
Author

Exactly! I think the community could benefit greatly from having access to performance results from multi-GPU benchmarks. These benchmarks would not only provide a baseline for comparing prover performance, but also help people make more informed economic decisions. At the end of the day, the ultimate goal for provers is profitability. Since adding an extra GPU to a cluster introduces new costs, such as purchasing/renting the hardware, increased electricity consumption, and ongoing maintenance, it is critical to understand at what point adding another GPU stops being financially beneficial.

1 reply

quietBlockchain Jul 23, 2025

Absolutely agree the economic side is just as important as the technical. Benchmark data would be super helpful in understanding when scaling stops being worth the cost. Hope the team shares something on this soon.

koxu1996 · 2025-07-25T07:33:41Z

koxu1996
Jul 25, 2025

Straight answer: SP1's GPU acceleration is closed-source, and zkVM itself does NOT support parallel proving.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Understanding the performance implications of using multiple GPUs (zkVM newbie) #2371

Uh oh!

{{title}}

Uh oh!

Replies: 5 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Understanding the performance implications of using multiple GPUs (zkVM newbie) #2371

Uh oh!

ceyhunalp Jul 17, 2025

Component

Describe the feature you would like

Additional context

Replies: 5 comments · 1 reply

Uh oh!

nhtyy Jul 17, 2025 Maintainer

Uh oh!

Uh oh!

nhtyy Jul 17, 2025 Maintainer

Uh oh!

quietBlockchain Jul 22, 2025

Uh oh!

ceyhunalp Jul 23, 2025 Author

Uh oh!

quietBlockchain Jul 23, 2025

Uh oh!

koxu1996 Jul 25, 2025

ceyhunalp
Jul 17, 2025

Replies: 5 comments 1 reply

nhtyy
Jul 17, 2025
Maintainer

nhtyy
Jul 17, 2025
Maintainer

quietBlockchain
Jul 22, 2025

ceyhunalp
Jul 23, 2025
Author

koxu1996
Jul 25, 2025