Skip to content

Restrict Register Per Thread in CUDA #4574

Answered by WeiqunZhang
YqWangcfd asked this question in Q&A
Discussion options

You must be logged in to vote

I thought maxrregcount is a hard ceiling for nvcc. But it is not. https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#maxrregcount-amount-maxrregcount

A value less than the minimum registers required by ABI will be bumped up by the compiler to ABI minimum limit.

I guess for some of the kernels it's impossible to run without bumping up the register counts. Then there is probably nothing you can do.

Replies: 4 comments 10 replies

Comment options

You must be logged in to vote
2 replies
@WeiqunZhang
Comment options

@YqWangcfd
Comment options

Comment options

You must be logged in to vote
4 replies
@WeiqunZhang
Comment options

@YqWangcfd
Comment options

@WeiqunZhang
Comment options

@WeiqunZhang
Comment options

Comment options

You must be logged in to vote
4 replies
@WeiqunZhang
Comment options

@YqWangcfd
Comment options

@WeiqunZhang
Comment options

@YqWangcfd
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by YqWangcfd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants