Skip to content

Issues with CUDA #1822

Answered by Fabian188
Fabian188 asked this question in Q&A
Apr 4, 2025 · 1 comments · 7 replies
Discussion options

You must be logged in to vote

The code in my first post was from the mixed-precision-ir.cpp example.

I now have

auto x = gko::matrix::Dense<GK_T>::create(exec->get_master(), gko::dim<2>(rhs.GetSize(), 1));
for(unsigned int i = 0; i < rhs.GetSize(); i++)
    x->at(i) = ...
solver->apply(b, x);

where only x is generated on exec->get_master(), csr and b are pure exec, which is cuda in this case. It seems that x is transferred from host to cuda implicitly.

With your help and more investigations I understood the concept and the meaning of exec->get_master() better, thanks a lot!

Replies: 1 comment 7 replies

Comment options

You must be logged in to vote
7 replies
@Fabian188
Comment options

@Fabian188
Comment options

@yhmtsai
Comment options

yhmtsai Apr 4, 2025
Collaborator

@Fabian188
Comment options

Answer selected by Fabian188
@yhmtsai
Comment options

yhmtsai Apr 7, 2025
Collaborator

@Fabian188
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants