-
Notifications
You must be signed in to change notification settings - Fork 2
Closed
Description
Hi,
It's an intriguing approach. I see you take over similar, basically "malloc" for Julia... and you reduce allocations, but the timing is still worse. Is that maybe not in general, since that's the whole point?
I like Bumper, since there are no allocations, basically, so I'm confused why you have any at all? Probably for the output array.
Taking over similar seems clever, should it ideally be done too for zeros, ones (and fill?) or does it in fact happen implicitly? I'm thinking though, should it happen in all cases, and what you do is I think so-called type-piracy, so it's all or nothing?
I suppose this does not work for GPUs despite me seeing something Flux related. And neither Bumper.jl... but do you think it would be plausible?
Metadata
Metadata
Assignees
Labels
No labels