Closed
Description
It seems the style used here is being deprecated and won't work with Flux 0.14:
Line 37 in 452c09d
edit After discussion below, I suggest we wait on
and refactor to use a optimiser-based solution to weight regularisation, which will avoid current limitations of explicit differentiation outlined in the discussion. Note, this will likely mean the reported training_loss
must change, as it will no longer include the weight penalty. So this will be breaking.