
Stop using implicit style differentiating #221

Closed
@ablaom

Description


It seems the style used here is being deprecated and won't work with Flux 0.14:

gs = Flux.gradient(parameters) do
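For comparison, here is a minimal sketch of the explicit style, in which the gradient is taken with respect to the model itself rather than an implicit parameter collection. The model, data, and `loss` function are hypothetical placeholders and are not taken from this package's code.

```julia
using Flux

# Hypothetical toy model and data, for illustration only.
model = Chain(Dense(3 => 2, relu), Dense(2 => 1))
x = rand(Float32, 3, 8)
y = rand(Float32, 1, 8)

# The loss takes the model as an explicit argument.
loss(m, x, y) = Flux.mse(m(x), y)

# Explicit style: differentiate with respect to the model.
# The result is a tuple; its first entry mirrors the model's structure.
grads = Flux.gradient(m -> loss(m, x, y), model)

# Optimiser state is set up once and then updated alongside the model.
opt_state = Flux.setup(Flux.Adam(0.01), model)
Flux.update!(opt_state, model, grads[1])
```

Here `grads[1]` is a nested named tuple matching the layers of `model`, so no separate parameter collection needs to be passed around.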


Edit: After discussion below, I suggest we wait on

and refactor to use an optimiser-based solution to weight regularisation, which will avoid the current limitations of explicit differentiation outlined in the discussion. Note that this will likely mean the reported training_loss must change, as it will no longer include the weight penalty. So this will be a breaking change.
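As a rough illustration of what an optimiser-based approach to weight regularisation can look like, the sketch below chains a weight-decay rule with Adam using Optimisers.jl. This is an assumption about the general technique, not the refactor eventually adopted here; the model, data, and hyperparameter values are hypothetical.

```julia
using Flux
import Optimisers

# Hypothetical toy model and data, for illustration only.
model = Chain(Dense(3 => 2, relu), Dense(2 => 1))
x = rand(Float32, 3, 8)
y = rand(Float32, 1, 8)

# The loss contains no penalty term; regularisation lives in the optimiser.
loss(m, x, y) = Flux.mse(m(x), y)

# WeightDecay adds λ * w to each gradient before Adam sees it,
# which corresponds to an L2 penalty on the weights.
rule = Optimisers.OptimiserChain(Optimisers.WeightDecay(1f-4), Optimisers.Adam(1f-3))
state = Optimisers.setup(rule, model)

loss_before = loss(model, x, y)
grads = Flux.gradient(m -> loss(m, x, y), model)

# update! returns the new optimiser state and the updated model.
state, model = Optimisers.update!(state, model, grads[1])
loss_after = loss(model, x, y)
```

Because the penalty is applied inside the optimiser, `loss_before` and `loss_after` measure only the data-fitting term, which is why a reported training loss would no longer include the weight penalty.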
