SneaksAndData/nexus-supervisor

Nexus Supervisor

Nexus Supervisor is an essential component of a Nexus deployment. It sets the run result and trace information for jobs that fail for infrastructure reasons, such as running out of memory or an image pull backoff. It is deployed in the same cluster where algorithm jobs are spawned and requires access to the checkpoint store.
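
To make the idea concrete, here is a minimal sketch of how a failed job's container state could be mapped to a run result and a trace message. It is an illustration only, not the Nexus Supervisor implementation: `ContainerState`, `RunResult`, `classify`, and the `"FAILED"` status value are hypothetical names introduced for this example. The reason strings `OOMKilled`, `ImagePullBackOff`, and `ErrImagePull` are the values Kubernetes reports in container statuses.

```python
from dataclasses import dataclass
from typing import Optional

# Reason strings reported by Kubernetes in a container's status.
OOM_KILLED = "OOMKilled"
IMAGE_PULL_REASONS = {"ImagePullBackOff", "ErrImagePull"}


@dataclass
class ContainerState:
    """Hypothetical, simplified view of a container's state."""
    waiting_reason: Optional[str] = None
    terminated_reason: Optional[str] = None
    exit_code: Optional[int] = None


@dataclass
class RunResult:
    """Hypothetical record a supervisor could write to a checkpoint store."""
    status: str
    trace: str


def classify(state: ContainerState) -> Optional[RunResult]:
    """Return a failure result for known infrastructure issues, else None."""
    if state.terminated_reason == OOM_KILLED:
        return RunResult(
            status="FAILED",  # assumed status value, for illustration only
            trace=f"Container ran out of memory (exit code {state.exit_code}).",
        )
    if state.waiting_reason in IMAGE_PULL_REASONS:
        return RunResult(
            status="FAILED",
            trace=f"Image could not be pulled ({state.waiting_reason}).",
        )
    # Nothing recognised: leave the run result untouched.
    return None


if __name__ == "__main__":
    print(classify(ContainerState(terminated_reason="OOMKilled", exit_code=137)))
    print(classify(ContainerState(waiting_reason="ImagePullBackOff")))
```

Returning `None` for unrecognised states keeps the sketch conservative: it only reports failures it can name and does not guess at the cause of anything else.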

Quickstart

-- TBD --
