Just to reiterate from our (me, @PiotrZierhoffer and @proppy) short email discussion:
While working on some additional notebooks, we've started getting hit by an issue where subsequent pushes to the rad-lab-deploy repository would not trigger a code update on the already running JupyterLab instance.
In other words, the code located in examples is only ever used during the initial setup phase: the secondary build uploads these files to a staging bucket, and the script generated in the notebooks-build step is attached to the notebook instance as a post-startup script.
As per the official documentation, this script runs only once – upon the first boot of the newly created notebook instance. This creates an inconvenient situation where the developer is given the false impression that updating the repository will result in the code being updated on the notebook. This obviously doesn't happen, as the code gets synced only once – during the initial deployment.
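For context, the sync the post-startup script performs amounts to a one-shot copy from the staging bucket onto the instance. A minimal sketch of what such a script could look like – this is not the actual generated script, and the bucket name and target directory below are placeholders:

```sh
#!/bin/bash
# Minimal sketch of a one-shot post-startup sync (placeholders, not the real
# generated script).
STAGING_BUCKET="gs://example-staging-bucket/examples"  # placeholder bucket
TARGET_DIR="/home/jupyter/examples"                    # placeholder path

mkdir -p "${TARGET_DIR}"
# Mirror the staged notebook files onto the instance; when attached as a
# post-startup script, this runs only on the first boot.
gsutil -m rsync -r "${STAGING_BUCKET}" "${TARGET_DIR}"
```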
Our development workflow mainly consists of prototyping in JupyterLab, committing changes back to git, and prototyping again starting from the copy derived from git. This way we can ensure reproducibility, in the sense that somebody can grab the repository, deploy the code to their project, and start working on it.
It seems that an early attempt to address this issue has already been proposed in #5. Personally, I quite like this approach!
So far I see the following possible ways to address this:
- make the post-startup script run upon every boot. This would of course require restarting the notebook instance upon each push, but that could either be handled by Terraform or simply added as another step in the secondary build pipeline (see the sketch after this list).
- add an explicit step that would SSH into the notebook instance and run the bucket synchronization script.
- add gcsfuse for staging directory #5
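To illustrate the first two options, here is a rough sketch of what an extra step at the end of the secondary build could look like. The instance name, location, and script path are assumptions, and the exact gcloud invocations may need adjusting:

```sh
# Hypothetical final step of the secondary build pipeline (placeholders only).
INSTANCE_NAME="rad-lab-notebook"   # placeholder instance name
LOCATION="us-central1-a"           # placeholder zone/location

# Option 1: reboot the notebook instance so a boot-time sync runs again
# (assumes the script has been made to run on every boot).
gcloud notebooks instances reset "${INSTANCE_NAME}" --location="${LOCATION}"

# Option 2 (alternative): SSH into the instance and run the sync script directly.
# gcloud compute ssh "${INSTANCE_NAME}" --zone="${LOCATION}" \
#   --command="sudo bash /path/to/bucket-sync.sh"   # placeholder script path
```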