jupyterhub-aws-spawner

Custom spawner for launching and organizing EC2 instances with JupyterHub, using boto3 and tornado.

So far:

Scaling instance size via options form
Keeping track of EBS volume even if instance is terminated
Attaching and mounting custom EBS volumes
Create and attach new volume from snapshot
Assignment of IAM roles (via DB entry)

Todo:

Hardening and testing (Data integrity first)
Automatically mounting S3 buckets via s3fs and IAM
Make the options form more polished and user-friendly

Long term todo:

S3 bucket lookup and selection in options
Handling multiple volumes per user
Share volumes with other users
Multiple instances per user
Decouple from AWS -> jupyterhub-cloud-spawner

General setup

It's strongly recommended to set up an AWS VPC configuration according to cloudJhub and to configure your server_config.json as shown below. This repo is built upon the spawner from cloudJhub, and the instructions there are good guidance for creating a running environment.

{
  "JUPYTER_MANAGER_IP": "",
  "MANAGER_IP_ADDRESS": "",
  "SERVER_USERNAME": "",
  "JUPYTER_CLUSTER": "",
  "WORKER_USERNAME": "",
  "REGION": "",
  "SUBNET_ID": "",
  "WORKER_EBS_SIZE": ,
  "WORKER_SERVER_OWNER": "",
  "AVAILABILITY_ZONE": " {a,b,c}",
  "WORKER_SERVER_NAME": "",
  "USER_HOME_EBS_SIZE": ,
  "KEY_NAME": "jupyter_key.pem",
  "WORKER_AMI": "",
  "WORKER_SECURITY_GROUPS": [""],
  "JUPYTER_NOTEBOOK_TIMEOUT": 3600,
  "INSTANCE_TYPE": ""
}
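The template leaves most values empty, and the two EBS sizes have no value at all, so the file is not valid JSON until they are filled in. A minimal sketch of a sanity check that could be run before starting the hub is shown here; `load_server_config` and `find_unset_keys` are hypothetical helper names and are not part of this repo, and treating every empty value as an error is an assumption.

```python
import json

# Values that count as "not filled in yet" in the template.
EMPTY_VALUES = ("", None, [], [""])


def find_unset_keys(config):
    """Return the sorted names of all keys that still hold a placeholder value."""
    return sorted(key for key, value in config.items() if value in EMPTY_VALUES)


def load_server_config(path="server_config.json"):
    """Load the config and fail early if any entry is still empty.

    json.load raises json.JSONDecodeError while the numeric fields
    (WORKER_EBS_SIZE, USER_HOME_EBS_SIZE) are left blank.
    """
    with open(path) as fh:
        config = json.load(fh)
    unset = find_unset_keys(config)
    if unset:
        raise ValueError("server_config.json has unset keys: " + ", ".join(unset))
    return config
```

Calling `load_server_config()` from the directory that holds the file either returns the parsed dictionary or names the entries that still need a value.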

Next, create a bastion_info.json to ssh-jump to the server in the subnet. Use the key from the configuration above.

Note: To make jumpssh work, set StrictHostKeyChecking no on the bastion server for the worker subnet; otherwise the jumper will get stuck.

{
  "bastion" : "example.bastion.com",
  "key_path" : "/path/to/.ssh/jupyter_key.pem",
  "user": "hostuser"
}
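As a rough illustration of how these three fields map onto a jump connection, the sketch below reads bastion_info.json and opens a session to a worker through the bastion with the jumpssh package. The helper names, the worker address, and the worker user are hypothetical, and the spawner's own connection code may look different.

```python
import json


def load_bastion_info(path="bastion_info.json"):
    """Read the bastion settings and check that all three fields are set."""
    with open(path) as fh:
        info = json.load(fh)
    missing = [key for key in ("bastion", "key_path", "user") if not info.get(key)]
    if missing:
        raise ValueError("bastion_info.json is missing: " + ", ".join(missing))
    return info


def open_worker_session(info, worker_ip, worker_user):
    """Open an SSH session to a worker in the subnet via the bastion host.

    Assumes the same private key is accepted by the bastion and the worker.
    """
    # Imported here so the config check above works without jumpssh installed.
    from jumpssh import SSHSession

    gateway = SSHSession(
        info["bastion"], info["user"], private_key_file=info["key_path"]
    ).open()
    return gateway.get_remote_session(
        worker_ip, username=worker_user, private_key_file=info["key_path"]
    )
```

A call such as `open_worker_session(load_bastion_info(), "10.0.0.12", "ubuntu")` (both values are placeholders) returns a session whose `run_cmd` method executes commands on the worker.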

If you want to use an external DB, you can create a db_creds.json. Any database supported by SQLAlchemy should work, but Postgres is recommended by JupyterHub, and this way you can use one DB for both JupyterHub and the spawner's meta information. Otherwise, SQLite can be used.

{
  "database" : "dbname", 
  "user" : "dbuser", 
  "password":"dbpw",
  "host":"dburl", 
  "port":""
}
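To show how these fields could be turned into a connection string, here is a minimal sketch that builds a SQLAlchemy-style database URL from db_creds.json and falls back to SQLite when the file is absent. The function name, the `postgresql` driver prefix, and the `spawner.sqlite` file name are assumptions made for illustration, not values taken from this repo.

```python
import json
import os
from urllib.parse import quote_plus


def database_url(creds_path="db_creds.json", sqlite_path="spawner.sqlite",
                 driver="postgresql"):
    """Return a database URL built from db_creds.json, or a SQLite URL if it is absent."""
    if not os.path.exists(creds_path):
        return f"sqlite:///{sqlite_path}"
    with open(creds_path) as fh:
        creds = json.load(fh)
    # Escape characters such as "@" or ":" that would otherwise break the URL.
    user = quote_plus(creds["user"])
    password = quote_plus(creds["password"])
    # The port is optional; an empty string means "use the driver default".
    port = f":{creds['port']}" if creds.get("port") else ""
    return f"{driver}://{user}:{password}@{creds['host']}{port}/{creds['database']}"
```

The resulting string can be passed to `sqlalchemy.create_engine`, and the same Postgres URL format is accepted by JupyterHub's `c.JupyterHub.db_url` setting.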
