
Description
Hello!
I am trying to pretrain an adapter using the 4_pretrain_adapter.sh script.
I have a GeForce RTX 2080 SUPER installed (~8GB VRAM), with NVIDIA Driver Version 440.33.01, CUDA Version 10.2, and tensorflow-gpu 1.15.5.
I set CUDA_VISIBLE_DEVICES to 0 in the 4_pretrain_adapter.sh script, since I only have a single GPU.
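As a sanity check, one can confirm that TensorFlow actually sees the GPU with a minimal snippet like the following (this is just my own check, not part of the repo):

```python
# Minimal sanity check: confirm that TF 1.15 sees the GPU at all.
import os
os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # same setting as in 4_pretrain_adapter.sh

import tensorflow as tf
from tensorflow.python.client import device_lib

# Should print True and list a /device:GPU:0 entry if CUDA is set up correctly.
print(tf.test.is_gpu_available())
print([d.name for d in device_lib.list_local_devices()])
```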
Pretraining has been running for 12-16 hours now and is only just completing the warmup phase (~10,000 steps).
I noticed that pretraining uses only ~115MB of VRAM, while several CPU threads drive CPU usage up to ~100%.
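To see where ops actually end up, one diagnostic is to enable device placement logging; the standalone sketch below shows the idea, and a similar tf.ConfigProto could presumably be passed to the Estimator through the session_config argument of its run config:

```python
# Standalone sketch: log op device placement to check GPU vs. CPU execution.
import tensorflow as tf

config = tf.ConfigProto(log_device_placement=True)
with tf.Session(config=config) as sess:
    a = tf.constant([1.0, 2.0], name="a")
    b = tf.constant([3.0, 4.0], name="b")
    # The log should map each op to /device:GPU:0 if the GPU is being used.
    print(sess.run(a + b))
```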
I started perusing the code for GPU usage options/parameters, but so far have only found a switch for TPU usage and a comment stipulating that, if a TPU is not available, the Estimator (tf.contrib.tpu.TPUEstimator) will fall back on CPU or GPU.
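If I read that comment correctly, the fallback is controlled by the use_tpu flag. Below is my understanding of the CPU/GPU path as a minimal, self-contained sketch; the model_fn, input_fn, and model_dir are trivial placeholders, not the repo's actual adapter code:

```python
# Hypothetical minimal TPUEstimator setup with use_tpu=False; the model and
# input functions below are dummies, only the fallback wiring is the point.
import tensorflow as tf

def model_fn(features, labels, mode, params):
    loss = tf.reduce_mean(tf.square(features["x"]))
    train_op = tf.train.GradientDescentOptimizer(0.01).minimize(
        loss, global_step=tf.train.get_global_step())
    return tf.contrib.tpu.TPUEstimatorSpec(mode=mode, loss=loss, train_op=train_op)

def input_fn(params):
    # TPUEstimator injects the per-shard batch size via params["batch_size"].
    ds = tf.data.Dataset.from_tensors({"x": tf.random.normal([4])}).repeat()
    return ds.batch(params["batch_size"], drop_remainder=True)

estimator = tf.contrib.tpu.TPUEstimator(
    model_fn=model_fn,
    config=tf.contrib.tpu.RunConfig(model_dir="/tmp/tpuest_demo"),  # placeholder dir
    use_tpu=False,            # the switch that falls back to local CPU/GPU
    train_batch_size=4,
)
estimator.train(input_fn, max_steps=10)
```

If the script already runs with use_tpu=False, then GPU placement should in principle be automatic, which is why the ~115MB VRAM usage puzzles me.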
I then looked at the official TensorFlow documentation for TPUEstimator, but had no luck there either.
As I continue to look into this, I was wondering if you could share some tips or advice about running the code locally on a single GPU.