Skip to content
This repository was archived by the owner on Jul 10, 2024. It is now read-only.

Commit 7bbc1d2

Browse files
committed
rename docker image tags
1 parent eb2017f commit 7bbc1d2

File tree

4 files changed

+19
-19
lines changed

4 files changed

+19
-19
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -122,7 +122,7 @@ echo $LOCAL_DIR
122122
echo $MODEL_NAME
123123
```
124124
**Important Notes**:
125-
1. Users need to have adequate write permissions to the folders on NFS. IT policies of NFS may prvent you from mounting volumes on NFS to containers. Please refer to the [Troubleshooting](#troubleshooting) section if you ever ran into "permission denied" error when running the pipelines.
125+
1. Users need to have adequate write permissions to the folders on NFS. IT policies of NFS may prevent you from mounting volumes on NFS to containers. Please refer to the [Troubleshooting](#troubleshooting) section if you ever ran into "permission denied" error when running the pipelines.
126126
2. It is important to set up the $LOCAL_DIR variable. Make sure you have it set up.
127127
3. You may want to double check your http and https proxies on your machines to make sure you can download models from external internet, more specifically, from Hugging Face model hub and from PaddleOCR.
128128

@@ -213,8 +213,8 @@ docker compose build
213213
OR
214214

215215
```bash
216-
docker pull intel/ai-workflows:beta-doc-automation-fine-tuning
217-
docker pull intel/ai-workflows:beta-doc-automation-indexing
216+
docker pull intel/ai-workflows:doc-automation-fine-tuning
217+
docker pull intel/ai-workflows:doc-automation-indexing
218218
```
219219

220220
### Run Single-Node Preprocessing Pipeline
@@ -670,7 +670,7 @@ docker run -a stdout ${DOCKER_RUN_ENVS} \
670670
-v ${SAVEPATH}:/home/user/output/processed_data \
671671
--privileged --network host --init -it --rm --pull always \
672672
-w /home/user/application \
673-
intel/ai-workflows:beta-doc-automation-fine-tuning \
673+
intel/ai-workflows:doc-automation-fine-tuning \
674674
/bin/bash
675675
```
676676
Then you will be taken inside the container and you can run the command below for pre-processing:
@@ -687,7 +687,7 @@ To read about other use cases and workflows examples, see these resources:
687687

688688

689689
## Troubleshooting
690-
1. If you got "permissions denied" error to write to the output folder in the fine-tuning pipeline, it is very likely that you do not have adequate write permissions to write to the folder on NFS. You can use `chmod` to change the permissions of the output folder. If you cannot change the permissions, you can try to set up the work directory on a local disk where you have adequate write permissions.
690+
1. If you got "permissions denied" error to write to the output folder in the preprocessing or fine-tuning pipeline, it is very likely that you do not have adequate write permissions to write to the folder on NFS. You can use `chmod` to change the permissions of the output folder. If you cannot change the permissions, you can try to set up the work directory on a local disk where you have adequate write permissions.
691691
2. If you got errors about not being able to write to databases,or if either postgresql or elasticsearch container did not get started with docker-compose commands, the errors are likely due to the postgresql container user and the elasticsearch container user being different, and they have different/inadequate permissions to write to the database directory that you have set up on your machine. Change the write permissions of the `$LOCAL_DIR` with `chmod` commands so that both postgresql and elasticsearch containers can write to it.
692692
3. If you got error from docker daemon when running distributed indexing pipeline that "mkdir permission denied", it is due to NFS policy not allowing mounting folders on NFS to docker containers. Contact your IT to get permission or change to an NFS that allows container volume mounting.
693693
4. If you got out of memory (OOM) error from Ray actors, or the ray process got stuck for a very long time, try reducing the number of actors and increasing the number of CPUs per actor. As a rule of thumb, max_num_ray_actors * num_cpus_per_actor should not exceed the total number of threads of your CPU. For example, you have 48 CPU cores in your system and have hyperthreading turned on, in this case you have in total 48 * 2 = 96 threads, max_num_ray_actors * num_cpus_per_actor should not exceed 96.

chart/templates/workflowTemplate.yaml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -12,15 +12,15 @@ spec:
1212
arguments:
1313
parameters:
1414
- name: workflow
15-
value: beta-doc-automation-fine-tuning
15+
value: doc-automation-fine-tuning
1616
- name: cmd
1717
value: 'bash scripts/run_process_dataset_argo.sh'
1818
- name: fine-tuning
1919
template: fine-tuning-phase
2020
arguments:
2121
parameters:
2222
- name: workflow
23-
value: beta-doc-automation-fine-tuning
23+
value: doc-automation-fine-tuning
2424
- name: cmd
2525
value: 'bash scripts/run_dpr_training.sh'
2626
dependencies:
@@ -38,7 +38,7 @@ spec:
3838
arguments:
3939
parameters:
4040
- name: workflow
41-
value: beta-doc-automation-indexing
41+
value: doc-automation-indexing
4242
- name: postgre_ip
4343
value: "{{ `{{tasks.postgresql-db.ip}}` }}"
4444
- name: es_ip
@@ -54,7 +54,7 @@ spec:
5454
arguments:
5555
parameters:
5656
- name: workflow
57-
value: beta-doc-automation-indexing
57+
value: doc-automation-indexing
5858
- name: postgre_ip
5959
value: "{{ `{{tasks.postgresql-db.ip}}` }}"
6060
- name: es_ip
@@ -72,7 +72,7 @@ spec:
7272
- name: postgre_ip
7373
value: "{{ `{{tasks.postgresql-db.ip}}` }}"
7474
- name: workflow
75-
value: beta-doc-automation-indexing
75+
value: doc-automation-indexing
7676
dependencies:
7777
- performance-retrieval
7878
- name: ui

docker/docker-compose.yml

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ services:
1111
no_proxy: ${no_proxy}
1212
context: ../../document-automation
1313
dockerfile: docker/Dockerfile.fine-tuning
14-
image: intel/ai-workflows:beta-doc-automation-fine-tuning
14+
image: intel/ai-workflows:doc-automation-fine-tuning
1515
network_mode: "host"
1616
privileged: true
1717
command: sh -c "bash scripts/run_process_dataset.sh"
@@ -26,7 +26,7 @@ services:
2626
working_dir: /home/user/application
2727

2828
fine-tuning:
29-
image: intel/ai-workflows:beta-doc-automation-fine-tuning
29+
image: intel/ai-workflows:doc-automation-fine-tuning
3030
network_mode: "host"
3131
privileged: true
3232
command: sh -c "bash scripts/run_dpr_training.sh"
@@ -76,7 +76,7 @@ services:
7676
no_proxy: ${no_proxy}
7777
dockerfile: docker/Dockerfile.indexing
7878
context: ../../document-automation
79-
image: intel/ai-workflows:beta-doc-automation-indexing
79+
image: intel/ai-workflows:doc-automation-indexing
8080
privileged: true
8181
network_mode: "host"
8282
command: sh -c "ray start --node-ip-address=${HEAD_IP} --head --dashboard-host='0.0.0.0' --dashboard-port=8265 --disable-usage-stats && \ python src/test_pocr.py && \ bash scripts/run_distributed_indexing.sh"
@@ -100,7 +100,7 @@ services:
100100
working_dir: /home/user/application
101101

102102
performance-retrieval:
103-
image: intel/ai-workflows:beta-doc-automation-indexing
103+
image: intel/ai-workflows:doc-automation-indexing
104104
privileged: true
105105
depends_on:
106106
- indexing
@@ -119,7 +119,7 @@ services:
119119

120120
dev:
121121
command: sh -c "${WORKFLOW:-bash scripts/run_dpr_training.sh}"
122-
image: intel/ai-workflows:${TAG:-beta-doc-automation-fine-tuning}
122+
image: intel/ai-workflows:${TAG:-doc-automation-fine-tuning}
123123
environment:
124124
- MODEL_NAME=${MODEL_NAME:-my_dpr_model}
125125
- http_proxy=${http_proxy}
@@ -145,7 +145,7 @@ services:
145145
no_proxy: ${no_proxy}
146146
dockerfile: docker/Dockerfile.indexing
147147
context: ../../document-automation
148-
image: intel/ai-workflows:beta-doc-automation-indexing
148+
image: intel/ai-workflows:doc-automation-indexing
149149
privileged: true
150150
network_mode: "host"
151151
command: sh -c "ray start --node-ip-address=${HEAD_IP} --head --dashboard-host='0.0.0.0' --dashboard-port=8265 --disable-usage-stats && \ python src/test_pocr.py && \ bash scripts/run_distributed_indexing.sh && \ bash scripts/make_retrieval_eval_csv.sh && \ bash scripts/run_retrieval_eval.sh"

scripts/run-ray-cluster.sh

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -209,7 +209,7 @@ if [[ $run_type = "startup_all" ]]; then
209209
docker run -itd -p 8265:8265 --cap-add=NET_ADMIN --network host -v ${workspace}:/home/user/application -v ${dataset}:/home/user/dataset -v ${output}:/home/user/output -v ${docvqa}:/home/user/docvqa \
210210
--cpuset-cpus=${ray_head_cores_range} \
211211
--env HEAD_IP=$HEAD_IP --env http_proxy=$http_proxy --env https_proxy=$https_proxy \
212-
--shm-size=64gb --name $head_name intel/ai-workflows:beta-doc-automation-indexing /bin/bash & #intel/ai-workflows:odqa-haystack-api
212+
--shm-size=64gb --name $head_name intel/ai-workflows:doc-automation-indexing /bin/bash & #intel/ai-workflows:odqa-haystack-api
213213
sleep 5
214214
#docker exec -d $head_name /bin/bash -c "ip link del dev eth1"
215215
docker exec -d $head_name /bin/bash -c "ray start --node-ip-address=${head_address} --head --dashboard-host='0.0.0.0' --dashboard-port=8265"
@@ -227,7 +227,7 @@ if [[ $run_type = "startup_all" ]]; then
227227
docker run -itd --cpuset-cpus=${ray_worker_cores_range} --cap-add=NET_ADMIN --network host -v ${workspace}:/home/user/application -v ${dataset}:/home/user/dataset -v ${output}:/home/user/output \
228228
--env HEAD_IP=$HEAD_IP --env http_proxy=$http_proxy --env https_proxy=$https_proxy \
229229
--shm-size=2gb \
230-
--name $worker_name intel/ai-workflows:beta-doc-automation-indexing /bin/bash &
230+
--name $worker_name intel/ai-workflows:doc-automation-indexing /bin/bash &
231231
sleep 5
232232
#docker exec -d $worker_name /bin/bash -c "ip link del dev eth1"
233233
docker exec -d $worker_name /bin/bash -c "ray start --address=$head_address"
@@ -251,7 +251,7 @@ elif [[ $run_type = "startup_workers" ]]; then
251251
docker run -itd --cpuset-cpus=${ray_worker_cores_range} --cap-add=NET_ADMIN --network host -v ${workspace}:/home/user/application -v ${dataset}:/home/user/dataset -v ${output}:/home/user/output\
252252
--env HEAD_IP=$HEAD_IP --env http_proxy=$http_proxy --env https_proxy=$https_proxy \
253253
--shm-size=2gb \
254-
--name $worker_name intel/ai-workflows:beta-doc-automation-indexing /bin/bash &
254+
--name $worker_name intel/ai-workflows:doc-automation-indexing /bin/bash &
255255
sleep 5
256256
#docker exec -d $worker_name /bin/bash -c "ip link del dev eth1"
257257
docker exec -d $worker_name /bin/bash -c "ray start --address=$head_address"

0 commit comments

Comments
 (0)