[CV2-6066] 4X workerprocesses from 16 to 64 ... #497

skyemeedan · 2025-03-03T21:43:45Z

Adjust the gunicorn configuration to 4X worker processes from 16 to 64 and also adding max-requests and jitter to restart process to reduce impact of memory growth

https://docs.gunicorn.org/en/stable/settings.html

--max-requests INT
Any value greater than zero will limit the number of requests a worker will process before automatically restarting. This is a simple method to help limit the damage of memory leaks.

--max_requests_jitter INT
The maximum jitter to add to the max_requests setting.The jitter causes the restart per worker to be randomized by randint(0, max_requests_jitter). This is intended to stagger worker restarts to avoid all workers restarting at the same time.

also update CODEOWNERS

Reference: https://meedan.atlassian.net/browse/CV2-6066

…ter to restart process to reduce impact of memory growth

dmou

ooh big jump. what's the hypothetical impact of this change? is it greater request concurrency? decreased average memory usage?

skyemeedan · 2025-03-03T22:16:18Z

ooh big jump. what's the hypothetical impact of this change? is it greater request concurrency? decreased average memory usage?

when running with previous settings (16 processes) and a less aggressive test (25 parallel connections vs 100) it seemed like it was finally able to handle the load without the task being marked unhealthy, but still saw some errors. after two days, it is using 80% memory. I'm expecting that this 4X increase will make OOM very likely (perhaps mitigated by the other args) and if we see that, will back it off again.

…ilures

Skye Bender-deMoll added 2 commits March 3, 2025 13:37

[CV2-6066] 4X workprocesses from 16 to 64 adding max-requests and jit…

0498eb6

…ter to restart process to reduce impact of memory growth

update codeowners

28457b3

skyemeedan requested review from computermacgyver and dmou as code owners March 3, 2025 21:43

skyemeedan changed the title ~~[CV2-6066] 4X workerprocesses from 16 to 64 adding max-requests and jit…~~ [CV2-6066] 4X workerprocesses from 16 to 64 ... Mar 3, 2025

dmou approved these changes Mar 3, 2025

View reviewed changes

updated live openai test to only compare vector lenths to fix test fa…

d36101e

…ilures

skyemeedan merged commit d4ea765 into develop Mar 3, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CV2-6066] 4X workerprocesses from 16 to 64 ... #497

[CV2-6066] 4X workerprocesses from 16 to 64 ... #497

Uh oh!

skyemeedan commented Mar 3, 2025

Uh oh!

dmou left a comment

Uh oh!

skyemeedan commented Mar 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[CV2-6066] 4X workerprocesses from 16 to 64 ... #497

[CV2-6066] 4X workerprocesses from 16 to 64 ... #497

Uh oh!

Conversation

skyemeedan commented Mar 3, 2025

Uh oh!

dmou left a comment

Choose a reason for hiding this comment

Uh oh!

skyemeedan commented Mar 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants