Allow users to run crawls with 1 or 2 browser windows #2627

tw4l · 2025-05-27T21:17:25Z

Fixes #2425

Changed

Switch backend to primarily using number of browser windows rather than scale multiplier (including migration to calculate browserWindows from scale for existing workflows and crawls)
Still support scale in addition to browserWindows in input models for creating and updating workflows and re-adjusting live crawl scale for backwards compatibility
Adds new max_browser_windows value to Helm chart, but calculates the value from max_crawl_scale as fallback for users with that value already set in local charts
Rework frontend to allow users to select multiples of crawler_browser_instances or any value below crawler_browser_instances for browser windows. For instance, with crawler_browser_instances=4 and max_browser_windows=8, the user would be presented with the following options: 1, 2, 3, 4, 8
Sets maximum width of screencast to image width returned by message

backend/btrixcloud/operator/crawls.py

SuaYoo · 2025-05-29T02:09:23Z

@tw4l Updated to set the maximum width to the screencast image width:

…ndows

This will keep scale and browserWindows from getting out of sync when the number of workers per pod on an instance changes.

Comment out unsetting of scale for now for easier testing on dev

…ces, then in multiples through max

- rename pod_count -> scale for consistency - remove debug logging - simplify update_scale to remove cast

Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>

SuaYoo

Looks good testing locally, if it's easy to do we should add a deprecation notice for the scale API field.

ikreymer

Looks good! Nice work, tested on dev with different number of windows!

) - follow up to #2627 - use qa_num_browser_windows to set exact number of QA browsers, fallback to qa_scale - set num_browser_windows and num_browsers_per_pod using crawler / qa values depending if QA crawl - scale_from_browser_windows() accepts optional browsers_per_pod if dealing with possible QA override - store 'desiredScale' in CrawlStatus to avoid recomputing for later scale resolving - ensure status.scale is always the actual scale observed

tw4l commented May 27, 2025

View reviewed changes

backend/btrixcloud/operator/crawls.py Outdated Show resolved Hide resolved

tw4l commented May 27, 2025

View reviewed changes

backend/btrixcloud/operator/crawls.py Outdated Show resolved Hide resolved

tw4l commented May 27, 2025

View reviewed changes

backend/btrixcloud/operator/crawls.py Outdated Show resolved Hide resolved

tw4l and others added 26 commits May 28, 2025 20:04

Modify backend scale to be number of browser windows

8b9d4bd

Fix SettingsReponse model

fa339c0

Update frontend for simplified maxBrowserWindows

54d91f6

Import math

fc1eafb

Operator fixups

d18f500

Handle case where scale < workers per pod

3958bfd

Add pylint comment

a7de12b

Update API settings test

ece1bc7

Fix pylint comment

31f5ade

Use crawl.scale, add lots of debug print logging

5cd019c

Fixups

2561701

Consolidate print logging lines

0396c1c

Remove some debug print logging

6fd850a

Temp: Debug print log sync_crawls return

bf8f1d0

rebase fix

551b18b

work

27d0fc1

switch back to last pod

9e1a86e

fix remainder check

b7f855d

rename priorities to use max_browser_windows

92557ec

Fix screencast window count

1249ead

Rename scale fields to distinguish pods from browser windows

13c85be

Fix linting

c5da074

Undo change to worker index calculation for screenshots

92ed072

Remove unused variable

9e67798

Add separate browserWindows on backend alongside scale

5f2f859

Update frontend to use browserWindows not scale

459e466

ikreymer and others added 16 commits May 28, 2025 20:05

ensure backwards compatible with max_crawl_scale if no max_browser_wi…

9f466b9

…ndows

fix tests

ae50e6e

switch browser windows to text box

844f253

Calculate scale at time of need instead of storing in db

7977977

This will keep scale and browserWindows from getting out of sync when the number of workers per pod on an instance changes.

Update org import for change

c6275aa

More fixups for removing scale from db

eaef86f

Add migration to convert scale to browserWindows in db

bb29cc5

Comment out unsetting of scale for now for easier testing on dev

frontend: custom range for browser windows, by 1 until browser instan…

c9014db

…ces, then in multiples through max

Don't unset scale in migration

5df33de

Store scale in crawl object

4922179

Update scale in crawl model when crawl is live rescaled

4166e32

Add some tests

43551e1

Update expected totals in tests

a9ee784

Remove outdated pylint comment

9fbb99f

set max width

d939625

fix spinner

aab9f5a

SuaYoo force-pushed the issue-2425-browser-windows branch from 8cb422e to aab9f5a Compare May 29, 2025 03:06

ikreymer marked this pull request as ready for review May 29, 2025 03:08

tw4l requested review from ikreymer and SuaYoo May 29, 2025 03:45

SuaYoo requested a review from emma-sg May 29, 2025 04:50

ikreymer and others added 2 commits May 29, 2025 11:33

cleanup:

cee8dc9

- rename pod_count -> scale for consistency - remove debug logging - simplify update_scale to remove cast

Update backend/btrixcloud/operator/crawls.py

435a33e

Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>

SuaYoo reviewed Jun 3, 2025

View reviewed changes

Add deprecated flag to Scale

5e21715

ikreymer approved these changes Jun 3, 2025

View reviewed changes

ikreymer merged commit dc41468 into main Jun 3, 2025
27 checks passed

ikreymer deleted the issue-2425-browser-windows branch June 3, 2025 20:37

ikreymer mentioned this pull request Jun 11, 2025

additional scale / browser window cleanup to properly support QA: #2663

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Allow users to run crawls with 1 or 2 browser windows #2627

Allow users to run crawls with 1 or 2 browser windows #2627

Uh oh!

tw4l commented May 27, 2025 •

edited by SuaYoo

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SuaYoo commented May 29, 2025

Uh oh!

SuaYoo left a comment

Uh oh!

ikreymer left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Allow users to run crawls with 1 or 2 browser windows #2627

Allow users to run crawls with 1 or 2 browser windows #2627

Uh oh!

Conversation

tw4l commented May 27, 2025 • edited by SuaYoo Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changed

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SuaYoo commented May 29, 2025

Uh oh!

SuaYoo left a comment

Choose a reason for hiding this comment

Uh oh!

ikreymer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tw4l commented May 27, 2025 •

edited by SuaYoo

Loading