Dev/minimal flow #16

Raalsky · 2024-08-19T10:43:55Z

No description provided.

kgodlewski · 2024-08-20T13:49:37Z

src/neptune_scale/__init__.py

        """
        verify_type("api_token", api_token, str)
        verify_type("family", family, str)
        verify_type("run_id", run_id, str)
+        verify_type("max_queue_size", max_queue_size, int)
+        verify_type("max_queue_size_exceeded_callback", max_queue_size_exceeded_callback, (Callable, type(None)))



We could check for > 1 here, or add a verify_int() helper with e.g. a positive=True argument.

Sure, will do
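
The `verify_int()` helper suggested in this thread might look roughly like the sketch below. Only the name and the `positive=True` idea come from the review comment; the full signature, the choice to treat "positive" as strictly greater than zero, the rejection of `bool`, and the error messages are assumptions, not the code that was merged.

```python
def verify_int(var_name: str, value: object, positive: bool = False) -> None:
    """Hypothetical helper: check that `value` is an int, optionally > 0."""
    # bool is a subclass of int in Python, so reject it explicitly (assumption).
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError(f"{var_name} must be an int (was {type(value).__name__})")
    # Assumption: "positive" means strictly greater than zero.
    if positive and value <= 0:
        raise ValueError(f"{var_name} must be a positive integer (was {value})")
```

With a helper like this, the call site would become `verify_int("max_queue_size", max_queue_size, positive=True)`, folding the type check and the range check into one line.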

kgodlewski · 2024-08-20T14:01:55Z

src/neptune_scale/core/components/operations_queue.py

@@ -51,6 +51,7 @@ def __init__(

    def enqueue(self, *, operation: RunOperation) -> None:
        try:
+            # TODO: This lock could be moved to the Run class
            with self._lock:


Or just have OperationsQueue own the lock? It doesn't seem that Run is using it, apart from passing it to the OperationsQueue constructor.

In the following PRs we make sure that no one can enqueue new operations once we start operating on SyncProcess, so the lock cannot be owned by OperationsQueue.
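
To illustrate the argument in this thread, here is a minimal sketch of a lock owned by the run and shared with the queue, so that one lock guards both enqueueing and a shutdown step. The class bodies are simplified stand-ins, and the `_closing` flag, the `log()` and `close()` method bodies, and the use of `threading.RLock` are assumptions made for the example; the follow-up PRs may do this differently.

```python
import threading
from collections import deque


class OperationsQueue:
    """Simplified stand-in for the PR's class; it only borrows the lock."""

    def __init__(self, *, lock) -> None:
        self._lock = lock
        self._items: deque = deque()

    def enqueue(self, *, operation: object) -> None:
        with self._lock:
            self._items.append(operation)


class Run:
    """Hypothetical owner of the lock."""

    def __init__(self) -> None:
        # Re-entrant, so log() can hold it while enqueue() acquires it again.
        self._lock = threading.RLock()
        self._queue = OperationsQueue(lock=self._lock)
        self._closing = False

    def log(self, operation: object) -> None:
        with self._lock:
            if self._closing:
                raise RuntimeError("run is closing; no new operations accepted")
            self._queue.enqueue(operation=operation)

    def close(self) -> None:
        with self._lock:
            # Once this flag is set under the lock, no later log() call can enqueue.
            self._closing = True
```

Because the check of `_closing` and the enqueue happen under the same lock that `close()` takes, an operation cannot slip into the queue after shutdown has begun, which is harder to guarantee if the lock is private to the queue.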

kgodlewski · 2024-08-20T15:38:28Z

src/neptune_scale/__init__.py

+        verify_type("as_experiment", as_experiment, (str, type(None)))
+        verify_type("creation_time", creation_time, (datetime, type(None)))
+        verify_type("from_run_id", from_run_id, (str, type(None)))
+        verify_type("from_step", from_step, (int, float, type(None)))


Shouldn't we make from_step obligatory when from_run_id is provided, and the other way around? Otherwise _create_run will silently ignore forking

Isn't it covered by line 128?

    if (from_run_id is not None and from_step is None) or (from_run_id is None and from_step is not None):
        raise ValueError("`from_run_id` and `from_step` must be used together.")

kgodlewski · 2024-08-20T15:49:47Z

src/neptune_scale/core/metadata_splitter.py

+            new_size = size + pb_key_size(key) + proto_value.ByteSize() + 6
+


Where does the +6 come from?

It was based on our previous internal script. I think it is an overhead for the type and length definitions.

kgodlewski · 2024-08-20T15:51:32Z

src/neptune_scale/core/serialization.py

@@ -33,3 +62,8 @@ def make_step(number: float | int, raise_on_step_precision_loss: bool = False) -
    micro = micro % m

    return Step(whole=whole, micro=micro)
+
+
+def pb_key_size(key: str) -> int:


A short explanation of how this is calculated and why would be great

It came from our previous script, but I think it follows from the maximum length assumption (10k if I remember correctly, so at most 2 bytes for the varint representation) plus the type definition overhead.
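
The varint reasoning in this reply can be checked with a short sketch. In the protobuf wire format, a length-delimited field is written as a tag, a varint length prefix, and the payload, and a varint stores 7 bits of the value per byte. The function names below are hypothetical and this is not the body of `pb_key_size` from the PR; the one-byte tag is an assumption that holds for field numbers 1 to 15.

```python
def varint_size(n: int) -> int:
    """Bytes needed to encode a non-negative integer as a protobuf varint."""
    if n < 0:
        raise ValueError("n must be non-negative")
    size = 1
    # Each varint byte carries 7 bits of the value.
    while n >= 0x80:
        n >>= 7
        size += 1
    return size


def estimated_key_size(key: str, tag_size: int = 1) -> int:
    """Hypothetical estimate: tag + length prefix + UTF-8 payload."""
    payload = len(key.encode("utf-8"))
    return tag_size + varint_size(payload) + payload
```

Any length below 16384 (2^14) fits in two varint bytes, so a 10k cap on key length does bound the length prefix at 2 bytes, as the reply suggests.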

kgodlewski · 2024-08-20T16:05:11Z

tests/unit/test_metadata_splitter.py

+    result = list(builder)
+
+    # then
+    assert len(result) > 0


Detail: assert len(result) > 1 would make sure we're actually generating multiple results for entries that are too large. There's another one below as well.
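
A sketch of the stricter assertion, using a deliberately simple splitter so the example is self-contained. `split_by_size` and its size accounting are hypothetical and much cruder than the PR's metadata splitter; the point is only that `> 1` fails if the input is never split, while `> 0` would still pass.

```python
from typing import Iterator


def split_by_size(items: dict[str, bytes], max_bytes: int) -> Iterator[dict[str, bytes]]:
    """Hypothetical splitter: yield chunks whose payload stays within max_bytes."""
    chunk: dict[str, bytes] = {}
    size = 0
    for key, value in items.items():
        item_size = len(key.encode("utf-8")) + len(value)
        # Start a new chunk when adding this item would exceed the limit.
        if chunk and size + item_size > max_bytes:
            yield chunk
            chunk, size = {}, 0
        chunk[key] = value
        size += item_size
    if chunk:
        yield chunk


def test_splits_oversized_input() -> None:
    # given: 10 entries of 104 bytes each, with a 256-byte limit
    items = {f"k{i:03d}": b"x" * 100 for i in range(10)}

    # when
    result = list(split_by_size(items, max_bytes=256))

    # then: more than one chunk shows that splitting actually happened
    assert len(result) > 1
    # and no entry was lost along the way
    assert sum(len(chunk) for chunk in result) == len(items)
```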

Raalsky added 8 commits July 29, 2024 10:10

Added minimal Run classes (#6)

6cf6715

Added OperationsQueue component (#7)

b85f1c5

Logging metadata (#8)

2faf3a5

Run creation and basic data synchronization (#9)

4c91b15

Added support for env variables for project and api token (#11)

67f63cb

Splitting metadata into multiple messages on log (#12)

6e4ada2

Added ErrorsMonitor and ErrorsQueue (#13)

d8098f9

Added support for family parameter (#14)

e35876c

kgodlewski reviewed Aug 20, 2024

Raalsky added 2 commits August 21, 2024 09:23

Code review

cceddab

Code review 2

04a7dce

Raalsky requested a review from kgodlewski August 21, 2024 07:40

kgodlewski approved these changes Aug 21, 2024

Raalsky merged commit 100a9b4 into main Aug 21, 2024
4 checks passed

Raalsky deleted the dev/minimal-flow branch August 21, 2024 11:45
