
Add prototype of new client #7

Closed · wants to merge 6 commits

Conversation

@alxhill (Contributor) commented Aug 22, 2024

Todo:

  • Change ingest flow to disable direct-write
  • Stop using polars for reading schema
  • Don't assume the first column is always the timeseries one
  • Re-use existing S3 upload
  • Integrate into existing classes
  • ...and much more 🙂


```python
from _api.scout_catalog import CatalogService

catalog = create_service(CatalogService, get_base_url())
```
Contributor:

we can probably wrap this all in a client or session class so we have fewer globals floating around
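A minimal sketch of that idea, assuming `create_service` and `get_base_url` keep their current signatures; `NominalClient` is a hypothetical name:

```python
from _api.scout_catalog import CatalogService


class NominalClient:
    """Owns the service handles so callers don't rely on module-level globals."""

    def __init__(self, base_url: str):
        self.base_url = base_url
        # create_service / get_base_url assumed in scope, as in the snippet above.
        self.catalog = create_service(CatalogService, base_url)


# One client object per session instead of a module-level `catalog` global.
client = NominalClient(get_base_url())
```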


```python
# wrappers for conjure service apis

from _api.scout_catalog import CatalogService
```
Contributor:

all these paths need to change to ._api.combined.<whatever>

(though also can just create a separately installable package for this eventually)

@alxhill (Author):

down to start publishing the conjure APIs immediately - the conjure Python generator already produces Python packages, it just needs some wiring to push to PyPI


```python
# todo - merge this with the existing _upload_s3 function, it's mostly copy-pasted wholesale
def _upload_file_s3(self):
    if hasattr(self, "s3_path"):
```
Contributor:

These attrs can be optional or something, rather than needing to check for presence.
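A sketch of that pattern, assuming `s3_path` marks a completed upload (the body of the `hasattr` branch isn't shown above):

```python
from typing import Optional


class Dataset:
    def __init__(self, csv_path: str):
        self.csv_path = csv_path
        # Declared up front as Optional instead of being created on the fly,
        # so callers can test `is None` rather than hasattr().
        self.s3_path: Optional[str] = None

    def _upload_file_s3(self) -> None:
        if self.s3_path is not None:
            return  # already uploaded (assumption: attr marks a finished upload)
        ...
```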

```python
class Dataset:
    def __init__(self, csv_path: str):
        self.csv_path = csv_path
        print("Reading dataset schema...", flush=True)
```
Contributor:

we can migrate over to logging for these messages

@alxhill (Author):

agree - do you know if it auto-flushes? it was very annoying not seeing logs during long-running blocking operations
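For reference, the stdlib handlers do flush per record (`StreamHandler.emit` ends with a `self.flush()` call), so the `flush=True` workaround wouldn't be needed. A minimal setup sketch, with a hypothetical logger name:

```python
import logging

# basicConfig installs a StreamHandler on stderr; StreamHandler.emit flushes
# after every record, so messages appear immediately even during
# long-running blocking operations.
logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")

logger = logging.getLogger("nominal")  # hypothetical logger name
logger.info("Reading dataset schema...")
```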

Comment on lines +341 to +342
```python
if hasattr(self, "run_rid"):
    return
```
Contributor:

should definitely log when we skip steps

@alxhill (Author):

the reason I didn't do this is that sync is called recursively - if you had 10 series in a workbook, it'd call sync on all of them, then each of those would call sync on its respective run (and eventually dataset). So the dataset/run would be uploaded the first time, then skipped for the other 9 series, producing a lot of confusing noise.

if/when we switch to a proper logger, I'd definitely add this at debug level or something
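A sketch of what that could look like once a proper logger exists (method excerpt; the logger setup and message are hypothetical, the `run_rid` check follows the snippet above):

```python
import logging

logger = logging.getLogger(__name__)


def sync(self) -> None:
    if hasattr(self, "run_rid"):
        # Hit once per series during a recursive sync, so keep it at debug
        # level to avoid the repeated noise described above.
        logger.debug("run already synced (run_rid=%s), skipping", self.run_rid)
        return
    ...
```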

Comment on lines +369 to +372
```python
@staticmethod
def from_ch(run: Run, sess, name: str, series_id: str, series_type: str) -> "Series":
    base_query = f"SELECT timestamp, value FROM nominal.dataset_{series_type} WHERE series='{series_id}'"
    return Series(run, sess, name, base_query, RawNumericSeriesNode(name=name))
```
Contributor:

Suggested change:

```diff
-@staticmethod
-def from_ch(run: Run, sess, name: str, series_id: str, series_type: str) -> "Series":
-    base_query = f"SELECT timestamp, value FROM nominal.dataset_{series_type} WHERE series='{series_id}'"
-    return Series(run, sess, name, base_query, RawNumericSeriesNode(name=name))
+@classmethod
+def from_ch(cls, run: Run, sess, name: str, series_id: str, series_type: str) -> "Series":
+    base_query = f"SELECT timestamp, value FROM nominal.dataset_{series_type} WHERE series='{series_id}'"
+    return cls(run, sess, name, base_query, RawNumericSeriesNode(name=name))
```

Comment on lines +84 to +85
```python
data_sources = [
    (ds.filename, CreateRunDataSource(data_source=DataSource(dataset=ds.dataset_rid), series_tags={}))
```
@alkasm (Contributor) commented Aug 23, 2024:

will want the ability to add a custom ref name here, with the filename probably as a fallback
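A sketch of that fallback, following the snippet above; `ref_names` (a per-dataset mapping of optional overrides) and `datasets` are hypothetical:

```python
data_sources = [
    (
        ref_names.get(ds.filename) or ds.filename,  # custom ref name, else filename
        CreateRunDataSource(data_source=DataSource(dataset=ds.dataset_rid), series_tags={}),
    )
    for ds in datasets
]
```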



```python
class Dataset:
    def __init__(self, csv_path: str):
```
Contributor:

let's create a bunch of classmethods for alternative constructors, e.g. from_csv(), so that we can later have from_pandas(), from_parquet(), etc.
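A sketch of that pattern; the from_pandas/from_parquet bodies are placeholders, and everything beyond from_csv is an assumption about future work:

```python
class Dataset:
    def __init__(self, csv_path: str):
        self.csv_path = csv_path

    @classmethod
    def from_csv(cls, csv_path: str) -> "Dataset":
        return cls(csv_path)

    @classmethod
    def from_pandas(cls, df) -> "Dataset":
        # e.g. write df out to a temporary CSV, then delegate to from_csv
        ...

    @classmethod
    def from_parquet(cls, parquet_path: str) -> "Dataset":
        # e.g. convert to CSV, or teach __init__ about multiple formats
        ...
```

Using classmethods (rather than staticmethods) also means any future subclass constructed through these helpers gets the subclass type.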

@alkasm (Contributor) commented Sep 11, 2024:

Closing this PR (but not deleting the branch)

@alkasm closed this Sep 11, 2024