add model mapping for embeddings #21
base: defang
Conversation
small suggestions inline, but largely 👍
@@ -82,7 +47,7 @@ def get_proxy_target(model, path):
     else:
         return f"https://{location}-aiplatform.googleapis.com/v1/projects/{project_id}/locations/{location}/{model}:rawPredict"

-def get_headers(model, request, path):
+def get_header(model, request, path):
Suggested change:
-def get_header(model, request, path):
+def get_headers(model, request, path):
@@ -159,7 +127,7 @@ async def handle_proxy(request: Request, path: str):
         conversion_target = "anthropic"

     # Build safe target URL
-    target_url, request_headers = get_headers(model, request, path)
+    target_url, request_headers = get_header(model, request, path)
Suggested change:
-    target_url, request_headers = get_header(model, request, path)
+    target_url, request_headers = get_headers(model, request, path)
    vertex_request = {
        "instances": []
    }

    msg_input = request.get("input")
    if type(msg_input) is str:
        vertex_request["instances"] = [{
            "content": f"{msg_input}"
        }]
    elif type(msg_input) is list:
        vertex_request["instances"] = [{"content": f"{str(item)}"} for item in msg_input]

    return vertex_request
I think we can tighten this up a bit. What do you think of this?
Suggested change:
-    vertex_request = {
-        "instances": []
-    }
-    msg_input = request.get("input")
-    if type(msg_input) is str:
-        vertex_request["instances"] = [{
-            "content": f"{msg_input}"
-        }]
-    elif type(msg_input) is list:
-        vertex_request["instances"] = [{"content": f"{str(item)}"} for item in msg_input]
-    return vertex_request
+    inputs = request.get("input", [])
+    if not isinstance(inputs, list):
+        inputs = [inputs]
+    return {
+        "instances": [{"content": str(content)} for content in inputs]
+    }
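As a standalone sketch, the reviewer's tightened version could look like the function below. The function name `to_vertex_embeddings_request` and the assumption that `request` is a plain dict parsed from the OpenAI-style request body are illustrative, not taken from the PR:

```python
def to_vertex_embeddings_request(request: dict) -> dict:
    # Normalize "input" to a list: OpenAI embeddings requests allow either
    # a single string or a list of strings.
    inputs = request.get("input", [])
    if not isinstance(inputs, list):
        inputs = [inputs]  # a bare string becomes a one-element list
    # Build the Vertex "instances" payload in a single comprehension.
    return {"instances": [{"content": str(content)} for content in inputs]}
```

This handles `"input": "hello"` and `"input": ["a", "b"]` through one code path, which is what makes it tighter than the original str/list branches.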
Description of changes:
Add GCP embeddings support. An OpenAI embeddings request is converted into a Vertex embeddings request before calling the model; the Vertex response is then converted back into an OpenAI embeddings response.
The previous vertex.py was renamed to chat.py (paralleling how the AWS code is organized). Changes to the AWS code were kept minimal so that taking upstream changes doesn't cause major conflicts.
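The reverse direction described above (Vertex response back to an OpenAI embeddings response) could be sketched as below. The field names `predictions` / `embeddings` / `values` follow the Vertex text-embedding response shape; the function name, `model` argument, and zeroed token counts are hypothetical placeholders, not code from this PR:

```python
def to_openai_embeddings_response(vertex_response: dict, model: str) -> dict:
    # Each Vertex prediction carries its vector under embeddings.values;
    # map each one to an OpenAI-style "embedding" object with its index.
    data = []
    for i, prediction in enumerate(vertex_response.get("predictions", [])):
        values = prediction.get("embeddings", {}).get("values", [])
        data.append({"object": "embedding", "index": i, "embedding": values})
    return {
        "object": "list",
        "data": data,
        "model": model,
        # Placeholder usage counts; a real mapping would read token
        # statistics from the Vertex response if available.
        "usage": {"prompt_tokens": 0, "total_tokens": 0},
    }
```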