Claude Sonnet 3.7 Model Not Supported in aws_bedrock_claude_pipeline.py Due to Inference Profile Requirement #494
samuelpetermoshi
started this conversation in
General
Replies: 1 comment
-
Hello @g453030291 and @tjbck |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
hello @g453030291,
Issue Summary
OpenWebUI is currently unable to use the Claude Sonnet 3.7 model via
aws_bedrock_claude_pipeline.py
. The model does not appear in the available model list due to a filtering condition and fails invocation due to a missing inference profile.Root Cause Analysis (RCA)
1. Model Filtering Excludes Claude Sonnet 3.7
The pipeline script retrieves available models using the following line:
https://github.com/propertyguru/ds-openwebui/blob/bbd5eba50ea32a3e7e4e2e526a5794679d401b2b/pg_openwebui_pipelines/pipelines/aws_bedrock_claude_pipeline.py#L150
However, Claude Sonnet 3.7 is not an on-demand model. As a result, it is excluded from the model list due to this filtering.
2. Invocation Requires an Inference Profile
Even when the model is manually added or the filtering condition is removed, invocation fails with the following error:
This confirms that Claude Sonnet 3.7 must be invoked using an inference profile ARN, rather than via the default on-demand method.
Suggested Fix
byInferenceType
optional).Reference video explaining the inference profile flow:
📹 https://youtu.be/7-BIpBn9ao4?si=HZksyMVVR__eKfhv&t=343
Impact
cc: @tjbck
Beta Was this translation helpful? Give feedback.
All reactions