-
I have deployed Cohere EmbedV4 on Azure but have a hard time making it work. I am using python 3.11, and client = EmbeddingsClient(
endpoint="https://<myProject>.services.ai.azure.com/models",
credential=AzureKeyCredential("myKey")
)
response = client.embed(
input=["first phrase","second phrase","third phrase"],
model = "embed-v-4-0-2",
) I am getting this error :
Which doesn't tell much. Looking deeper and trying to debug the lib it may look like an Note that :
Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
Hi, The
Here is a full python example
|
Beta Was this translation helpful? Give feedback.
-
Update: For older models (like ada-002), it apparently doesn’t matter. For Cohere’s, you should disable it. I was today years old when I learned this even existed. |
Beta Was this translation helpful? Give feedback.
-
Ok this makes sense, you had this preview feature turned on which switches between models using a single endpoint and authentication, making integration more seamless. For MaaS (Serverless) ensure this is set to “Off,” which controls whether this deployment feature is enabled. For reference for others, the preview features are set by selecting the Speaker funnel on https://ai.azure.com If you encounter model deployment issues or inability to deploy Models as Service (serverless) ensure this feature is turned off. Reference url is https://learn.microsoft.com/en-us/azure/ai-foundry/concepts/deployments-overview |
Beta Was this translation helpful? Give feedback.
Update:
This feature flag was activated (cf. screenshot), which meant the model was force-deployed on *.services.ai.azure.com. Once disabled, I was able to deploy it correctly.
For older models (like ada-002), it apparently doesn’t matter. For Cohere’s, you should disable it.
I was today years old when I learned this even existed.
I'm leaving the discussion open if this isn't an expected behaviour from Azure. But my problem is solved.
Apologies for taking up your time.