fix: Anthropic prompt caching on GCP Vertex AI #9605


Merged
merged 3 commits into from
Mar 30, 2025

Conversation

sammcj (Contributor) commented Mar 28, 2025

Title

Fix (hopefully) for prompt caching not working with Anthropic models on GCP Vertex AI

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory. Adding at least 1 test is a hard requirement - see details
  • I have added a screenshot of my new test passing locally
  • [-] My PR passes all unit tests on [make test-unit](https://docs.litellm.ai/docs/extras/contributing_code) - the same tests fail on my branch as fail on main
  • My PR's scope is as isolated as possible; it only solves 1 specific problem

New test:

(screenshot of the new test passing)

Other tests are no more broken than they are on main at present:

(screenshots)

And the same linting errors as on main (none introduced by the changes in this PR):

(screenshots)

Type

🐛 Bug Fix

Changes

Adds the missing header for Anthropic models on GCP Vertex AI, which allows prompt caching to work.

vercel bot commented Mar 28, 2025

The latest updates on your projects:

litellm: ✅ Ready - Mar 28, 2025 9:06am (UTC)

@krrishdholakia krrishdholakia merged commit a867324 into BerriAI:main Mar 30, 2025
3 checks passed
@sammcj sammcj deleted the gcp_prompt_caching branch March 31, 2025 11:29
krrishdholakia added a commit that referenced this pull request Apr 1, 2025
…x_parallel_requests = 0 (#9671)

* fix(proxy_server.py): remove non-functional parent backoff/retry on /chat/completion

Causes circular reference error

* fix(http_parsing_utils.py): safely return parsed body - don't allow mutation of cached request body by client functions

Root cause fix for circular reference error

* Revert "fix: Anthropic prompt caching on GCP Vertex AI (#9605)" (#9670)

This reverts commit a867324.

* add type hints for AnthropicMessagesResponse

* define types for response from AnthropicMessagesResponse

* fix response typing

* allow using litellm.messages.acreate and litellm.messages.create

* fix anthropic_messages implementation

* add clear type hints to litellm.messages.create functions

* fix anthropic_messages

* working anthropic API tests

* fixes - anthropic messages interface

* use new anthropic interface

* fix code quality check

* docs anthropic messages endpoint

* add namespace_packages = True to mypy

* fix mypy lint errors

* docs anthropic messages interface

* test: fix unit test

* test(test_http_parsing_utils.py): update tests

---------

Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>
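The http_parsing_utils.py fix described in the commit messages above (safely returning the parsed body so client functions cannot mutate the cached copy) can be sketched as follows. The function and cache names here are assumptions for illustration, not LiteLLM's actual implementation.

```python
import copy

# Sketch of the "don't allow mutation of cached request body" fix.
# Names are illustrative; LiteLLM's real implementation differs.
_parsed_body_cache: dict[int, dict] = {}

def get_parsed_body(request_id: int, raw_parse) -> dict:
    """Parse the request body once, cache it, and hand callers a copy.

    raw_parse is a zero-argument callable that does the actual parsing.
    """
    if request_id not in _parsed_body_cache:
        _parsed_body_cache[request_id] = raw_parse()
    # Returning a deep copy means client functions can mutate their copy
    # freely without corrupting the cached body (mutation of the shared
    # object is what produced the circular reference error).
    return copy.deepcopy(_parsed_body_cache[request_id])
```

The key design choice is that the cache stays private: every caller gets an independent copy, so the parse cost is paid once while mutations never leak back into shared state.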