Return a copy from strict key removal to not break cache keys #9693


Open · wants to merge 1 commit into main

Conversation

@adrianlyjak (Contributor) commented Apr 2, 2025:

Caching stability improvements

A) Return a copy from strict key removal to not break cache keys. This change seems nice and simple. All of the callers I saw using it were updating their reference, e.g. thing = _remove_strict_from_schema(thing), so this seems like it should work just fine. After digging deeper, this part may be unnecessary?
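For illustration, a copy-based version of the helper might look something like the sketch below. The function name matches the helper discussed here, but the body is illustrative rather than litellm's actual implementation:

import copy
from typing import Any

def _remove_strict_from_schema(schema: Any) -> Any:
    """Return a copy of the schema with every nested 'strict' key removed,
    leaving the caller's original object (and any cache key derived from it) intact."""
    schema = copy.deepcopy(schema)  # work on a copy instead of mutating the caller's dict

    def _strip(node: Any) -> None:
        if isinstance(node, dict):
            node.pop("strict", None)
            for value in node.values():
                _strip(value)
        elif isinstance(node, list):
            for item in node:
                _strip(item)

    _strip(schema)
    return schema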

B) I also saw there's a property that can be set to re-use the same cache key later, in case mutation occurs, but that only happens if a litellm_params object has been set previously. Looks like there's a good opportunity to initialize it in wrapper_async, so it can be used to cache the key between request and response when it wasn't otherwise set.
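Very roughly, the memoization pattern I'm describing looks like the sketch below. The preset_cache_key name follows what the existing cache-key code appears to use, but the wrapper plumbing here is simplified and illustrative, not litellm's actual source:

def get_or_compute_cache_key(kwargs: dict, compute_key) -> str:
    # Make sure litellm_params exists, so the key can be memoized even when the
    # caller never passed it (the situation that triggered this PR).
    litellm_params = kwargs.setdefault("litellm_params", {})

    preset = litellm_params.get("preset_cache_key")
    if preset is not None:
        # Reuse the key computed at request time, even if nested kwargs
        # (e.g. a response schema) were mutated in the meantime.
        return preset

    key = compute_key(kwargs)
    litellm_params["preset_cache_key"] = key
    return key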

The origin of this is perhaps a bit of an edge case, but I think these improvements will help:

  1. My request didn't have litellm_params, although this is probably normal
  2. I had erroneously set strict: true on my json schema

Relevant issues

fix #9692

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement; see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests (make test-unit; see https://docs.litellm.ai/docs/extras/contributing_code)
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

☝️ this last one is debatable. I could remove part A) to make this simpler

Tests passing screenshots
[screenshots of the new tests passing locally]

Type

🐛 Bug Fix
✅ Test

Changes

Return a copy from strict key removal to not break cache keys

@vercel bot commented Apr 2, 2025:

The latest updates on your projects (Vercel for Git):

Name: litellm · Status: ✅ Ready · Updated (UTC): Apr 13, 2025 7:42pm

@adrianlyjak adrianlyjak marked this pull request as ready for review April 2, 2025 02:13
@adrianlyjak (Contributor, Author) commented:

oh, I'm not really sure this bug is going to be common. I was sending a custom JSON schema (rather than a Pydantic model) in order to decorate the JSON object with propertyOrdering, because Vertex (at least Gemini) annoyingly returns fields in alphabetical order rather than in the order given in the JSON.

Maybe my fault here for including strict in the first place. The bug didn't occur when I tried passing in the Pydantic model directly.
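For context, the schema I was sending looked roughly like this (propertyOrdering is the Vertex/Gemini ordering hint; the actual fields here are illustrative):

response_schema = {
    "type": "object",
    "propertyOrdering": ["name", "age", "bio"],  # ask Gemini to emit fields in this order
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
        "bio": {"type": "string"},
    },
    "required": ["name", "age"],
    "strict": True,  # the mistaken key that gets stripped, mutating the dict in place
}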

@adrianlyjak (Contributor, Author) commented:

Another idea would be to attach some sort of persistent attribute to a request to re-use the same cache key later, in case mutation occurs, but I didn't spend the time looking into how difficult that would be.

Maybe this would be a better idea (at least if it's not too complex), because it seems like it could solve the whole class of caching problems where a request changes between request time and response-caching time. I can't think of a case where that kind of mutation would ever be intentional.

@adrianlyjak adrianlyjak marked this pull request as draft April 2, 2025 02:28
@adrianlyjak (Contributor, Author) commented:

Just rubber ducking here now:

What's odd is that this does seem like it should already be happening: get_cache_key stores the computed key in litellm_params.

@adrianlyjak (Contributor, Author) commented:

ok, fixed

@adrianlyjak (Contributor, Author) commented:

welp, something's screwed up with the cost tracking tests now

@adrianlyjak (Contributor, Author) commented:

should be fixed

# mutate kwargs
kwargs["temperature"] = 0.8
cache_key_2 = litellm.cache.get_cache_key(**kwargs)
assert cache_key == cache_key_2  # key was memoized at request time, so it stays stable
@krrishdholakia (Contributor) commented:

hey @adrianlyjak this is not desired behaviour - if the user changes an optional param, we do not want to return a cached response

@adrianlyjak (Contributor, Author) replied Apr 2, 2025:

@krrishdholakia I didn't change anything to make this particular test pass; this is actually the current functionality. It appears to be the existing intended behavior of the code to memoize the cache key within a single request.

This test scenario is perhaps a little unrealistic, since the temperature itself can't actually get changed: the kwargs are spread, which copies the dict. However, if a nested parameter such as the response schema is mutated between the start of the request and the response, the same cache key is still used. The related fix I implemented was just to ensure litellm_params is initialized, so the cache key is actually memoized.
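In other words, spreading kwargs only makes a shallow copy: top-level values like temperature are insulated, but nested objects are shared, so mutating them reaches back into the original dict. A quick illustration:

kwargs = {"temperature": 0.7, "response_format": {"schema": {"strict": True}}}
copied = {**kwargs}  # what spreading kwargs effectively does: a shallow copy

copied["temperature"] = 0.8                        # top-level change: original untouched
copied["response_format"]["schema"].pop("strict")  # nested change: shared dict is mutated

assert kwargs["temperature"] == 0.7
assert "strict" not in kwargs["response_format"]["schema"]  # the original changed too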

@adrianlyjak (Contributor, Author) replied:

Re: “if the user changes an optional param”

To be clear, as I understand it, this would only happen internally within integrations. I don't know of a way for the user to modify the request parameters after calling completion (or another function).

@adrianlyjak (Contributor, Author) replied:

Adjusted the test for clarity: modifying temperature isn't really an expected use case, so instead I normalize the system -> developer role on a message (see the sketch below).
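For reference, the adjusted test mutates the messages rather than the temperature, roughly along these lines (a sketch, assuming litellm.cache has already been configured; the real test lives under tests/litellm/):

kwargs = {
    "model": "gpt-4o",
    "messages": [{"role": "system", "content": "You are a helpful assistant."}],
    "litellm_params": {},  # present so the key can be memoized on it
}

cache_key = litellm.cache.get_cache_key(**kwargs)

# simulate an integration normalizing the role between request time and response time
kwargs["messages"][0]["role"] = "developer"

cache_key_2 = litellm.cache.get_cache_key(**kwargs)
assert cache_key == cache_key_2  # same memoized key despite the nested mutation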

@adrianlyjak (Contributor, Author) commented:

Accidentally closed

A) Return a copy from strict key removal to not break cache keys
B) Fix issue in existing cache key stabilizer that was not storing a stable
   key across request/response if no litellm_params existed in the request
@CLAassistant commented Apr 22, 2025:

CLA assistant check
All committers have signed the CLA.

Development

Successfully merging this pull request may close these issues.

[Bug]: Vertex AI Gemini Structured JSON caching not working