Fix issue 10264 unicode docode error #10350

sugarkwork · 2025-04-27T02:41:40Z

Title

Fix UnicodeDecodeError on non-English OS by specifying UTF-8 encoding when opening files

Relevant issues

Fixes #10264

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

I have added testing in the tests/litellm_utils_tests/test_utils.py directory, Adding at least 1 test is a hard requirement – see details at https://docs.litellm.ai/docs/extras/contributing_code
I have added a screenshot/log of my new test passing locally
My PR passes all unit tests on (make test-unit)
My PR’s scope is as isolated as possible; it only solves 1 specific problem

Type

🐛 Bug Fix

Changes

Changed all occurrences of open("r") to open("r", encoding="utf-8") to ensure files are read as UTF-8 rather than the platform default, preventing UnicodeDecodeError on non-English OSes.
Added a unit test at tests/litellm_utils_tests/test_utils.py to verify that anthropic_tokenizer.json is read correctly with UTF-8 encoding.

screenshot

…ixes #10264)

vercel · 2025-04-27T02:41:44Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
litellm	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Apr 27, 2025 2:42am

CLAassistant · 2025-04-27T02:41:46Z

All committers have signed the CLA.

sugarkwork added 2 commits April 27, 2025 10:11

Specify UTF-8 encoding when opening files to avoid cp932 decode errors

fbbe5f6

add test for open(..., encoding="utf-8") in anthropic_tokenizer.json (f…

83ac2fa

…ixes #10264)

vercel bot deployed to Preview April 27, 2025 02:42 View deployment

sugarkwork closed this May 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix issue 10264 unicode docode error #10350

Fix issue 10264 unicode docode error #10350

Uh oh!

sugarkwork commented Apr 27, 2025

Uh oh!

vercel bot commented Apr 27, 2025 •

edited

Loading

Uh oh!

CLAassistant commented Apr 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Fix issue 10264 unicode docode error #10350

Fix issue 10264 unicode docode error #10350

Uh oh!

Conversation

sugarkwork commented Apr 27, 2025

Title

Relevant issues

Pre-Submission checklist

Type

Changes

screenshot

Uh oh!

vercel bot commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CLAassistant commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

vercel bot commented Apr 27, 2025 •

edited

Loading

CLAassistant commented Apr 27, 2025 •

edited

Loading