[Infra] Merge dev branch by yuneng-berri · Pull Request #25871 · BerriAI/litellm

yuneng-berri · 2026-04-16T16:20:11Z

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

I have Added testing in the tests/test_litellm/ directory, Adding at least 1 test is a hard requirement - see details
My PR passes all unit tests on make test-unit
My PR's scope is as isolated as possible, it only solves 1 specific problem
I have requested a Greptile review by commenting @greptileai and received a Confidence Score of at least 4/5 before requesting a maintainer review

Delays in PR merge?

If you're seeing a delay in your PR being merged, ping the LiteLLM Team on Slack (#pr-review).

CI (LiteLLM team)

CI status guideline:

50-55 passing tests: main is stable with minor issues.

45-49 passing tests: acceptable but needs attention

<= 40 passing tests: unstable; be careful with your merges and assess the risk.

Branch creation CI run
Link:
CI run for the last commit
Link:
Merge / cherry-pick CI run
Links:

Screenshots / Proof of Fix

Type

🚄 Infrastructure

Changes

Gemini API keys embedded in URLs as ?key= query parameters leak through httpx error tracebacks, which are then captured by traceback.format_exc() and forwarded to logging callbacks, Slack/Teams alerts, and HTTP client responses. Short-term: all httpx.HTTPStatusError handlers now raise MaskedHTTPStatusError(...) from None, which masks the URL and breaks exception chaining so the original error never appears in tracebacks. Long-term: moved all Gemini/Vertex URL constructions from ?key={api_key} to x-goog-api-key header (Google's documented auth method), so the key is never in the URL at all. WebSocket realtime is the only exception since WS clients cannot use custom headers. Additionally hardened all outbound credential paths: - WebSocket close reasons now pass through _redact_string() - Callback pipeline (failure_handler) redacts traceback_exception and error_str before forwarding to integrations (Langfuse, Datadog, etc.) - Slack/Teams alert messages redacted in send_llm_exception_alert, ProxyLogging.failure_handler, and post_call_failure_hook - HTTP error responses in proxy SSE and health endpoints redacted - Exception messages in exception_mapping_utils redacted - print_verbose() stdout output redacted when set_verbose=True - HTTPHandler.put() now has MaskedHTTPStatusError (was missing)

Addresses Greptile review — the async streaming path was missing mask_sensitive_info() on the response body, while the sync path had it.

Add 50 tests across 3 files covering the new MaskedHTTPStatusError, safe response helpers, _redact_string in error paths, Gemini interactions x-goog-api-key header auth, and RAG ingestion header usage. Fix missing early-validation for Gemini API key in _get_token_and_url() which caused TypeError when key was None (headers got None value). Harmonize error messages between the two validation sites. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

security: prevent API key leaks in error tracebacks, logs, and alerts

…ictedpython fix(guardrails): replace custom_code sandbox with RestrictedPython

Tighten validation of request body parameters in the proxy routing layer. Use context variables for internal call state management instead of passing flags through request kwargs. Clean up metadata handling at the proxy boundary.

Add safe_join() and safe_filename() in proxy/common_utils/path_utils.py for constructing filesystem paths from user-controlled inputs. Apply to guardrail category YAML endpoint and dotprompt file converter.

Add null byte rejection to safe_join and safe_filename. Normalize backslash separators in safe_filename for cross-platform safety. Include resolved path in ValueError for debugging. Move imports to module level per project conventions.

fix(proxy): harden request parameter handling

fix(proxy): add shared path utilities, prevent directory traversal

- vertex_ai_context_caching.py: add explicit Optional[str] annotation on auth_header so later branches that assign vertex_auth_header (Optional[str]) type-check against the first branch's dict assignment (which already has type: ignore[assignment]). - path_utils.py: remove unused pathlib.Path import (F401). - emulated_handler.py: extract _extract_tool_call_fields, _resolve_queries_from_args, _execute_file_search_tool_calls, and _build_follow_up_input helpers to drop aresponses_with_emulated_file_search below ruff's PLR0915 statement limit. Behavior unchanged.

RestrictedPython (ZPL-2.1, a BSD-style permissive license) was added as a dependency for the custom_code guardrail sandbox, but the license checker didn't recognize it. Add to authorized packages list.

…HTTPStatusError MaskedHTTPStatusError constructs a new httpx.Response from the original error. Two bugs surfaced under real HTTP error responses: 1. The new Response was created without request=, so response.request raised RuntimeError("The .request property has not been set.") for any downstream caller (e.g. exception_mapping_utils) that inspected it. 2. The decoded response bytes were passed together with the original Content-Encoding header. On construction httpx tried to decompress the already-decoded bytes and raised httpx.DecodingError ("Error -3 while decompressing data: incorrect header check"). Set response.request to the masked Request and strip Content-Encoding (and the now-stale Content-Length) before rebuilding the Response. URL/message masking is unchanged; the new request carries the already masked URL. Also update test_logging_key_masking_gemini: the security commit 25f93be moved Gemini API keys from ?key=... URL params to the x-goog-api-key header, so api_base no longer contains the key.

…itellm_yj_apr15 # Conflicts: # litellm/litellm_core_utils/litellm_logging.py # uv.lock

vercel · 2026-04-16T16:20:19Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
litellm	Ready	Preview, Comment	Apr 16, 2026 4:43pm

greptile-apps · 2026-04-16T16:27:22Z

Greptile Summary

This dev branch merge introduces several security hardening and infrastructure improvements: (1) a RestrictedPython-based sandbox for custom code guardrails replacing the old regex-based validation, (2) path-traversal prevention via safe_join/safe_filename helpers, (3) broader secret redaction coverage in error tracebacks and WebSocket close reasons, (4) an MaskedHTTPStatusError wrapper that strips API keys from HTTP error URLs, and (5) a ContextVar (is_internal_call) to suppress double-billing for emulated file-search sub-calls. The changes are well-tested with comprehensive security test cases.

Confidence Score: 5/5

Safe to merge — all changes are security improvements with no regressions identified.

No P0 or P1 issues found. The sandbox migration from regex to RestrictedPython is a clear security improvement with comprehensive test coverage. Path-traversal guards, credential masking, and secret-redaction expansions are all correctly implemented.

No files require special attention.

Important Files Changed

Filename	Overview
litellm/proxy/guardrails/guardrail_hooks/custom_code/sandbox.py	New RestrictedPython sandbox; async allowed via AsyncAwareTransformer; correct guard setup (safer_getattr, full_write_guard, default_guarded_getitem/iter)
litellm/proxy/common_utils/path_utils.py	New safe_join/safe_filename helpers; realpath + containment check correctly prevents directory traversal and null-byte injection
litellm/_logging.py	SecretRedactionFilter now also redacts exception tracebacks and extra fields; _redact_string respects LITELLM_DISABLE_REDACT_SECRETS flag
litellm/llms/custom_httpx/http_handler.py	MaskedHTTPStatusError and raise_masked*_error helpers strip API keys from request URLs on HTTP errors; well-tested
litellm/_internal_context.py	New is_internal_call ContextVar for suppressing billing callbacks during emulated file-search sub-calls
litellm/proxy/guardrails/guardrail_endpoints.py	get_category_yaml now uses safe_join to prevent path traversal; sandbox helpers applied to test_custom_code_guardrail endpoint
litellm/responses/file_search/emulated_handler.py	is_internal_call correctly set and restored in finally blocks for both sub-calls; clean implementation
tests/test_litellm/proxy/guardrails/test_custom_code_security.py	Comprehensive sandbox security tests: bytecode rewrite, cr_frame, NFKC homoglyph, dunder access, import forms, runtime rejections, augmented assignment
litellm/litellm_core_utils/exception_mapping_utils.py	_redact_string now applied to tracebacks included in mapped exception messages

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[User submits custom guardrail code] --> B[compile_sandboxed]
    B --> C{RestrictedPython\nAST transform}
    C -->|Import / exec / dunder\nname / bytecode attack| D[SyntaxError raised\n→ CustomCodeCompilationError]
    C -->|Clean code| E[Compiled bytecode]
    E --> F[exec in build_sandbox_globals]
    F --> G{apply_guardrail\ndefined?}
    G -->|No| H[CustomCodeCompilationError]
    G -->|Yes| I[_compiled_function stored]
    I --> J[apply_guardrail called at request time]
    J --> K{_getattr_ = safer_getattr\n_write_ = full_write_guard\n__builtins__ = safe_builtins}
    K -->|Dunder attr access\nor unsafe write| L[AttributeError / TypeError at runtime]
    K -->|Safe operation| M[allow / block / modify result]

_{Reviews (2): Last reviewed commit: "fix(ci): authorize langgraph-prebuilt in..." | Re-trigger Greptile}

greptile-apps · 2026-04-16T16:27:25Z

+        return value
    return _SECRET_RE.sub(_REDACTED, value)




_redact_string now respects the global redaction disable flag

This is a behavioral change: previously _redact_string was unconditional. Now that it's applied in security-sensitive error paths (WebSocket close reasons, error tracebacks) added elsewhere in this PR, disabling the global redaction env var will also disable those new guards. Operators who set this flag for debugging should understand it now covers traceback and WebSocket redaction too — not just message-level redaction. Consider noting this in the env-var documentation.

langgraph-prebuilt 1.0.9 imports ExecutionInfo and ServerInfo from langgraph.runtime, but those symbols are not exported until langgraph 1.1.0. Our pin of langgraph==1.0.10 allows langgraph-prebuilt<1.1.0,>=1.0.8, and uv resolves to 1.0.9 (the latest in range), which breaks at import time in every test that touches langgraph.prebuilt (e.g. tests/pass_through_tests/test_mcp_routes.py): ImportError: cannot import name 'ExecutionInfo' from 'langgraph.runtime' Pinning langgraph-prebuilt to 1.0.8 pairs correctly with langgraph==1.0.10 and restores the import path.

langgraph-prebuilt was previously pulled in as a transitive of langgraph so PyPI license metadata was reported as unknown. Now that it is explicitly pinned (==1.0.8) to avoid the broken 1.0.9 release, the license checker flags it. It is published under MIT by the same langchain-ai/langgraph repository as langgraph itself.

codecov · 2026-04-16T17:06:23Z

Codecov Report

❌ Patch coverage is 88.23529% with 26 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
litellm/responses/file_search/emulated_handler.py	83.92%	9 Missing ⚠️
litellm/proxy/guardrails/guardrail_endpoints.py	46.66%	8 Missing ⚠️
.../guardrails/guardrail_hooks/custom_code/sandbox.py	88.57%	4 Missing ⚠️
litellm/llms/custom_httpx/llm_http_handler.py	33.33%	2 Missing ⚠️
litellm/llms/bedrock/realtime/handler.py	50.00%	1 Missing ⚠️
litellm/proxy/common_request_processing.py	50.00%	1 Missing ⚠️
litellm/proxy/prompts/prompt_endpoints.py	50.00%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

stuxf and others added 24 commits April 14, 2026 23:09

fix: apply mask_sensitive_info to async streaming error body

74f55b0

Addresses Greptile review — the async streaming path was missing mask_sensitive_info() on the response body, while the sync path had it.

ci: trigger CI re-run for codecov

abb8d8d

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

test(gemini): align API key expectations

f521e27

test(prompts): isolate in-memory version tests

b1bc3c1

test(gemini): stub API key for format param tests

2911d99

fix(guardrails): replace custom_code sandbox with RestrictedPython

0a1b442

fix(guardrails): move test_custom_code_security.py to correct location

fc9852c

fix(guardrails): provide _inplacevar_ to sandbox

469045b

Merge pull request #25117 from stuxf/fix/credential-leak-prevention

3f3bfdb

security: prevent API key leaks in error tracebacks, logs, and alerts

Merge pull request #25818 from jaydns/fix/custom-code-guardrail-restr…

93faf32

…ictedpython fix(guardrails): replace custom_code sandbox with RestrictedPython

fix(proxy): harden request parameter handling

47214be

Tighten validation of request body parameters in the proxy routing layer. Use context variables for internal call state management instead of passing flags through request kwargs. Clean up metadata handling at the proxy boundary.

fix(proxy): add shared path utilities, prevent directory traversal

9691649

Add safe_join() and safe_filename() in proxy/common_utils/path_utils.py for constructing filesystem paths from user-controlled inputs. Apply to guardrail category YAML endpoint and dotprompt file converter.

test: add unit tests for path_utils safe_join and safe_filename

c2b3b62

style: fix import ordering in prompt_endpoints

a44c4d0

style: move import os to module level, fix import ordering

a4faace

Merge pull request #25827 from stuxf/fix/outbound-host-validation

0c8b83c

fix(proxy): harden request parameter handling

Merge pull request #25834 from stuxf/fix/path-traversal-guardrail-yaml

be1b802

fix(proxy): add shared path utilities, prevent directory traversal

fix(ci): authorize RestrictedPython in liccheck.ini

070374d

RestrictedPython (ZPL-2.1, a BSD-style permissive license) was added as a dependency for the custom_code guardrail sandbox, but the license checker didn't recognize it. Add to authorized packages list.

Merge remote-tracking branch 'origin/litellm_internal_staging' into l…

dafa1bf

…itellm_yj_apr15 # Conflicts: # litellm/litellm_core_utils/litellm_logging.py # uv.lock

greptile-apps Bot reviewed Apr 16, 2026

View reviewed changes

yuneng-berri temporarily deployed to integration-postgres April 16, 2026 16:36 — with GitHub Actions Inactive

yuneng-berri temporarily deployed to integration-postgres April 16, 2026 16:37 — with GitHub Actions Inactive

yuneng-berri had a problem deploying to integration-postgres April 16, 2026 16:37 — with GitHub Actions Error

vercel Bot deployed to Preview April 16, 2026 16:37 View deployment

yuneng-berri temporarily deployed to integration-postgres April 16, 2026 16:42 — with GitHub Actions Inactive

yuneng-berri had a problem deploying to integration-postgres April 16, 2026 16:42 — with GitHub Actions Error

vercel Bot deployed to Preview April 16, 2026 16:43 View deployment

yuneng-berri requested a review from ryan-crabbe-berri April 16, 2026 17:08

ryan-crabbe-berri approved these changes Apr 16, 2026

View reviewed changes

yuneng-berri merged commit 21c0718 into litellm_internal_staging Apr 16, 2026
98 of 100 checks passed

yuneng-berri deleted the litellm_yj_apr15 branch April 16, 2026 17:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Infra] Merge dev branch#25871

[Infra] Merge dev branch#25871
yuneng-berri merged 26 commits intolitellm_internal_stagingfrom
litellm_yj_apr15

yuneng-berri commented Apr 16, 2026

Uh oh!

vercel Bot commented Apr 16, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented Apr 16, 2026 •

edited

Loading

Important Files Changed

Uh oh!

greptile-apps Bot Apr 16, 2026

Uh oh!

codecov Bot commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

yuneng-berri commented Apr 16, 2026

Relevant issues

Pre-Submission checklist

Delays in PR merge?

CI (LiteLLM team)

Screenshots / Proof of Fix

Type

Changes

Uh oh!

vercel Bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps Bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

greptile-apps Bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented Apr 16, 2026

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vercel Bot commented Apr 16, 2026 •

edited

Loading

greptile-apps Bot commented Apr 16, 2026 •

edited

Loading