fix(langsmith): avoid no running event loop during sync init by pandego · Pull Request #23727 · BerriAI/litellm

pandego · 2026-03-16T10:07:11Z

Summary

Fix the LangSmith logger init path for synchronous callers.

Today LangsmithLogger.__init__() always calls asyncio.create_task(self.periodic_flush()). When LiteLLM is initialized from a synchronous path, there is no running event loop yet, so logger startup raises RuntimeError: no running event loop.

This change starts the periodic flush task only when an event loop is already running. In sync contexts, logger initialization now stays safe and non-blocking instead of raising during setup.

Root cause

asyncio.create_task() requires an active running loop in the current thread. LangsmithLogger was calling it unconditionally during object construction.

Fix

add a small _start_periodic_flush_task() helper
use asyncio.get_running_loop() to detect whether task startup is safe
lazily start periodic flushing from async logging paths when needed
keep the behavior unchanged when LiteLLM is initialized inside an async context
keep the task startup control internal instead of exposing a public test-only constructor flag

Test plan

added regression coverage for:
- init without a running loop
- init with a running loop
- lazy startup from async_log_success_event
- lazy startup from async_log_failure_event
uv run pytest tests/test_litellm/integrations/test_langsmith_init.py -q
uv run pytest tests/logging_callback_tests/test_langsmith_unit_test.py -q

Context

This is related to earlier reports around LangSmith sync initialization and event-loop handling:

vercel · 2026-03-16T10:07:17Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
litellm	Ready	Preview, Comment	Mar 16, 2026 11:32am

CLAassistant · 2026-03-16T10:07:18Z

All committers have signed the CLA.

codspeed-hq · 2026-03-16T10:09:27Z

Merging this PR will not alter performance

✅ 16 untouched benchmarks

_{Comparing pandego:fix/langsmith-sync-no-event-loop (59caea4) with main (58e74a6)}

greptile-apps · 2026-03-16T10:10:13Z

Greptile Summary

This PR fixes a RuntimeError: no running event loop crash that occurred when LangsmithLogger was instantiated from a synchronous context. The unconditional asyncio.create_task(self.periodic_flush()) call in __init__ is replaced with a guarded _start_periodic_flush_task() helper that uses asyncio.get_running_loop(), returning None (and logging a debug message) when no loop is present. To ensure the flush task still starts when the logger is later used asynchronously, _ensure_periodic_flush_task() is called at the top of both async_log_success_event and async_log_failure_event, lazily scheduling the task on the first async invocation after a sync-context init.

Key observations:

The fix correctly handles the sync-init → async-use production pattern that was previously broken.
_ensure_periodic_flush_task is synchronous with no await between its guard check and assignment, which correctly prevents duplicate task creation under asyncio's cooperative scheduling model.
The synchronous log_success_event path intentionally does not call _ensure_periodic_flush_task; it relies on batch-size flushing via _send_batch(), which is the correct design since periodic flush is an async-only concept here.
All stale @patch("asyncio.create_task") decorators (which patched the wrong symbol) have been removed and replaced with focused, accurate test coverage for the guard paths and lazy startup.
Previous review feedback (silent queue drops on sync-init, placement of _ensure_periodic_flush_task inside the try block for async_log_failure_event, loose call-count assertions, and missing failure-event test) has been fully addressed in this iteration.

Confidence Score: 4/5

This PR is safe to merge; it fixes a real crash path without breaking existing async behaviour.
The core logic change is minimal and well-targeted. All previously raised review threads have been addressed. Test coverage now spans the no-loop, with-loop, and lazy-startup paths for both async log methods. The only reason for a non-perfect score is a small untested edge case: _ensure_periodic_flush_task will re-invoke _start_periodic_flush_task on every async log call when the prior task's done() is True (e.g., after a periodic_flush coroutine dies due to an unhandled exception) — but this is intentional recovery behaviour, is safe under asyncio's cooperative model, and is documented in a comment.
No files require special attention; both changed files are straightforward.

Important Files Changed

Filename	Overview
litellm/integrations/langsmith.py	Core fix: replaces unconditional `asyncio.create_task` in `__init__` with a guarded `_start_periodic_flush_task` helper, adds lazy `_ensure_periodic_flush_task` called on every async log event. Logic is sound; `log_success_event` (sync path) intentionally does not call `_ensure_periodic_flush_task` since sync logging relies on batch-size-based flushing via `_send_batch()`.
tests/test_litellm/integrations/test_langsmith_init.py	Stale `@patch("asyncio.create_task")` decorators correctly removed; five new tests cover no-loop init, loop-present init, and lazy startup for both async log paths. One subtle gap: `test_langsmith_init_starts_periodic_flush_with_running_loop` asserts `mock_loop.create_task.assert_called_once()` but does not verify the coroutine passed is `periodic_flush`; however the test does close the scheduled coroutine to avoid ResourceWarning.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[LangsmithLogger.__init__] --> B[self._flush_task = _start_periodic_flush_task]
    B --> C{asyncio.get_running_loop?}
    C -- RuntimeError: No Loop --> D[return None\n_flush_task = None]
    C -- Loop exists --> E[loop.create_task periodic_flush\n_flush_task = Task]

    F[async_log_success_event] --> G[_ensure_periodic_flush_task]
    H[async_log_failure_event] --> G
    G --> I{_flush_task is None\nor .done?}
    I -- Yes --> J[_start_periodic_flush_task]
    J --> C
    I -- No --> K[Task already running - skip]

    E --> L[periodic_flush loop\n while True: sleep → flush_queue]
    D -.->|lazy start on first async call| F
    D -.->|lazy start on first async call| H

_{Last reviewed commit: 59caea4}

pandego · 2026-03-16T10:59:44Z

Addressed the review feedback in the latest push.

What changed:

lazily start the periodic flush task on async log calls if the logger was created before an event loop existed
keep the original init-time guard so sync construction no longer raises
strengthened the tests:
- verify the no-loop init path leaves _flush_task unset
- verify the happy path schedules loop.create_task(...)
- verify async logging lazily starts the periodic flush task after sync init
removed the stale @patch("asyncio.create_task") decorators from the init tests

I also checked branch state against main before pushing - it is current, so the earlier failing checks were not caused by the branch being behind.

Local validation run:

uv run pytest tests/test_litellm/integrations/test_langsmith_init.py -q
uv run pytest tests/logging_callback_tests/test_langsmith_unit_test.py -q

pandego · 2026-03-16T11:50:44Z

Quick note: the LangSmith sync-init fix and regression coverage are updated per review and passing locally.

The remaining red checks appear unrelated to this change:

lint
test (proxy-misc)

I did not change the proxy startup path touched by the proxy-misc failure, and the lint failure looks repo-wide rather than specific to the LangSmith files in this PR.

fix(langsmith): skip periodic flush task without event loop

7d7200a

vercel bot deployed to Preview March 16, 2026 10:08 View deployment

greptile-apps bot reviewed Mar 16, 2026

View reviewed changes

Comment thread litellm/integrations/langsmith.py Outdated

Comment thread tests/test_litellm/integrations/test_langsmith_init.py Outdated

fix(langsmith): lazily start periodic flush task

628e267

vercel bot deployed to Preview March 16, 2026 11:00 View deployment

greptile-apps bot reviewed Mar 16, 2026

View reviewed changes

Comment thread litellm/integrations/langsmith.py Outdated

Comment thread tests/test_litellm/integrations/test_langsmith_init.py Outdated

test(langsmith): tighten flush task coverage

6b69557

vercel bot deployed to Preview March 16, 2026 11:14 View deployment

greptile-apps bot reviewed Mar 16, 2026

View reviewed changes

Comment thread tests/test_litellm/integrations/test_langsmith_init.py

test(langsmith): cover lazy failure flush startup

87bf503

vercel bot deployed to Preview March 16, 2026 11:23 View deployment

greptile-apps bot reviewed Mar 16, 2026

View reviewed changes

Comment thread litellm/integrations/langsmith.py Outdated

Comment thread litellm/integrations/langsmith.py

refactor(langsmith): keep flush startup private

59caea4

vercel bot deployed to Preview March 16, 2026 11:32 View deployment

ghost changed the base branch from main to litellm_oss_staging_03_17_2026 March 17, 2026 05:34

ghost merged commit e9291a9 into BerriAI:litellm_oss_staging_03_17_2026 Mar 17, 2026
37 of 39 checks passed

emerzon mentioned this pull request Apr 12, 2026

add azure ai grok 4 20 models #25582

Open

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(langsmith): avoid no running event loop during sync init#23727

fix(langsmith): avoid no running event loop during sync init#23727
5 commits merged intoBerriAI:litellm_oss_staging_03_17_2026from
pandego:fix/langsmith-sync-no-event-loop

pandego commented Mar 16, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 16, 2026 •

edited

Loading

Uh oh!

CLAassistant commented Mar 16, 2026 •

edited

Loading

Uh oh!

codspeed-hq bot commented Mar 16, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 16, 2026 •

edited

Loading

Important Files Changed

Uh oh!

Uh oh!

Uh oh!

pandego commented Mar 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pandego commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

pandego commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root cause

Fix

Test plan

Context

Uh oh!

vercel bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CLAassistant commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codspeed-hq bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Uh oh!

greptile-apps bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

pandego commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pandego commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pandego commented Mar 16, 2026 •

edited

Loading

vercel bot commented Mar 16, 2026 •

edited

Loading

CLAassistant commented Mar 16, 2026 •

edited

Loading

codspeed-hq bot commented Mar 16, 2026 •

edited

Loading

greptile-apps bot commented Mar 16, 2026 •

edited

Loading

pandego commented Mar 16, 2026 •

edited

Loading