pipecat

Author	SHA1	Message	Date
Paul Kompfner	77cfb181f6	Clarify per-inference helper usage in WebsocketLLMService docstring	2026-03-30 23:25:56 -04:00
Paul Kompfner	0b256936c6	Add ConnectionClosed to _receive_response_events raises docstring	2026-03-30 23:14:45 -04:00
Paul Kompfner	3922963c7a	Extract helpers in _process_context to reduce repeated code	2026-03-30 23:10:38 -04:00
Paul Kompfner	ab9f2a35b6	Clean up TTFB metrics and previous_response state on inference failure	2026-03-30 23:04:06 -04:00
Paul Kompfner	f19d1183d8	Clean up TTFB metrics and previous_response state on retry failure	2026-03-30 23:00:22 -04:00
Paul Kompfner	9ad4fe6344	Use concrete inference language instead of abstract transaction terminology	2026-03-30 22:42:40 -04:00
Paul Kompfner	04882f6f2a	Simplify _connect_websocket guard and remove unused State import	2026-03-30 22:32:08 -04:00
Paul Kompfner	712e42533d	Introduce WebsocketLLMService and refactor OpenAIResponsesLLMService to use it Add WebsocketLLMService as a base class for WebSocket-based LLM services, parallel to WebsocketTTSService/WebsocketSTTService but codifying a transactional request-response model rather than a continuous background receive loop. WebsocketLLMService provides: - Connection lifecycle (start/stop/cancel → connect/disconnect) - _ws_send/_ws_recv with transparent ConnectionClosed handling (auto-reconnect via exponential backoff → WebsocketReconnectedError) - _ensure_connected with retry via _try_reconnect OpenAIResponsesLLMService now inherits from WebsocketLLMService, removing duplicated connection management code (_connect, _disconnect, _reconnect, _ensure_connected, _ws_send, start, stop, cancel) and simplifying _process_context from a loop with attempt tracking to a flat try/except with a single retry.	2026-03-30 22:26:31 -04:00
Paul Kompfner	0efef19d60	Fix code review issues in WebSocket Responses service - Use finally block in _disconnect to ensure state is always cleaned up, even if websocket.close() throws — prevents stale cancellation state (e.g. _cancel_pending_response) from polluting a new connection - Catch ConnectionClosed in _drain_cancelled_response alongside TimeoutError — prevents _needs_drain from staying True and bricking the service on every subsequent inference attempt - Fall back to OPENAI_API_KEY env var when api_key is not passed, since the WebSocket connection uses raw websockets (not the AsyncOpenAI client which handles this automatically) - Use _clear_cancellation_state() instead of piecemeal resets where appropriate	2026-03-30 10:54:47 -04:00
Paul Kompfner	b5683556d4	Remove duplicate entries in run-release-evals.py, which appeared after a rebase	2026-03-30 10:03:43 -04:00
Paul Kompfner	26f85687d6	Handle response cancellation by draining before next inference Instead of trying to filter stale events inline (unreliable — the API doesn't provide a way to correlate events to a specific response), drain remaining events from a cancelled response before starting the next one. On cancellation, send response.cancel and set a drain flag. At the start of the next _process_context, read and discard events until a terminal event arrives, ensuring a clean connection. Falls back to reconnecting if draining times out.	2026-03-30 09:59:03 -04:00
Paul Kompfner	670ce30a1c	Document why HTTP variant doesn't use previous_response_id Over HTTP, previous_response_id requires store=True (30-day OpenAI-side conversation storage). The WebSocket variant avoids this via a connection-local in-memory cache that works with store=False. Add comments explaining this in both class docstrings, at the store=False parameter, and in the adapter's previous_response_id note.	2026-03-30 09:59:03 -04:00
Paul Kompfner	1c8d31de70	Add trace logging for previous_response_id decisions and fix example Add detailed trace-level logging to _apply_previous_response_optimization showing why the optimization was applied or fell back to full context, including the relevant data for debugging. Use append_to_context=False for the filler TTSSpeakFrame in the function-calling example to avoid altering the conversation history and breaking the previous_response_id prefix match.	2026-03-30 09:59:03 -04:00
Paul Kompfner	9defff2a34	Skip server-known output items in previous_response_id optimization When using previous_response_id, the server already knows its own output from the previous response. Store the raw response output and, on the next call, compare it against the items following the matched input prefix — checking role and text content for messages, and call_id for function calls. If the items match, skip them and send only truly new input (user messages, tool results). Falls back to full context if either the prefix or the output comparison fails.	2026-03-30 09:59:03 -04:00
Paul Kompfner	59d28f9fd2	Add changelog for WebSocket OpenAI Responses service	2026-03-30 09:59:03 -04:00
Paul Kompfner	f2a8a9e753	Add WebSocket-based OpenAI Responses LLM service with previous_response_id optimization Introduce a WebSocket variant of the OpenAI Responses API service that maintains a persistent connection to wss://api.openai.com/v1/responses for lower-latency inference. The WebSocket variant automatically uses previous_response_id to send only incremental context when possible, falling back to full context on reconnection or cache miss. The WebSocket variant becomes the new default OpenAIResponsesLLMService, and the HTTP variant is renamed to OpenAIResponsesHttpLLMService. Both share a private base class with common settings, parameter building, and run_inference (always HTTP) logic.	2026-03-30 09:58:56 -04:00
Mark Backman	d1eb2699f3	Merge pull request #4192 from pipecat-ai/mb/update-langchain Update langchain dependencies to latest major versions	2026-03-30 08:54:41 -04:00
Mark Backman	2e0f5fc6e9	Merge pull request #4194 from pipecat-ai/mb/update-community-integrations-package-convention Add pipecat-{vendor} package naming convention to community guide	2026-03-30 08:52:28 -04:00
Mark Backman	dd3ca6fbba	Merge pull request #4191 from pipecat-ai/mb/remove-openpipe Remove OpenPipe integration	2026-03-30 08:52:14 -04:00
Mark Backman	171692aa30	Add pipecat-{vendor} package naming convention to community guide Formalizes the package naming pattern that most community contributors already follow organically, improving discoverability on PyPI.	2026-03-29 12:39:20 -04:00
Mark Backman	81ddd103f9	Fix KeyError on context messages without role in RTVI observer Use dict.get() instead of direct key access to handle context messages that don't have a 'role' key, such as tool results.	2026-03-29 10:28:00 -04:00
Mark Backman	8c9e189394	Fix langchain imports for langchain 1.x compatibility ChatPromptTemplate moved from langchain.prompts to langchain_core.prompts in langchain 1.x.	2026-03-29 10:27:48 -04:00
Mark Backman	b6579dc763	Update uv lock with latest versions of Pygments and cryptography	2026-03-29 10:20:45 -04:00
Mark Backman	abd63336e4	Add changelog for #4192	2026-03-29 10:18:52 -04:00
Mark Backman	ccb9dc20f8	Update langchain dependencies to latest major versions Update langchain 0.3→1.2, langchain-community 0.3→0.4, and langchain-openai 0.3→1.1. This also unblocks openai>=2.26 which was previously constrained by the now-removed openpipe package.	2026-03-29 10:17:28 -04:00
Mark Backman	2177e28ee1	Remove OpenPipe integration OpenPipe was acquired by CoreWeave in September 2025. The Python package hasn't been updated since June 2025 and the repo since 2024. The openpipe package caps openai<=1.97.1, creating dependency conflicts with other extras. Remove the dead integration to clean up the codebase.	2026-03-29 10:12:35 -04:00
Mark Backman	3eb7c2bcd9	Merge pull request #4187 from OmerCohenAviv/fix/heartbeat-monitor-configurable Fix heartbeat monitor timeout not respecting custom heartbeat interval	2026-03-29 09:31:12 -04:00
Mark Backman	878940f94e	Merge pull request #4189 from Arindam200/main Add NebiusLLMService for Nebius Token Factory	2026-03-29 09:03:06 -04:00
Mark Backman	a3aeafcb2d	Alphabetize nebius entry in pyproject.toml extras	2026-03-29 08:58:01 -04:00
Mark Backman	63254fe337	Add NebiusLLMService with developer role and tool support fixes - Add Nebius LLM service wrapping OpenAI-compatible Token Factory API - Set supports_developer_role = False (Nebius rejects developer role) - Default to openai/gpt-oss-120b model (supports function calling) - Add Nebius function-calling example and env.example entry - Fix Sarvam developer role support - Update examples to use developer role for intro messages	2026-03-29 08:50:11 -04:00
Arindam200	39919f7889	Add NebiusLLMService for Nebius Token Factory Adds an OpenAI-compatible LLM service for Nebius Token Factory, supporting open-source models (Meta Llama, Qwen, DeepSeek) via their OpenAI-compatible REST API at https://api.tokenfactory.nebius.com/v1/.	2026-03-29 14:35:46 +05:30
OmercohenAviv	f2e0f5d20c	move wait_time out of loop	2026-03-29 00:05:21 +03:00
OmercohenAviv	2724ef6d6f	non optional	2026-03-28 12:12:02 +03:00
OmercohenAviv	33fb8852e6	ruff	2026-03-28 12:05:30 +03:00
OmercohenAviv	5fe48da2fb	Merge branch 'main' into fix/heartbeat-monitor-configurable	2026-03-28 11:57:23 +03:00
OmercohenAviv	dccd98ec8a	test	2026-03-28 11:53:51 +03:00
Aleix Conchillo Flaqué	a84c69858e	Merge pull request #4185 from pipecat-ai/changelog-0.0.108 Release 0.0.108 - Changelog Update v0.0.108	2026-03-27 21:47:53 -07:00
aconchillo	ca224219dc	Update changelog for version 0.0.108	2026-03-27 21:43:37 -07:00
Aleix Conchillo Flaqué	83dc979d19	Merge pull request #4186 from pipecat-ai/mb/fix-websocket-disconnect-race-condition Fix FastAPI WebSocket disconnect race condition	2026-03-27 21:40:21 -07:00
Aleix Conchillo Flaqué	fc76b3f2fb	update pyproject.toml and uv.lock	2026-03-27 21:36:03 -07:00
Mark Backman	4670370dbb	Add changelog for #4186	2026-03-28 00:02:44 -04:00
Mark Backman	47e53890e3	Fix FastAPI WebSocket disconnect race condition causing pipeline hang When the remote side disconnects while send() is in flight, send() was setting _closing=True. This prevented the receive loop from firing on_client_disconnected, causing the pipeline to hang waiting for a disconnect signal that never came. The fix removes _closing from send() (that flag means we initiated the close) and instead checks Starlette application_state in _can_send() to suppress subsequent sends after a failure. Fixes #3912	2026-03-28 00:01:25 -04:00
Aleix Conchillo Flaqué	195180b6f4	Merge pull request #4184 from pipecat-ai/aleix/fix-sarvam-examples-role Fix Sarvam examples to use 'user' role instead of 'developer'	2026-03-27 20:34:59 -07:00
Aleix Conchillo Flaqué	8b64166bb7	Fix Sarvam examples to use 'user' role instead of 'developer' Sarvam uses the OpenAI-compatible API but does not support the 'developer' role, causing errors. Use 'user' role instead.	2026-03-27 20:33:25 -07:00
Aleix Conchillo Flaqué	1d18995435	Merge pull request #4183 from pipecat-ai/aleix/fix-task-scheduling Yield after create_task to ensure timer tasks are scheduled	2026-03-27 20:32:32 -07:00
Aleix Conchillo Flaqué	ea7324b2ba	Add changelog for #4183	2026-03-27 19:03:55 -07:00
Aleix Conchillo Flaqué	52ed7137af	Yield after create_task to ensure timer tasks are scheduled Add `await asyncio.sleep(0)` after `create_task()` calls in UserIdleController, SpeechTimeoutUserTurnStopStrategy, TurnAnalyzerUserTurnStopStrategy, and UserTurnCompletionLLMServiceMixin so the event loop schedules the newly created timer tasks before the caller continues.	2026-03-27 19:03:23 -07:00
kompfner	b33df03724	Merge pull request #4179 from pipecat-ai/pk/fix-gemini-live-vertex Don't send history_config for Gemini Live Vertex (unsupported)	2026-03-27 17:34:29 -04:00
Paul Kompfner	28fbe1db08	Don't send history_config for Gemini Live Vertex (unsupported)	2026-03-27 17:30:47 -04:00
kompfner	9240e92d9f	Merge pull request #4177 from pipecat-ai/pk/tweak-26i-for-gemini-3.1-flash-live-support Tweak 26i example system instruction for Gemini 3.1 Flash Live compat…	2026-03-27 17:20:06 -04:00

1 2 3 4 5 ...

8730 Commits