pipecat

Author	SHA1	Message	Date
Mark Backman	41e3afbc2f	Remove deprecated add_pattern_pair method from PatternPairAggregator	2026-04-02 10:28:01 -04:00
kompfner	a3c7f6c2af	Merge pull request #4215 from pipecat-ai/pk/remove-openaillmcontext Remove deprecated `OpenAILLMContext` as well as everything (code path…	2026-04-01 14:03:35 -04:00
Paul Kompfner	ebab75765d	Fix stream cancellation tests to mock get_chat_completions The tests were mocking the removed _stream_chat_completions_*_context methods. Update them to mock get_chat_completions instead.	2026-03-31 18:54:23 -04:00
Paul Kompfner	394599d031	Remove deprecated `OpenAILLMContext` as well as everything (code paths or whole types) dependent on it (all of which were also deprecated)	2026-03-31 18:15:25 -04:00
mattie ruth backman	0f47076703	More RTVI version parsing improvements	2026-03-31 16:05:53 -04:00
mattie ruth backman	3e255f3d21	improve version format check	2026-03-31 16:05:53 -04:00
mattie ruth backman	565b9b961d	add tests for rtvi versioning	2026-03-31 16:05:53 -04:00
Mark Backman	7501effad5	Remove deprecated service module shims and old implementations Delete deprecated import shims that only re-export from new locations: - services/ai_services.py - services/gemini_multimodal_live/ - services/aws_nova_sonic/ - services/openai_realtime/ - services/deepgram/{stt,tts}_sagemaker.py - services/google/{llm_openai,llm_vertex,google}.py - services/google/gemini_live/llm_vertex.py - services/riva/ - services/nim/ Remove deprecated implementations replaced by newer services: - services/openai_realtime_beta/ (use openai.realtime) - services/google/openai/ (use google.llm) Also removes associated examples and tests for deleted services.	2026-03-31 15:34:14 -04:00
Paul Kompfner	712e42533d	Introduce WebsocketLLMService and refactor OpenAIResponsesLLMService to use it Add WebsocketLLMService as a base class for WebSocket-based LLM services, parallel to WebsocketTTSService/WebsocketSTTService but codifying a transactional request-response model rather than a continuous background receive loop. WebsocketLLMService provides: - Connection lifecycle (start/stop/cancel → connect/disconnect) - _ws_send/_ws_recv with transparent ConnectionClosed handling (auto-reconnect via exponential backoff → WebsocketReconnectedError) - _ensure_connected with retry via _try_reconnect OpenAIResponsesLLMService now inherits from WebsocketLLMService, removing duplicated connection management code (_connect, _disconnect, _reconnect, _ensure_connected, _ws_send, start, stop, cancel) and simplifying _process_context from a loop with attempt tracking to a flat try/except with a single retry.	2026-03-30 22:26:31 -04:00
Paul Kompfner	26f85687d6	Handle response cancellation by draining before next inference Instead of trying to filter stale events inline (unreliable — the API doesn't provide a way to correlate events to a specific response), drain remaining events from a cancelled response before starting the next one. On cancellation, send response.cancel and set a drain flag. At the start of the next _process_context, read and discard events until a terminal event arrives, ensuring a clean connection. Falls back to reconnecting if draining times out.	2026-03-30 09:59:03 -04:00
Paul Kompfner	9defff2a34	Skip server-known output items in previous_response_id optimization When using previous_response_id, the server already knows its own output from the previous response. Store the raw response output and, on the next call, compare it against the items following the matched input prefix — checking role and text content for messages, and call_id for function calls. If the items match, skip them and send only truly new input (user messages, tool results). Falls back to full context if either the prefix or the output comparison fails.	2026-03-30 09:59:03 -04:00
Paul Kompfner	f2a8a9e753	Add WebSocket-based OpenAI Responses LLM service with previous_response_id optimization Introduce a WebSocket variant of the OpenAI Responses API service that maintains a persistent connection to wss://api.openai.com/v1/responses for lower-latency inference. The WebSocket variant automatically uses previous_response_id to send only incremental context when possible, falling back to full context on reconnection or cache miss. The WebSocket variant becomes the new default OpenAIResponsesLLMService, and the HTTP variant is renamed to OpenAIResponsesHttpLLMService. Both share a private base class with common settings, parameter building, and run_inference (always HTTP) logic.	2026-03-30 09:58:56 -04:00
Mark Backman	8c9e189394	Fix langchain imports for langchain 1.x compatibility ChatPromptTemplate moved from langchain.prompts to langchain_core.prompts in langchain 1.x.	2026-03-29 10:27:48 -04:00
OmercohenAviv	5fe48da2fb	Merge branch 'main' into fix/heartbeat-monitor-configurable	2026-03-28 11:57:23 +03:00
OmercohenAviv	dccd98ec8a	test	2026-03-28 11:53:51 +03:00
Mark Backman	47e53890e3	Fix FastAPI WebSocket disconnect race condition causing pipeline hang When the remote side disconnects while send() is in flight, send() was setting _closing=True. This prevented the receive loop from firing on_client_disconnected, causing the pipeline to hang waiting for a disconnect signal that never came. The fix removes _closing from send() (that flag means we initiated the close) and instead checks Starlette application_state in _can_send() to suppress subsequent sends after a failure. Fixes #3912	2026-03-28 00:01:25 -04:00
Mark Backman	5c51981207	Merge pull request #4149 from pipecat-ai/mb/fix-service-switcher-passthrough-errors Fix ServiceSwitcher reacting to pass-through ErrorFrames	2026-03-26 16:34:45 -04:00
Mark Backman	c331c75d66	Add tests for send_media() exception handling in DeepgramSTTService	2026-03-26 09:20:58 -04:00
Mark Backman	7fef3b01eb	Merge pull request #4142 from pipecat-ai/mb/grok-move-to-xai-module Consolidate Grok services into xai module	2026-03-25 23:32:18 -04:00
Mark Backman	fdbdbc8be3	Fix ServiceSwitcher reacting to pass-through ErrorFrames from other pipeline stages ErrorFrames propagating upstream from downstream processors (e.g. TTS) would enter the ServiceSwitcher via process_frame, traverse the active service sub-pipeline, and reach push_frame where they incorrectly triggered failover. Now only errors whose processor is one of the managed services trigger handle_error. Also fix the log in handle_error to attribute errors to the actual source processor rather than the current active_service. Closes #4139	2026-03-25 22:53:04 -04:00
filipi87	413dbaf974	Automated tests to validate the silence injection guards.	2026-03-25 16:05:58 -03:00
filipi87	da3f184316	Automated tests to validate the silence injection guards.	2026-03-25 15:38:21 -03:00
Mark Backman	4ee4002d5d	Merge pull request #4137 from pipecat-ai/mb/language-string-log-level-debug Downgrade unrecognized language string log from warning to debug	2026-03-25 12:26:46 -04:00
Mark Backman	1c99a537b2	Consolidate Grok services into xai module Both GrokLLMService and XAIHttpTTSService use the same xAI API (api.x.ai), so move Grok source files into the xai module. Leave deprecation shims in the old grok/ paths for backward compatibility.	2026-03-25 12:07:40 -04:00
Nicholas Zhao	bbd14de9c5	Address PR review: rename to XAIHttpTTSService, add language map, clean up API - Rename XAITTSService → XAIHttpTTSService and XAITTSSettings → XAIHttpTTSSettings - Add language_to_xai_language() with explicit LANGUAGE_MAP using resolve_language() - Remove deprecated InputParams, params, voice, language init params - Remove XAI_DEFAULT_SAMPLE_RATE and XAI_PCM_CODEC constants; add encoding param - Set sample_rate=None default (picked up from PipelineParams or user) - Use Language.EN enum instead of string "en" for default language - Add changelog/4031.added.md - Add 07e-interruptible-xai.py foundational example - Update 14g-function-calling-grok.py to use XAIHttpTTSService - Register 07e in run-release-evals.py	2026-03-25 10:46:54 -04:00
Nicholas Zhao	02b97035f8	Add xAI TTS service	2026-03-25 10:45:15 -04:00
Mark Backman	f470ff193e	Update language tests to expect debug instead of warning	2026-03-25 10:26:10 -04:00
Paul Kompfner	bb33045389	Add system instruction conflict resolution tests for realtime adapters Test that OpenAI Realtime, Grok Realtime, and Nova Sonic adapters prefer init-provided system_instruction over context-provided, warn on conflicts, and don't warn for developer messages.	2026-03-24 17:30:35 -04:00
Paul Kompfner	4c121332cf	Convert developer messages to user for Cerebras (and lay groundwork for other incompatible services) OpenAI-compatible services that don't support the "developer" message role can now set supports_developer_role = False on the service class. BaseOpenAILLMService passes this as convert_developer_to_user to the adapter, which converts developer messages to user messages before sending them to the API. Applied to Cerebras and Perplexity. Also removes the now-redundant developer→user conversion step from PerplexityLLMAdapter (handled by the parent adapter via the flag).	2026-03-24 16:05:15 -04:00
Paul Kompfner	0530722c58	Convert developer messages to user in Perplexity adapter Perplexity doesn't support the "developer" role. Developer messages are now converted to "user" before other transformations are applied.	2026-03-24 16:05:15 -04:00
Paul Kompfner	0d1b834770	Add developer message support to realtime adapters OpenAI Realtime, Grok Realtime, and AWS Nova Sonic adapters now convert "developer" role messages to "user" (consistent with all other non-OpenAI adapters). Previously these messages were silently dropped. Adds starter unit tests for all three realtime adapters.	2026-03-24 16:05:15 -04:00
Paul Kompfner	2135557689	Simplify: don't promote developer messages to system instruction Developer messages are now always converted to "user" in non-OpenAI adapters, never promoted to the system instruction. This removes an inconsistency where adding an unrelated message to context would change whether a developer message got promoted. Simplifications: - Rename _extract_initial_system_or_developer → _extract_initial_system - Return Optional[str] instead of Tuple (role is always "system") - Drop initial_context_message_role from _resolve_system_instruction - Drop system_role fields from all ConvertedMessages dataclasses	2026-03-24 16:02:42 -04:00
Paul Kompfner	a0393b9af6	Fix: warn on system_instruction conflict even with single system message When the only message in context was a system message, _extract_initial_system_or_developer would convert it to "user" (to prevent empty history) without warning about the conflict with system_instruction. Now warns inline before converting, with a message explaining both the conflict and the user-role conversion.	2026-03-24 16:02:42 -04:00
Paul Kompfner	64ba013b68	Move OpenAI Responses adapter tests into test_get_llm_invocation_params.py Consolidates all adapter get_llm_invocation_params tests in one file. Adds new tests for developer message handling in the Responses adapter.	2026-03-24 16:02:42 -04:00
Paul Kompfner	7377d88cf5	Move system_instruction tests into test_get_llm_invocation_params.py	2026-03-24 16:02:42 -04:00
Paul Kompfner	d4dea30407	Centralize system message handling in adapters; add developer message support Two goals: 1. Centralize system_instruction vs context system message resolution into the LLM adapters. This eliminates duplication between in-pipeline and out-of-band (run_inference) code paths across ~16 locations in service llm.py files. 2. Add support for "developer" role messages in conversation context, which is facilitated by the above centralization. Shared helpers on BaseLLMAdapter: - _extract_initial_system_or_developer: extracts/converts messages[0] based on role and whether system_instruction is provided - _resolve_system_instruction: warns on conflicts between system_instruction and context system messages, returns the effective instruction Developer message handling (new): - Non-OpenAI adapters: an initial "developer" message is promoted to the system instruction when no system_instruction is provided; otherwise it is converted to "user". Subsequent "developer" messages are always converted to "user". No conflict warning is emitted for developer messages (unlike "system" messages). - OpenAI adapter: "developer" messages pass through in conversation history without triggering conflict warnings. - OpenAI Responses adapter: "developer" messages are kept as "developer" role (same as "system", which is also converted to "developer" for the Responses API). Other behavior changes: - Gemini: "initial" system message detection now checks messages[0] only (previously searched anywhere in the list) - Bedrock: a lone system message is now converted to "user" instead of being extracted to an empty message list (matches existing Anthropic behavior)	2026-03-24 16:02:42 -04:00
Mark Backman	5d71de8aad	Fix LLMFullResponseEndFrame racing ahead of final TTSTextFrame Route LLMFullResponseEndFrame through the serialization queue instead of pushing it directly downstream when push_text_frames is enabled. This ensures the frame is emitted only after the audio context is fully drained, preserving correct ordering relative to TTSTextFrames. Previously, the final sentence TTSTextFrame would arrive at the LLMAssistantAggregator after LLMFullResponseEndFrame, causing it to be dropped from the conversation context (especially with RTVI text input where no subsequent interruption would flush the orphaned text).	2026-03-24 15:09:42 -04:00
Filipi da Silva Fuchter	5ed183d215	Merge pull request #4022 from krispai/krisp-viva-vad-support Draft Implementation for Krisp VIVA VAD.	2026-03-24 09:44:32 -04:00
Mark Backman	5c3d3aea2b	Merge pull request #4115 from pipecat-ai/mb/user-turn-stop-warnings Warn when VAD stop_secs misconfiguration may degrade turn detection	2026-03-24 09:32:20 -04:00
Alex-wuhu	8c6f4a8d7b	Add Novita AI LLM service provider	2026-03-24 09:20:50 -04:00
Mark Backman	483b643b07	Warn when VAD stop_secs misconfiguration may degrade turn detection Add warnings in SpeechTimeoutUserTurnStopStrategy and TurnAnalyzerUserTurnStopStrategy when stop_secs differs from the recommended default (0.2s) or when stop_secs >= STT p99 latency, which collapses the STT wait timeout to 0s. Document the stop_secs=0.2 assumption in stt_latency.py.	2026-03-23 17:57:51 -04:00
Garegin Harutyunyan	f1f51de962	Merge branch 'main' into krisp-viva-vad-support	2026-03-23 18:35:58 +04:00
Garegin Harutyunyan	c32240e14b	Fixed review comments.	2026-03-23 17:44:48 +04:00
Pablo Ois Lagarde	bc0e7130b8	fix: always include parameters field in Genesys AudioHook messages The AudioHook protocol requires every message to carry a `parameters` object. `_create_message` conditionally included it only when parameters were truthy, so pong responses and closed responses without outputVariables were sent without the field. Clients that validate message structure (including the Genesys reference implementation) rejected these messages, which broke server sequence tracking and prevented outputVariables from reaching the Architect flow. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 16:37:53 -03:00
kompfner	488dc1d07e	Merge pull request #4074 from pipecat-ai/pk/openai-responses-llm-service feat: add OpenAI Responses API LLM service	2026-03-19 15:44:26 -04:00
Paul Kompfner	348df9d4ce	fix: remove redundant instructions override in run_inference The override would re-add `instructions` after the adapter had intentionally converted it to a developer message for empty contexts. Added a regression test.	2026-03-19 13:34:41 -04:00
Paul Kompfner	d702ebd6a2	Add frame_order parameter to SyncParallelPipeline Adds a FrameOrder enum with ARRIVAL (default, existing behavior) and PIPELINE (pushes frames in pipeline definition order). This lets callers guarantee output ordering between parallel pipelines — e.g. ensuring image frames precede audio frames — without needing a separate reordering processor downstream. Updates the 05-sync-speech-and-image example to use FrameOrder.PIPELINE, removing the ImageBeforeAudioReorderer class entirely.	2026-03-19 09:43:51 -04:00
filipi87	5fd98e1391	Fixing TTS frame order.	2026-03-19 09:43:40 -03:00
Mark Backman	bad10177d4	Add WakePhraseUserTurnStartStrategy (#4064 ) - Add WakePhraseUserTurnStartStrategy for gating interaction behind wake phrase detection, with timeout and single_activation modes - Add default_user_turn_start_strategies() and default_user_turn_stop_strategies() helper functions - Deprecate WakeCheckFilter in favor of the new strategy - Extend ProcessFrameResult to stop strategies for short-circuit evaluation - Fix MinWordsUserTurnStartStrategy including filtered text in output	2026-03-18 16:47:17 -04:00
Paul Kompfner	951bb0c1a7	feat: set store=False and add run_inference tests Set store=False in Responses API calls since we send full conversation history as input items and don't use previous_response_id. Add 5 run_inference tests for OpenAIResponsesLLMService using real LLMContext and adapter (only HTTP client mocked).	2026-03-18 14:47:12 -04:00

1 2 3 4 5 ...

430 Commits