pipecat

Author	SHA1	Message	Date
Paul Kompfner	c66a5a8ede	feat: set store=False and add run_inference tests Set store=False in Responses API calls since we send full conversation history as input items and don't use previous_response_id. Add 5 run_inference tests for OpenAIResponsesLLMService using real LLMContext and adapter (only HTTP client mocked).	2026-03-20 21:34:22 -04:00
Paul Kompfner	312837a1d4	test: add run_inference tests for OpenAIResponsesLLMService Uses real LLMContext and adapter (only HTTP client is mocked) to test basic inference, client exception propagation, system_instruction override, empty context fallback, and max_tokens override.	2026-03-20 21:34:22 -04:00
Paul Kompfner	4d4e56cfef	test: add run_inference tests for OpenAIResponsesLLMService Tests cover basic inference, client exception propagation, system_instruction override, and max_tokens override.	2026-03-20 21:34:22 -04:00
Paul Kompfner	4d548117fa	feat: add OpenAI Responses API LLM service Add OpenAIResponsesLLMService using the Responses API, with a dedicated adapter that converts LLMContext messages to Responses API input items (system→developer, tool_calls→function_call, tool→function_call_output, multimodal content conversion, and tools schema flattening). - New adapter: open_ai_responses_adapter.py - New service: openai/responses/llm.py - Examples: 07-interruptible and 14-function-calling variants - 19 unit tests for adapter conversion logic - Eval entries for both examples	2026-03-20 21:34:22 -04:00
Paul Kompfner	0a4acfa294	Add frame_order parameter to SyncParallelPipeline Adds a FrameOrder enum with ARRIVAL (default, existing behavior) and PIPELINE (pushes frames in pipeline definition order). This lets callers guarantee output ordering between parallel pipelines — e.g. ensuring image frames precede audio frames — without needing a separate reordering processor downstream. Updates the 05-sync-speech-and-image example to use FrameOrder.PIPELINE, removing the ImageBeforeAudioReorderer class entirely.	2026-03-20 21:34:22 -04:00
filipi87	d32a8a9ee2	Fixing TTS frame order.	2026-03-20 21:34:21 -04:00
Mark Backman	98d3f697f1	Add WakePhraseUserTurnStartStrategy (#4064 ) - Add WakePhraseUserTurnStartStrategy for gating interaction behind wake phrase detection, with timeout and single_activation modes - Add default_user_turn_start_strategies() and default_user_turn_stop_strategies() helper functions - Deprecate WakeCheckFilter in favor of the new strategy - Extend ProcessFrameResult to stop strategies for short-circuit evaluation - Fix MinWordsUserTurnStartStrategy including filtered text in output	2026-03-20 21:34:21 -04:00
Mark Backman	490e460c4b	Fix DeepgramSTTService base_url forcing HTTPS/WSS schemes The base_url parameter previously forced wss:// and https:// schemes, breaking air-gapped or private deployments that need ws:// or http://. Extract URL derivation into _derive_deepgram_urls() helper that respects the developers scheme choice while deriving the paired WebSocket and HTTP URLs the Deepgram SDK requires. Closes #4019	2026-03-20 21:34:21 -04:00
Mark Backman	4171a75f79	fix: resolve raw language strings through Language enum for proper service conversion Raw strings like "de-DE" passed as the language parameter to TTS/STT services were bypassing the Language enum resolution logic, causing silent failures (e.g. ElevenLabs expects "de" not "de-DE"). Now raw strings are first converted to Language enums so they go through the same resolve_language() path, with a warning logged for unrecognized strings.	2026-03-20 21:34:21 -04:00
Mark Backman	55fb274d5a	Fix stale state in user turn stop strategies between turns Reset stop strategies at turn start (not just turn stop) so that late transcriptions arriving between turns do not leave stale _text that causes premature stops on the next turn. Also cancel pending timeout tasks in reset() for both SpeechTimeout and TurnAnalyzer strategies.	2026-03-20 21:34:21 -04:00
Paul Kompfner	99f28120b7	Remove trailing system→user conversion for cross-call stability Perplexity appears to have statefulness within a conversation, so converting a system message to "user" in one call and then back to "system" in the next (after more messages are appended) causes API errors. Remove the trailing system→user conversion entirely — if the context only has system messages, the API call will fail but the mistake will be caught right away.	2026-03-12 16:07:39 -04:00
Paul Kompfner	e69f5a76e1	Add test for trailing assistant+system ordering, improve docstring Add test exercising the step 3 ordering where stripping a trailing assistant exposes a system message that then gets converted to user. Move the reasoning about when a trailing system message can occur into the docstring.	2026-03-12 15:24:17 -04:00
Paul Kompfner	7f98cc9921	Remove initial system message merging, handle trailing system messages Perplexity allows multiple initial system messages, so don't merge them. Instead, skip system-system pairs during the consecutive same-role merge step. Broaden the trailing message fix to convert any trailing system message to user (not just a lone system message), so contexts with only system messages don't fail.	2026-03-12 15:14:56 -04:00
Paul Kompfner	0373f85b85	Add PerplexityLLMAdapter to enforce Perplexity's message ordering constraints Perplexity's API is stricter than OpenAI about conversation history: - Requires strict alternation between user/tool and assistant messages - Disallows system messages except as the initial message - Requires the last message to be user or tool The new adapter transforms messages before sending to satisfy all three constraints: merging consecutive initial system messages, converting non-initial system to user, merging consecutive same-role messages, and removing trailing assistant messages. Also adds dual-system-instruction warnings to Cerebras, Fireworks, Mistral, Perplexity, and SambaNova services (matching the existing BaseOpenAILLMService pattern), and updates the warning text in BaseOpenAILLMService to be more descriptive.	2026-03-12 14:56:30 -04:00
Aleix Conchillo Flaqué	a4310d4335	Merge pull request #3980 from pipecat-ai/aleix/move-google-vertex-openai Move Google Vertex and OpenAI LLM modules to subpackages	2026-03-10 13:37:02 -07:00
Aleix Conchillo Flaqué	b23652caa6	Update imports to use new google.vertex and google.openai paths	2026-03-10 12:58:04 -07:00
kollaikal-rupesh	80bd935c19	Add ServiceSwitcherStrategyFailover for automatic failover on service errors (#3870 ) * Add ServiceSwitcherStrategyFailover for automatic error-based service switching Introduce a strategy hierarchy: ServiceSwitcherStrategy (base) → ServiceSwitcherStrategyManual (handles ManuallySwitchServiceFrame) → ServiceSwitcherStrategyFailover (adds error-based failover). ServiceSwitcher now defaults to ServiceSwitcherStrategyManual with strategy_type optional. Non-fatal ErrorFrames are forwarded to the strategy via handle_error(). * Move metadata request into _set_active_if_available Requesting metadata is part of making a service active, so it belongs alongside setting _active_service and firing on_service_switched. This removes the duplicate queue_frame calls from ServiceSwitcher push_frame and process_frame.	2026-03-10 15:37:30 -04:00
Mark Backman	9b26faff05	Merge pull request #3961 from ai-coustics/goekmengoergen/sys-663-re-enable-enhancement-level-feature-on-pipecat Add enhancement_level support to `AICFilter`.	2026-03-10 14:24:15 -04:00
Mark Backman	912f1be31c	Add system_instruction parameter to run_inference (#3968 ) * Add system_instruction parameter to run_inference Allow callers to provide a custom system instruction directly when calling run_inference, without having to construct provider-specific context objects. For OpenAI, the instruction is prepended as a system message (preserving existing messages). For Anthropic, Google, and AWS Bedrock, it overrides the single system field with a warning when an existing system instruction is present in the context. * Use system_instruction parameter in _generate_summary Pass the summarization prompt via run_inference's system_instruction parameter instead of embedding it as a system message in the context. * Add changelog for #3968	2026-03-10 12:57:23 -04:00
Gökmen Görgen	3a6f848a5b	update test description.	2026-03-10 14:54:49 +01:00
Gökmen Görgen	a96702acfc	fix test.	2026-03-10 14:41:18 +01:00
Gökmen Görgen	bc11bf9673	remove `_is_filter_enabled` from `AICFilter` and refactor related logic and tests.	2026-03-10 13:48:32 +01:00
Gökmen Görgen	0c87fcc48c	re-add bypass parameter support to `AICFilter` and update related unit tests.	2026-03-10 13:36:15 +01:00
Gökmen Görgen	df64f3f943	add enhancement_level support to `AICFilter`. # Conflicts: # src/pipecat/audio/filters/aic_filter.py	2026-03-10 13:36:15 +01:00
kompfner	02791cd503	Merge pull request #3965 from pipecat-ai/pk/fix-integration-tests Fix broken `test_unified_function_calling_anthropic` due to use of an…	2026-03-09 13:42:35 -04:00
Mark Backman	786279f143	Remove unused imports, 2026-03-07	2026-03-09 12:44:47 -04:00
Paul Kompfner	9423d22051	Fix broken `test_unified_function_calling_anthropic` due to use of an unsupported/deprecated model. Update the tests in test_integration_unified_function_calling.py to not specify particular models but instead just use service defaults (the tests shouldn't be model-dependent anyway)	2026-03-09 12:07:56 -04:00
filipi87	f0c5925a79	Fixing Piper test.	2026-03-09 12:07:45 -03:00
Filipi da Silva Fuchter	16336a3ea4	Merge pull request #3937 from pipecat-ai/filipi/fix_orphan_function_call Fix context summarization leaving orphaned tool responses in kept context.	2026-03-09 09:19:17 -04:00
filipi87	4557ef8c42	Renaming method to _get_earliest_function_call_not_resolved_in_range	2026-03-09 10:16:02 -03:00
Mark Backman	efda57de5c	Move turn completion instructions to system_instruction Turn completion instructions were being injected as a system message in the LLM context, which caused warning spam when system_instruction was also set, did not persist across full context updates, and broke LLMs that do not support consecutive system messages. Instead, compose the turn completion instructions into the LLM service system_instruction field. This is managed via _base_system_instruction which stores the original value for restoration when turn completion is disabled.	2026-03-08 10:41:40 -04:00
radhikagpt1208	b14c8e0e94	Fix turn completion mixin not resetting state after each LLM response	2026-03-08 08:46:45 -04:00
Mark Backman	1cfaea2007	Address code review feedback	2026-03-07 07:42:42 -05:00
Filipi da Silva Fuchter	4b9fc8a30c	Merge pull request #3804 from pipecat-ai/filipi/concurrent_audio_contexts Allowing concurrent audio contexts	2026-03-06 14:49:57 -05:00
filipi87	24430d8d45	Fixing Piper test.	2026-03-06 16:15:26 -03:00
Paul Kompfner	f4c039048c	Adopt the `settings` pattern for Grok Realtime session properties Move `session_properties` into `GrokRealtimeLLMSettings`, making `settings` the canonical way to configure Grok Realtime — matching the pattern used across the rest of the codebase. The `session_properties` init arg is now deprecated in favor of `settings=GrokRealtimeLLMSettings(session_properties=...)`. `system_instruction` is synced bidirectionally between the top-level settings field and `session_properties.instructions`, with top-level taking precedence on conflict. (Unlike OpenAI Realtime, Grok's `SessionProperties` has no `model` field, so no model sync is needed.)	2026-03-06 12:53:26 -05:00
Paul Kompfner	bd4229ea9d	Adopt the `settings` pattern for OpenAI Realtime session properties Move `session_properties` into `OpenAIRealtimeLLMSettings`, making `settings` the canonical way to configure OpenAI Realtime — matching the pattern used across the rest of the codebase. The `session_properties` init arg is now deprecated in favor of `settings=OpenAIRealtimeLLMSettings(session_properties=...)`. `model` and `system_instruction` are synced bidirectionally between the top-level settings fields and `session_properties.model`/`.instructions`, with top-level taking precedence on conflict.	2026-03-06 11:46:21 -05:00
filipi87	4ef3b52c72	Fix context summarization leaving orphaned tool responses in kept context.	2026-03-06 11:40:27 -03:00
Mark Backman	14c3a88f02	Fix tests	2026-03-06 08:29:14 -05:00
Mark Backman	a4375274b2	Add Settings subclasses to all services and auto-discovered init tests - Add dedicated Settings subclasses to 20 LLM services that were borrowing parent Settings classes (e.g. AzureLLMSettings, GroqLLMSettings) so users don't need cross-module imports - Fix field defaults to NOT_GIVEN in BaseWhisperSTTSettings, OpenAIRealtimeSTTSettings, and NvidiaSegmentedSTTSettings for delta-mode safety - Fix incomplete default_settings in AWS, Cartesia, ElevenLabs, Fish, and Whisper services so validate_complete() passes - Add auto-discovered tests that verify all Settings classes default to NOT_GIVEN (delta safety) and all services initialize with complete settings (store completeness)	2026-03-06 08:29:14 -05:00
Mark Backman	034e81ff18	Update STT service settings	2026-03-06 08:29:14 -05:00
Mark Backman	b358657a79	Make max_context_tokens and max_unsummarized_messages independently optional Allow either threshold to be set to None to cleanly disable that trigger, instead of requiring users to set a very large number as a workaround. At least one of the two must remain set (validated at construction time).	2026-03-03 20:08:22 -05:00
Mark Backman	ca0ec16373	Merge pull request #3889 from ai-coustics/goedev/aic-voice-focus-and-memoryview-fix AIC Voice Focus version update & concurrency safety issue on audio buffer.	2026-03-03 09:28:13 -05:00
filipi87	fc905a7ef5	Merge branch 'main' into filipi/deepgram # Conflicts: # src/pipecat/services/deepgram/stt_sagemaker.py	2026-03-03 10:54:30 -03:00
Mark Backman	aad1211a57	Merge pull request #3885 from pipecat-ai/mb/latency-breakdown Add latency breakdown to UserBotLatencyObserver	2026-03-02 19:27:35 -05:00
Mark Backman	7dbb130666	Add chronological_events utility function to display UserBotLatencyObserver report	2026-03-02 19:23:42 -05:00
Aleix Conchillo Flaqué	44466cfa07	Merge pull request #3896 from pipecat-ai/aleix/broadcast-interruption Add broadcast_interruption() to FrameProcessor	2026-03-02 13:36:39 -08:00
Aleix Conchillo Flaqué	4a61d5bfad	Add broadcast_interruption() to FrameProcessor Replace the round-trip push_interruption_task_frame_and_wait() mechanism with broadcast_interruption(), which pushes an InterruptionFrame both upstream and downstream directly from the calling processor. This eliminates race conditions (transcription arriving before the InterruptionFrame comes back), swallowed-event timeouts (frame blocked before reaching the sink), and the complexity of _wait_for_interruption flag / queue bypass / frame.complete() obligations. - Add broadcast_interruption() to FrameProcessor - Deprecate push_interruption_task_frame_and_wait() (delegates to new method) - Remove event field and complete() from InterruptionFrame/InterruptionTaskFrame - Remove _wait_for_interruption flag and all special-case logic - Remove frame.complete() calls in stt_mute_filter and llm_response_universal - Update all 17 call sites to use broadcast_interruption() - Update tests	2026-03-02 13:26:45 -08:00
Mark Backman	ff5b985009	Convert observer data models to Pydantic BaseModel with timestamps Enables .model_dump() serialization for Pipecat Cloud collection. All metrics now include start_time (Unix timestamp) for timeline plotting alongside duration_secs.	2026-03-02 16:11:43 -05:00
Mark Backman	a738a4d82b	Add function call latency tracking to LatencyBreakdown	2026-03-02 16:11:43 -05:00

1 2 3 4 5 ...

380 Commits