pipecat

Author	SHA1	Message	Date
Paul Kompfner	e69f5a76e1	Add test for trailing assistant+system ordering, improve docstring Add test exercising the step 3 ordering where stripping a trailing assistant exposes a system message that then gets converted to user. Move the reasoning about when a trailing system message can occur into the docstring.	2026-03-12 15:24:17 -04:00
Paul Kompfner	7f98cc9921	Remove initial system message merging, handle trailing system messages Perplexity allows multiple initial system messages, so don't merge them. Instead, skip system-system pairs during the consecutive same-role merge step. Broaden the trailing message fix to convert any trailing system message to user (not just a lone system message), so contexts with only system messages don't fail.	2026-03-12 15:14:56 -04:00
Paul Kompfner	0373f85b85	Add PerplexityLLMAdapter to enforce Perplexity's message ordering constraints Perplexity's API is stricter than OpenAI about conversation history: - Requires strict alternation between user/tool and assistant messages - Disallows system messages except as the initial message - Requires the last message to be user or tool The new adapter transforms messages before sending to satisfy all three constraints: merging consecutive initial system messages, converting non-initial system to user, merging consecutive same-role messages, and removing trailing assistant messages. Also adds dual-system-instruction warnings to Cerebras, Fireworks, Mistral, Perplexity, and SambaNova services (matching the existing BaseOpenAILLMService pattern), and updates the warning text in BaseOpenAILLMService to be more descriptive.	2026-03-12 14:56:30 -04:00
Aleix Conchillo Flaqué	a4310d4335	Merge pull request #3980 from pipecat-ai/aleix/move-google-vertex-openai Move Google Vertex and OpenAI LLM modules to subpackages	2026-03-10 13:37:02 -07:00
Aleix Conchillo Flaqué	b23652caa6	Update imports to use new google.vertex and google.openai paths	2026-03-10 12:58:04 -07:00
kollaikal-rupesh	80bd935c19	Add ServiceSwitcherStrategyFailover for automatic failover on service errors (#3870 ) * Add ServiceSwitcherStrategyFailover for automatic error-based service switching Introduce a strategy hierarchy: ServiceSwitcherStrategy (base) → ServiceSwitcherStrategyManual (handles ManuallySwitchServiceFrame) → ServiceSwitcherStrategyFailover (adds error-based failover). ServiceSwitcher now defaults to ServiceSwitcherStrategyManual with strategy_type optional. Non-fatal ErrorFrames are forwarded to the strategy via handle_error(). * Move metadata request into _set_active_if_available Requesting metadata is part of making a service active, so it belongs alongside setting _active_service and firing on_service_switched. This removes the duplicate queue_frame calls from ServiceSwitcher push_frame and process_frame.	2026-03-10 15:37:30 -04:00
Mark Backman	9b26faff05	Merge pull request #3961 from ai-coustics/goekmengoergen/sys-663-re-enable-enhancement-level-feature-on-pipecat Add enhancement_level support to `AICFilter`.	2026-03-10 14:24:15 -04:00
Mark Backman	912f1be31c	Add system_instruction parameter to run_inference (#3968 ) * Add system_instruction parameter to run_inference Allow callers to provide a custom system instruction directly when calling run_inference, without having to construct provider-specific context objects. For OpenAI, the instruction is prepended as a system message (preserving existing messages). For Anthropic, Google, and AWS Bedrock, it overrides the single system field with a warning when an existing system instruction is present in the context. * Use system_instruction parameter in _generate_summary Pass the summarization prompt via run_inference's system_instruction parameter instead of embedding it as a system message in the context. * Add changelog for #3968	2026-03-10 12:57:23 -04:00
Gökmen Görgen	3a6f848a5b	update test description.	2026-03-10 14:54:49 +01:00
Gökmen Görgen	a96702acfc	fix test.	2026-03-10 14:41:18 +01:00
Gökmen Görgen	bc11bf9673	remove `_is_filter_enabled` from `AICFilter` and refactor related logic and tests.	2026-03-10 13:48:32 +01:00
Gökmen Görgen	0c87fcc48c	re-add bypass parameter support to `AICFilter` and update related unit tests.	2026-03-10 13:36:15 +01:00
Gökmen Görgen	df64f3f943	add enhancement_level support to `AICFilter`. # Conflicts: # src/pipecat/audio/filters/aic_filter.py	2026-03-10 13:36:15 +01:00
kompfner	02791cd503	Merge pull request #3965 from pipecat-ai/pk/fix-integration-tests Fix broken `test_unified_function_calling_anthropic` due to use of an…	2026-03-09 13:42:35 -04:00
Mark Backman	786279f143	Remove unused imports, 2026-03-07	2026-03-09 12:44:47 -04:00
Paul Kompfner	9423d22051	Fix broken `test_unified_function_calling_anthropic` due to use of an unsupported/deprecated model. Update the tests in test_integration_unified_function_calling.py to not specify particular models but instead just use service defaults (the tests shouldn't be model-dependent anyway)	2026-03-09 12:07:56 -04:00
filipi87	f0c5925a79	Fixing Piper test.	2026-03-09 12:07:45 -03:00
Filipi da Silva Fuchter	16336a3ea4	Merge pull request #3937 from pipecat-ai/filipi/fix_orphan_function_call Fix context summarization leaving orphaned tool responses in kept context.	2026-03-09 09:19:17 -04:00
filipi87	4557ef8c42	Renaming method to _get_earliest_function_call_not_resolved_in_range	2026-03-09 10:16:02 -03:00
Mark Backman	efda57de5c	Move turn completion instructions to system_instruction Turn completion instructions were being injected as a system message in the LLM context, which caused warning spam when system_instruction was also set, did not persist across full context updates, and broke LLMs that do not support consecutive system messages. Instead, compose the turn completion instructions into the LLM service system_instruction field. This is managed via _base_system_instruction which stores the original value for restoration when turn completion is disabled.	2026-03-08 10:41:40 -04:00
radhikagpt1208	b14c8e0e94	Fix turn completion mixin not resetting state after each LLM response	2026-03-08 08:46:45 -04:00
Mark Backman	1cfaea2007	Address code review feedback	2026-03-07 07:42:42 -05:00
Filipi da Silva Fuchter	4b9fc8a30c	Merge pull request #3804 from pipecat-ai/filipi/concurrent_audio_contexts Allowing concurrent audio contexts	2026-03-06 14:49:57 -05:00
filipi87	24430d8d45	Fixing Piper test.	2026-03-06 16:15:26 -03:00
Paul Kompfner	f4c039048c	Adopt the `settings` pattern for Grok Realtime session properties Move `session_properties` into `GrokRealtimeLLMSettings`, making `settings` the canonical way to configure Grok Realtime — matching the pattern used across the rest of the codebase. The `session_properties` init arg is now deprecated in favor of `settings=GrokRealtimeLLMSettings(session_properties=...)`. `system_instruction` is synced bidirectionally between the top-level settings field and `session_properties.instructions`, with top-level taking precedence on conflict. (Unlike OpenAI Realtime, Grok's `SessionProperties` has no `model` field, so no model sync is needed.)	2026-03-06 12:53:26 -05:00
Paul Kompfner	bd4229ea9d	Adopt the `settings` pattern for OpenAI Realtime session properties Move `session_properties` into `OpenAIRealtimeLLMSettings`, making `settings` the canonical way to configure OpenAI Realtime — matching the pattern used across the rest of the codebase. The `session_properties` init arg is now deprecated in favor of `settings=OpenAIRealtimeLLMSettings(session_properties=...)`. `model` and `system_instruction` are synced bidirectionally between the top-level settings fields and `session_properties.model`/`.instructions`, with top-level taking precedence on conflict.	2026-03-06 11:46:21 -05:00
filipi87	4ef3b52c72	Fix context summarization leaving orphaned tool responses in kept context.	2026-03-06 11:40:27 -03:00
Mark Backman	14c3a88f02	Fix tests	2026-03-06 08:29:14 -05:00
Mark Backman	a4375274b2	Add Settings subclasses to all services and auto-discovered init tests - Add dedicated Settings subclasses to 20 LLM services that were borrowing parent Settings classes (e.g. AzureLLMSettings, GroqLLMSettings) so users don't need cross-module imports - Fix field defaults to NOT_GIVEN in BaseWhisperSTTSettings, OpenAIRealtimeSTTSettings, and NvidiaSegmentedSTTSettings for delta-mode safety - Fix incomplete default_settings in AWS, Cartesia, ElevenLabs, Fish, and Whisper services so validate_complete() passes - Add auto-discovered tests that verify all Settings classes default to NOT_GIVEN (delta safety) and all services initialize with complete settings (store completeness)	2026-03-06 08:29:14 -05:00
Mark Backman	034e81ff18	Update STT service settings	2026-03-06 08:29:14 -05:00
Mark Backman	b358657a79	Make max_context_tokens and max_unsummarized_messages independently optional Allow either threshold to be set to None to cleanly disable that trigger, instead of requiring users to set a very large number as a workaround. At least one of the two must remain set (validated at construction time).	2026-03-03 20:08:22 -05:00
Mark Backman	ca0ec16373	Merge pull request #3889 from ai-coustics/goedev/aic-voice-focus-and-memoryview-fix AIC Voice Focus version update & concurrency safety issue on audio buffer.	2026-03-03 09:28:13 -05:00
filipi87	fc905a7ef5	Merge branch 'main' into filipi/deepgram # Conflicts: # src/pipecat/services/deepgram/stt_sagemaker.py	2026-03-03 10:54:30 -03:00
Mark Backman	aad1211a57	Merge pull request #3885 from pipecat-ai/mb/latency-breakdown Add latency breakdown to UserBotLatencyObserver	2026-03-02 19:27:35 -05:00
Mark Backman	7dbb130666	Add chronological_events utility function to display UserBotLatencyObserver report	2026-03-02 19:23:42 -05:00
Aleix Conchillo Flaqué	44466cfa07	Merge pull request #3896 from pipecat-ai/aleix/broadcast-interruption Add broadcast_interruption() to FrameProcessor	2026-03-02 13:36:39 -08:00
Aleix Conchillo Flaqué	4a61d5bfad	Add broadcast_interruption() to FrameProcessor Replace the round-trip push_interruption_task_frame_and_wait() mechanism with broadcast_interruption(), which pushes an InterruptionFrame both upstream and downstream directly from the calling processor. This eliminates race conditions (transcription arriving before the InterruptionFrame comes back), swallowed-event timeouts (frame blocked before reaching the sink), and the complexity of _wait_for_interruption flag / queue bypass / frame.complete() obligations. - Add broadcast_interruption() to FrameProcessor - Deprecate push_interruption_task_frame_and_wait() (delegates to new method) - Remove event field and complete() from InterruptionFrame/InterruptionTaskFrame - Remove _wait_for_interruption flag and all special-case logic - Remove frame.complete() calls in stt_mute_filter and llm_response_universal - Update all 17 call sites to use broadcast_interruption() - Update tests	2026-03-02 13:26:45 -08:00
Mark Backman	ff5b985009	Convert observer data models to Pydantic BaseModel with timestamps Enables .model_dump() serialization for Pipecat Cloud collection. All metrics now include start_time (Unix timestamp) for timeline plotting alongside duration_secs.	2026-03-02 16:11:43 -05:00
Mark Backman	a738a4d82b	Add function call latency tracking to LatencyBreakdown	2026-03-02 16:11:43 -05:00
Mark Backman	ddba1b84a9	Add first-bot-speech latency to UserBotLatencyObserver Measure time from ClientConnectedFrame to first BotStartedSpeakingFrame, emitting a one-time on_first_bot_speech_latency event with breakdown.	2026-03-02 16:11:43 -05:00
Mark Backman	18155b6a63	Add latency breakdown to UserBotLatencyObserver Add per-service latency breakdown metrics alongside existing user-to-bot latency measurement. When enable_metrics=True, the observer now emits an on_latency_breakdown event with TTFB, text aggregation, and user turn duration metrics collected between VADUserStoppedSpeakingFrame and BotStartedSpeakingFrame. - Add LatencyBreakdown dataclass with ttfb, text_aggregation, user_turn_secs fields - Accumulate MetricsFrame data during user→bot cycles - Reset accumulators on InterruptionFrame to discard stale metrics - Measure user_turn_secs from actual user silence (VAD timestamp - stop_secs) to turn release (UserStoppedSpeakingFrame) - Filter zero-value TTFB entries from startup metric resets - Add frame deduplication using bounded deque + set pattern - Update example 29 with latency breakdown display	2026-03-02 16:11:43 -05:00
Mark Backman	bbbfdfd321	Replace per-processor start_time with start_offset_secs Use start_offset_secs (offset from StartFrame) on ProcessorStartupTiming instead of a wall-clock timestamp. Reports keep a single start_time anchor for dashboard visualization. Remove _mono_to_wall conversion.	2026-03-02 14:07:34 -05:00
Mark Backman	75669b12a2	Convert observer data models to Pydantic BaseModel with timestamps Switch ProcessorStartupTiming, StartupTimingReport, and TransportTimingReport from dataclasses to Pydantic BaseModel. Add start_time (Unix timestamp) fields and wall clock conversion for monotonic observer timestamps.	2026-03-02 13:10:09 -05:00
Mark Backman	68e8732e72	Add BotConnectedFrame and on_transport_timing_report event Add BotConnectedFrame (SystemFrame) pushed by SFU transports (Daily, LiveKit, HeyGen, Tavus) when the bot joins the room. Replace the on_transport_readiness_measured event with on_transport_timing_report which includes both bot_connected_secs and client_connected_secs.	2026-03-02 13:10:09 -05:00
Mark Backman	0836066898	Add ClientConnectedFrame and transport readiness timing Introduce ClientConnectedFrame (SystemFrame) pushed by all transports when a client connects. StartupTimingObserver uses this to measure transport readiness — the time from StartFrame to first client connection — via a new on_transport_readiness_measured event.	2026-03-02 13:10:09 -05:00
Mark Backman	c54232bdb4	Add StartupTimingObserver for measuring processor start() times Tracks how long each processor start method takes during pipeline startup by measuring StartFrame arrive/leave deltas. Emits a timing report via the on_startup_timing_report event and auto-logs a summary. Internal pipeline processors are excluded from reports by default.	2026-03-02 10:48:50 -05:00
filipi87	8b09f7bbb4	Upgrading Deepgram to version 6.	2026-03-02 11:22:33 -03:00
Gökmen Görgen	16c676a921	add a test for reproducing the user feedback first.	2026-03-02 10:34:50 +01:00
Mark Backman	91c46ffbf4	Re-inject turn completion instructions after LLM context reset When filter_incomplete_user_turns is enabled and an LLMMessagesUpdateFrame replaces the context via set_messages(), the turn completion instructions system message was lost. This caused the LLM to stop emitting turn completion markers. Re-inject the instructions after set_messages() to fix this.	2026-03-01 16:37:07 -05:00
Aleix Conchillo Flaqué	f37fd39cdb	Add optional direction parameter to PipelineTask.queue_frame() and queue_frames() Allow pushing frames upstream through the pipeline by passing FrameDirection.UPSTREAM. Downstream frames use the existing push queue, while upstream frames are pushed directly from the pipeline sink.	2026-02-28 17:28:44 -08:00

1 2 3 4 5 ...

369 Commits