pipecat

Author	SHA1	Message	Date
Mark Backman	b358657a79	Make max_context_tokens and max_unsummarized_messages independently optional Allow either threshold to be set to None to cleanly disable that trigger, instead of requiring users to set a very large number as a workaround. At least one of the two must remain set (validated at construction time).	2026-03-03 20:08:22 -05:00
Mark Backman	ca0ec16373	Merge pull request #3889 from ai-coustics/goedev/aic-voice-focus-and-memoryview-fix AIC Voice Focus version update & concurrency safety issue on audio buffer.	2026-03-03 09:28:13 -05:00
filipi87	fc905a7ef5	Merge branch 'main' into filipi/deepgram # Conflicts: # src/pipecat/services/deepgram/stt_sagemaker.py	2026-03-03 10:54:30 -03:00
Mark Backman	aad1211a57	Merge pull request #3885 from pipecat-ai/mb/latency-breakdown Add latency breakdown to UserBotLatencyObserver	2026-03-02 19:27:35 -05:00
Mark Backman	7dbb130666	Add chronological_events utility function to display UserBotLatencyObserver report	2026-03-02 19:23:42 -05:00
Aleix Conchillo Flaqué	44466cfa07	Merge pull request #3896 from pipecat-ai/aleix/broadcast-interruption Add broadcast_interruption() to FrameProcessor	2026-03-02 13:36:39 -08:00
Aleix Conchillo Flaqué	4a61d5bfad	Add broadcast_interruption() to FrameProcessor Replace the round-trip push_interruption_task_frame_and_wait() mechanism with broadcast_interruption(), which pushes an InterruptionFrame both upstream and downstream directly from the calling processor. This eliminates race conditions (transcription arriving before the InterruptionFrame comes back), swallowed-event timeouts (frame blocked before reaching the sink), and the complexity of _wait_for_interruption flag / queue bypass / frame.complete() obligations. - Add broadcast_interruption() to FrameProcessor - Deprecate push_interruption_task_frame_and_wait() (delegates to new method) - Remove event field and complete() from InterruptionFrame/InterruptionTaskFrame - Remove _wait_for_interruption flag and all special-case logic - Remove frame.complete() calls in stt_mute_filter and llm_response_universal - Update all 17 call sites to use broadcast_interruption() - Update tests	2026-03-02 13:26:45 -08:00
Mark Backman	ff5b985009	Convert observer data models to Pydantic BaseModel with timestamps Enables .model_dump() serialization for Pipecat Cloud collection. All metrics now include start_time (Unix timestamp) for timeline plotting alongside duration_secs.	2026-03-02 16:11:43 -05:00
Mark Backman	a738a4d82b	Add function call latency tracking to LatencyBreakdown	2026-03-02 16:11:43 -05:00
Mark Backman	ddba1b84a9	Add first-bot-speech latency to UserBotLatencyObserver Measure time from ClientConnectedFrame to first BotStartedSpeakingFrame, emitting a one-time on_first_bot_speech_latency event with breakdown.	2026-03-02 16:11:43 -05:00
Mark Backman	18155b6a63	Add latency breakdown to UserBotLatencyObserver Add per-service latency breakdown metrics alongside existing user-to-bot latency measurement. When enable_metrics=True, the observer now emits an on_latency_breakdown event with TTFB, text aggregation, and user turn duration metrics collected between VADUserStoppedSpeakingFrame and BotStartedSpeakingFrame. - Add LatencyBreakdown dataclass with ttfb, text_aggregation, user_turn_secs fields - Accumulate MetricsFrame data during user→bot cycles - Reset accumulators on InterruptionFrame to discard stale metrics - Measure user_turn_secs from actual user silence (VAD timestamp - stop_secs) to turn release (UserStoppedSpeakingFrame) - Filter zero-value TTFB entries from startup metric resets - Add frame deduplication using bounded deque + set pattern - Update example 29 with latency breakdown display	2026-03-02 16:11:43 -05:00
Mark Backman	bbbfdfd321	Replace per-processor start_time with start_offset_secs Use start_offset_secs (offset from StartFrame) on ProcessorStartupTiming instead of a wall-clock timestamp. Reports keep a single start_time anchor for dashboard visualization. Remove _mono_to_wall conversion.	2026-03-02 14:07:34 -05:00
Mark Backman	75669b12a2	Convert observer data models to Pydantic BaseModel with timestamps Switch ProcessorStartupTiming, StartupTimingReport, and TransportTimingReport from dataclasses to Pydantic BaseModel. Add start_time (Unix timestamp) fields and wall clock conversion for monotonic observer timestamps.	2026-03-02 13:10:09 -05:00
Mark Backman	68e8732e72	Add BotConnectedFrame and on_transport_timing_report event Add BotConnectedFrame (SystemFrame) pushed by SFU transports (Daily, LiveKit, HeyGen, Tavus) when the bot joins the room. Replace the on_transport_readiness_measured event with on_transport_timing_report which includes both bot_connected_secs and client_connected_secs.	2026-03-02 13:10:09 -05:00
Mark Backman	0836066898	Add ClientConnectedFrame and transport readiness timing Introduce ClientConnectedFrame (SystemFrame) pushed by all transports when a client connects. StartupTimingObserver uses this to measure transport readiness — the time from StartFrame to first client connection — via a new on_transport_readiness_measured event.	2026-03-02 13:10:09 -05:00
Mark Backman	c54232bdb4	Add StartupTimingObserver for measuring processor start() times Tracks how long each processor start method takes during pipeline startup by measuring StartFrame arrive/leave deltas. Emits a timing report via the on_startup_timing_report event and auto-logs a summary. Internal pipeline processors are excluded from reports by default.	2026-03-02 10:48:50 -05:00
filipi87	8b09f7bbb4	Upgrading Deepgram to version 6.	2026-03-02 11:22:33 -03:00
Gökmen Görgen	16c676a921	add a test for reproducing the user feedback first.	2026-03-02 10:34:50 +01:00
Mark Backman	91c46ffbf4	Re-inject turn completion instructions after LLM context reset When filter_incomplete_user_turns is enabled and an LLMMessagesUpdateFrame replaces the context via set_messages(), the turn completion instructions system message was lost. This caused the LLM to stop emitting turn completion markers. Re-inject the instructions after set_messages() to fix this.	2026-03-01 16:37:07 -05:00
Aleix Conchillo Flaqué	f37fd39cdb	Add optional direction parameter to PipelineTask.queue_frame() and queue_frames() Allow pushing frames upstream through the pipeline by passing FrameDirection.UPSTREAM. Downstream frames use the existing push queue, while upstream frames are pushed directly from the pipeline sink.	2026-02-28 17:28:44 -08:00
filipi87	d077a810ae	Fixing context summarization tests	2026-02-27 18:42:50 -03:00
Mark Backman	82c249608f	Move dedicated LLM summarization into LLMContextSummarizer The dedicated LLM logic lived in LLMAssistantAggregator, creating two code paths and requiring the aggregator to call a private LLMService method. Move it into the summarizer which already owns the config and summarization lifecycle, keeping the aggregator handler as a single-line upstream push.	2026-02-27 12:09:00 -05:00
Mark Backman	98e737b4e9	Add tests for context summarization improvements Cover summary message role, template, on_summary_applied event, summarization timeout, and dedicated LLM routing/error handling.	2026-02-27 12:08:43 -05:00
Filipi da Silva Fuchter	db40a354be	Merge pull request #3794 from omChauhanDev/fix/context-summarization-llm-specific-message skipping provider-specific messages during summarization	2026-02-27 10:57:34 -05:00
filipi87	3b427a47b6	Fixing Piper test.	2026-02-27 11:57:11 -03:00
kompfner	7fe458fe59	Merge pull request #3817 from pipecat-ai/pk/service-settings-fix-back-compat-for-nested-external-sdk-types Flatten `LiveOptions` into individual fields on `DeepgramSTTSettings`…	2026-02-26 11:08:27 -05:00
Paul Kompfner	faed775d90	Extract `_DeepgramSTTSettingsBase` with shared `_merge_live_options_delta` to deduplicate LiveOptions merge logic between `__init__` and `apply_update`, and between the Deepgram STT and SageMaker variants; make top-level model/language take precedence over conflicting live_options values in updates; remove unnecessary Language enum-to-string conversion (Language is a StrEnum)	2026-02-26 11:02:44 -05:00
Mark Backman	d69a337def	Add text_aggregation_mode parameter to TTSService Move the sentence vs token aggregation concern into text aggregators so all text flows through them regardless of mode. This enables pattern detection and tag handling to work in TOKEN mode. - Add TextAggregationMode enum (SENTENCE, TOKEN) as the user-facing TTS setting, separate from the internal AggregationType - Add TOKEN mode support to Simple, SkipTags, and PatternPair aggregators - Add text_aggregation_mode parameter to TTSService and all TTS subclasses - Deprecate aggregate_sentences in favor of text_aggregation_mode - Merge TTSService._process_text_frame() into a single codepath	2026-02-26 08:55:41 -05:00
Paul Kompfner	8b6aa4b912	Unflatten `LiveOptions` back into a single `live_options` field on `DeepgramSTTSettings` and `DeepgramSageMakerSTTSettings`; add `apply_update` override with delta-merge semantics and `from_mapping` override for backward-compatible dict-style updates	2026-02-25 18:25:11 -05:00
Mark Backman	69d916ca51	Consume InterimTranscriptionFrame and TranslationFrame in LLMUserAggregator These frames were falling through to the else branch and being pushed downstream, unlike TranscriptionFrame which is explicitly consumed. This aligns with how the assistant aggregator already filters them.	2026-02-24 20:51:41 -05:00
kompfner	03cb0054f9	Merge branch 'main' into pk/service-settings-refactor	2026-02-23 11:46:03 -05:00
Om Chauhan	9476b5d184	added changelog	2026-02-21 17:35:08 +05:30
Om Chauhan	f49658de15	skipping provider-specific messages during summarization	2026-02-21 17:19:50 +05:30
Aleix Conchillo Flaqué	827032fefb	Unblock push_interruption_task_frame_and_wait after timeout When the InterruptionFrame does not complete within the timeout the caller was stuck in an infinite loop logging warnings. Now the event is set after the first timeout so the processor can continue. Also adds a keyword timeout parameter so callers can customize the wait duration.	2026-02-20 14:56:42 -08:00
Aleix Conchillo Flaqué	474b27305f	Merge pull request #3748 from pipecat-ai/mb/user-idle-configurable Make UserIdleController always-on with dynamic timeout updates	2026-02-19 11:44:51 -08:00
Aleix Conchillo Flaqué	20509e8f96	Merge pull request #3744 from pipecat-ai/mb/user-idle-timeout-frame Redesign UserIdleController to use BotStoppedSpeakingFrame	2026-02-19 11:34:42 -08:00
Paul Kompfner	94a651cee2	Remove dead `ServiceSettings.to_dict` method	2026-02-17 15:15:18 -05:00
Luke Payyapilli	247f0bbcd3	Fix async generator cleanup to prevent uvloop crash on Python 3.12+	2026-02-17 13:10:31 -05:00
Paul Kompfner	3b1ba57452	Change `apply_update` / `_update_settings` return type from `set[str]` to `dict[str, Any]`. The dict maps each changed field name to its pre-update value, enabling services to do granular diffing of complex settings objects. Existing call-site patterns (`"field" in changed`, `if changed`, iteration) work unchanged; set-difference sites use `changed.keys() - {...}`.	2026-02-17 11:49:15 -05:00
Mark Backman	dba4de77bf	Merge pull request #3684 from ai-coustics/goedev/aic-model-caching AIC model caching	2026-02-16 10:43:14 -05:00
Mark Backman	507765625f	Make UserIdleController always-on with dynamic timeout updates Always create UserIdleController (timeout=0 means disabled), removing all Optional guards. Add UserIdleTimeoutUpdateFrame to allow changing the idle timeout at runtime.	2026-02-14 09:54:30 -05:00
Mark Backman	012ef41ff4	Redesign UserIdleController to use BotStoppedSpeakingFrame Replace the continuous heartbeat-based timer (UserSpeakingFrame/BotSpeakingFrame + asyncio.Event loop) with a simple one-shot timer that starts when BotStoppedSpeakingFrame is received and cancels on UserStartedSpeakingFrame or BotStartedSpeakingFrame. This eliminates false idle triggers caused by gaps between the user finishing speaking and the bot starting to speak (LLM/TTS latency). Guard the timer start with two conditions to prevent false triggers: - User turn in progress: during interruptions, BotStoppedSpeaking arrives while the user is still speaking mid-turn. - Function calls in progress: FunctionCallsStarted arrives before BotStoppedSpeaking because the bot speaks concurrently with the function call starting, so the timer must wait for the result and subsequent bot response.	2026-02-14 08:55:56 -05:00
Paul Kompfner	8a4ab611be	Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with `UpdateSettingsFrame`s. Does not (yet) touch `InputParams`, to avoid scope creep and touching something currently part of the public API. But there is a lot of overlap between `Settings` object fields and `InputParams` fields. Other than discoverability/typing, these are some other improvements brought by this refactor: - There is now a single code path (see `_update_settings_from_typed`) where services can respond to settings changes (by, say, reconnecting if needed), improving maintainability and guaranteeing one and only one reconnection no matter which settings changed - `set_language`/`set_model`/`set_voice`—which we're assuming are usable as public methods, though not recommended over `UpdateSettingsFrame`—all use the same code path as settings updates. They're also now all consistent in that, if a service needs to respond to a change (by, say, reconnecting if needed), any of these methods will kick off that process. Note that this is technically a behavior change. - Several services now properly react to changed settings by reconnecting: - `AWSTranscribeSTTService` - `AzureSTTService` - `SonioxSTTService` - `GladiaSTTService` - `SpeechmaticsSTTService` - `AssemblyAISTTService` - `CartesiaSTTService` - `FishAudioTTSService` (would previously only reconnect when `model` changed) - `GoogleSTTService` - `SpeechmaticsSTTService` (which previously only handled some* settings updates through a nonstandard public `update_params` method) - `GradiumSTTService` - `NvidiaSegmentedSTTService` (which previously only handled changes to language) - Bookkeeping across various services has been reduced, mostly by deduping ivars; the `self._settings` ivar is treated as the source of truth NOTE: I pretty much guarantee that there are services missed in this PR in terms of bringing to consistency with how updates are handled (like whether changes in certain fields trigger reconnects when they need to). We can squash remaining inconsistencies as we stumble onto them, service by service. The goal here is to get things mostly in order, and establish the infrastructure and patterns we'll need going forward.	2026-02-13 15:12:26 -05:00
Luke Payyapilli	3adb2f50a6	Fix LLMUserAggregator broadcasting mute events before StartFrame	2026-02-13 11:59:56 -05:00
Mark Backman	71a752c971	Add tests for TracingContext and TurnTraceObserver Cover pipeline-scoped tracing context lifecycle, span hierarchy, conversation/turn context management, and concurrent pipeline isolation.	2026-02-11 23:27:35 -05:00
Gökmen Görgen	2036757b84	add unit tests for `AICModelManager` and `AICFilter` error handling, model loading, and processor behavior	2026-02-11 15:22:37 +01:00
Aleix Conchillo Flaqué	93f4402198	Update stream close test to match new _closing helper	2026-02-10 18:19:57 -08:00
filipi87	4a00e6829f	Automated tests for the context summarizer.	2026-02-10 18:58:44 -03:00
filipi87	9d89afa7d4	Automated tests for the context summarization feature.	2026-02-10 18:58:33 -03:00
Mark Backman	981253c703	Rename RequestMetadataFrame to ServiceSwitcherRequestMetadataFrame with service targeting Add a `service` field so the frame targets a specific service, allowing ServiceSwitcher.push_frame to consume it only when the targeted service matches the active service. STTService and test mocks now push the frame downstream after handling instead of silently consuming it.	2026-02-09 16:48:34 -05:00

1 2 3 4 5 ...

339 Commits