pipecat

Author	SHA1	Message	Date
Aleix Conchillo Flaqué	b78cecf7b2	Rename UserTurnCompletedFrame to UserTurnInferenceCompletedFrame The old name overlapped semantically with `UserStoppedSpeakingFrame`: both could be read as "the user's turn is done." They're at different layers — `UserStoppedSpeakingFrame` is the acoustic stop signal, while this frame is the post-judgment "inference about the turn is now complete (turn is semantically final)" signal emitted by the LLM mixin (on ✓), an end-of-turn classifier, or a custom producer. The new name pairs naturally with the existing `on_user_turn_inference_triggered` event vocabulary and removes the ambiguity with `UserStoppedSpeakingFrame`.	2026-05-07 17:47:41 -07:00
Aleix Conchillo Flaqué	952dddca8b	Replace llm_completion_user_turn_stop_strategies() with FilterIncompleteUserTurnStrategies Wrap the detector chain with `deferred(...)` and append the LLM completion gate via a `UserTurnStrategies` specialization rather than a free-standing helper, mirroring the existing `ExternalUserTurnStrategies` pattern. The class lives next to other strategy containers in `pipecat.turns.user_turn_strategies`, so users discover it where they're already configuring `user_turn_strategies`. The deprecated `filter_incomplete_user_turns` flag now rewires through `FilterIncompleteUserTurnStrategies` under the hood, keeping the migration path identical to before. `deferred(...)` stays public as the explicit escape hatch for non-default compositions.	2026-05-07 17:47:39 -07:00
Aleix Conchillo Flaqué	e3e90d38aa	Preserve full user transcript across multiple inferences in one turn When a stop-strategy chain splits inference-triggered from finalization (e.g. `LLMTurnCompletionUserTurnStopStrategy` gating a deferred detector), more than one inference can fire inside a single user turn, and each adds the new transcription segment to the context. Previously each inference overwrote `_pending_user_turn_aggregation`, so the eventual `on_user_turn_stopped` event surfaced only the segment from the last inference, dropping anything the user said before it. Concatenate each segment into `_full_user_turn_aggregation` instead of overwriting, and combine that running buffer with any post-final-inference segment when emitting the public event.	2026-05-07 17:46:15 -07:00
Aleix Conchillo Flaqué	480eca42f5	Split user-turn-stop into inference-triggered and finalized events Fixes a real bug: with `filter_incomplete_user_turns` enabled, the smart-turn detector's tentative stop was firing `on_user_turn_stopped` before the LLM had a chance to veto it. Observers, transcript appenders and UI indicators received an early (and sometimes duplicated) signal. Decomposes the single stop concern into two events: - `on_user_turn_inference_triggered` fires when a stop strategy has enough signal to start LLM inference. The aggregator pushes the context here, kicking off the LLM call. - `on_user_turn_stopped` fires only when the user turn is semantically final. Built-in strategies fire both events at the same call site, preserving today's behavior for the common case. Adds `LLMTurnCompletionUserTurnStopStrategy`, which gates finalization on a `UserTurnCompletedFrame` (a fieldless system frame emitted by any component judging turn completeness; currently the `UserTurnCompletionLLMServiceMixin` on `✓`). Adds `deferred(strategy)` / `DeferredUserTurnStopStrategy`, a thin wrapper that forwards an inner strategy's events except `on_user_turn_stopped`. Use this to install a stop strategy as an inference trigger only, leaving finalization to a peer (e.g. the LLM completion strategy). Adds `llm_completion_user_turn_stop_strategies()` for the common case: `UserTurnStrategies(stop=llm_completion_user_turn_stop_strategies())`. Deprecates `LLMUserAggregatorParams.filter_incomplete_user_turns`. The aggregator emits a `DeprecationWarning`, wraps existing stop strategies with `deferred(...)`, and appends `LLMTurnCompletionUserTurnStopStrategy` automatically.	2026-05-07 17:46:09 -07:00
kompfner	991ee9e0e6	Merge pull request #4404 from pipecat-ai/pk/mitigate-calls-to-missing-tools Mitigate tool-call-related hallucination	2026-05-07 15:05:13 -04:00
Paul Kompfner	e06e0c0282	Mitigate tool-call-related hallucination When tools change mid-conversation, LLMs can produce a few different flavors of tool-call-related hallucination: calling tools that have been removed, avoiding tools that have been re-added, or hallucinating output (made-up answers or tool-call-shaped non-tool-calls) when tools are unavailable. This change introduces an opt-in ``add_tool_change_messages`` flag on the LLM aggregators (preferred entry point: ``LLMContextAggregatorPair( ..., add_tool_change_messages=True)``) that appends a developer-role message to the context whenever ``LLMSetToolsFrame`` changes the set of advertised standard tools. Helps the LLM stay coherent across tool changes by spelling out exactly what just became available or unavailable. Both aggregators participate; whichever handles the frame first wins, and the other (if any) sees an empty diff against the shared context and stays silent — order-independent regardless of whether the frame flows downstream or upstream. Also tightens the existing missing-handler path (introduced in #4301): - Reworded the terminal tool result to a neutral "The function ``X`` is not currently available." (overridable via ``LLMService.MISSING_FUNCTION_CALL_MESSAGE_TEMPLATE``). Previously read "Error: function 'X' is not registered." - Logs at the call site now distinguish developer error (tool advertised but no handler registered → ``logger.error``) from hallucination (tool not advertised → ``logger.warning``). Includes a manual validation harness (``examples/features/features-add-tool-change-messages.py``) that exercises the new ``add_tool_change_messages`` mitigation by flipping tool availability on a turn counter so its effect can be observed end-to-end with the flag on vs. off.	2026-05-05 13:02:43 -04:00
Mark Backman	f1a3ee97de	fix: surface TTSSpeakFrame greetings in on_assistant_turn_stopped Two issues were causing TTSSpeakFrame(append_to_context=True) greetings to silently lose their trailing words and never fire on_assistant_turn_stopped: - LLMAssistantPushAggregationFrame was emitted without a PTS, so the transport routed it through the audio (sync) queue while word-level TTSTextFrames travel through the clock queue. The aggregation could reach the assistant aggregator before the final words, leaving them orphaned in the buffer. Stamp the frame with `_word_last_pts + 1` when there are word timestamps so it can't overtake them. - The aggregator's LLMAssistantPushAggregationFrame handler called push_aggregation() directly, bypassing _trigger_assistant_turn_stopped. For TTS-only flows there is no LLMFullResponseStartFrame, so the turn start timestamp was never set and on_assistant_turn_stopped never fired. Open a turn (if needed) and trigger stopped from the handler. Fixes #4264.	2026-05-04 10:41:22 -04:00
Aleix Conchillo Flaqué	698c2ba92e	Fix on_assistant_turn_stopped not firing for empty LLM responses When the LLM returned zero text tokens (e.g. it was interrupted before producing any tokens, or just as it was about to push them), push_aggregation() returned an empty string and on_assistant_turn_stopped was never emitted. This left consumers waiting for an event that would never arrive. Now on_assistant_turn_stopped always fires, with an empty content string when the LLM produced no text tokens. Fixes #4292	2026-04-14 10:07:19 -07:00
Filipi da Silva Fuchter	6eccd16543	Merge pull request #4217 from pipecat-ai/filipi/async_tools Supporting async function calls.	2026-04-07 09:35:03 -03:00
Paul Kompfner	70469e3c0c	Assert no LLMContextFrame when run_llm is not set in message frame tests	2026-04-03 11:34:58 -04:00
Paul Kompfner	6111df947e	Test LLMAssistantAggregator handling of upstream message frames Add tests for LLMRunFrame, LLMMessagesAppendFrame, LLMMessagesUpdateFrame, and LLMMessagesTransformFrame sent upstream to LLMAssistantAggregator, mirroring the existing LLMUserAggregator downstream tests. Add frames_to_send_direction param to run_test helper to support this.	2026-04-03 11:34:58 -04:00
Paul Kompfner	4eebfd65d9	Add an `LLMMessagesTransformFrame` to facilitate programmatically editing context in a frame-based way. The previous approach required the caller to directly grab a reference to the context object, grab a "snapshot" of its messages at that point in time, transform the messages, and then push an `LLMMessagesUpdateFrame` with the transformed messages. This approach can lead to problems: what if there had already been a change to the context queued in the pipeline? The transformed messages would simply overwrite it without consideration.	2026-04-03 11:34:50 -04:00
filipi87	929a0e33f4	Fixing the automated tests.	2026-04-02 16:58:28 -03:00
Aleix Conchillo Flaqué	976c644f90	Fix tests to expect SpeechControlParamsFrame from default turn strategy	2026-04-02 12:42:06 -07:00
Paul Kompfner	394599d031	Remove deprecated `OpenAILLMContext` as well as everything (code paths or whole types) dependent on it (all of which were also deprecated)	2026-03-31 18:15:25 -04:00
Mark Backman	efda57de5c	Move turn completion instructions to system_instruction Turn completion instructions were being injected as a system message in the LLM context, which caused warning spam when system_instruction was also set, did not persist across full context updates, and broke LLMs that do not support consecutive system messages. Instead, compose the turn completion instructions into the LLM service system_instruction field. This is managed via _base_system_instruction which stores the original value for restoration when turn completion is disabled.	2026-03-08 10:41:40 -04:00
Mark Backman	91c46ffbf4	Re-inject turn completion instructions after LLM context reset When filter_incomplete_user_turns is enabled and an LLMMessagesUpdateFrame replaces the context via set_messages(), the turn completion instructions system message was lost. This caused the LLM to stop emitting turn completion markers. Re-inject the instructions after set_messages() to fix this.	2026-03-01 16:37:07 -05:00
Mark Backman	69d916ca51	Consume InterimTranscriptionFrame and TranslationFrame in LLMUserAggregator These frames were falling through to the else branch and being pushed downstream, unlike TranscriptionFrame which is explicitly consumed. This aligns with how the assistant aggregator already filters them.	2026-02-24 20:51:41 -05:00
Luke Payyapilli	3adb2f50a6	Fix LLMUserAggregator broadcasting mute events before StartFrame	2026-02-13 11:59:56 -05:00
Mark Backman	34b068d657	Improve user turn stop timing by triggering timeout from VAD stop Refactor TranscriptionUserTurnStopStrategy and TurnAnalyzerUserTurnStopStrategy to use VADUserStoppedSpeakingFrame as the ground truth for when speech ended, rather than triggering timeouts from transcription frames.	2026-02-09 14:12:33 -05:00
Mark Backman	63a23246d5	Add UserTurnCompletionLLMServiceMixin (#3518) * Added UserTurnCompletionLLMServiceMixin class * Added 22-filter-incomplete-turns.py foundational example * Removed old 22 natural conversation foundational examples * Added test_user_turn_completion_mixin.py	2026-01-30 14:57:15 -05:00
Aleix Conchillo Flaqué	305ab44132	tests: add unittest.main() call	2026-01-30 10:07:34 -08:00
Mark Backman	e80e0eab29	Emit on_assistant_turn_stopped and on_user_turn_stopped from EndFrame or CancelFrame	2026-01-27 14:50:10 -05:00
Aleix Conchillo Flaqué	c7ab87b0cc	turns: move mute to user_mute	2026-01-16 11:07:20 -08:00
Aleix Conchillo Flaqué	24a52375c7	tests: added LLMAssistantAggregator unit tests	2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué	4b61fd2d7d	LLMUserAggregator: add user turn stopped message argument It is now possible to get the user aggregation when an `on_user_turn_stopped` event is emitted.	2026-01-09 09:42:41 -08:00
Aleix Conchillo Flaqué	b0185e3539	tests: improve LLMUserAggregator tests	2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué	2626154a64	update examples and tests copyright and use a proper dash in 2024-2026	2026-01-07 19:32:22 -08:00
Aleix Conchillo Flaqué	846ca500d3	turns: update old turn_start_strategies deprecations	2025-12-30 19:50:10 -08:00
Aleix Conchillo Flaqué	eb5a797b12	turns: rename bot turn start to user turn stop strategies	2025-12-30 14:33:58 -08:00
Aleix Conchillo Flaqué	5496aa722f	turns: simplify imports and don't require full strategy module path	2025-12-28 16:20:15 -08:00
Aleix Conchillo Flaqué	8b861d9143	LLMUserAggregator: move turn_start_strategies from PipelineTask	2025-12-28 08:16:34 -08:00
Aleix Conchillo Flaqué	43fc26cf0e	tests: add user mute strategies tests to user aggregator	2025-12-27 13:49:31 -08:00
Aleix Conchillo Flaqué	ffb5895404	tests: add initial tests for universal LLMUserAggregator	2025-12-23 15:51:06 -08:00

34 Commits
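
Note on e3e90d38aa: the change from overwriting to concatenating transcript segments can be illustrated with a minimal standalone sketch. This is not pipecat's code. `TurnTranscriptBuffer` and its methods are hypothetical names used only for illustration, and the real aggregator fields (`_pending_user_turn_aggregation`, `_full_user_turn_aggregation`) may behave differently in detail.

```python
from dataclasses import dataclass, field


@dataclass
class TurnTranscriptBuffer:
    """Hypothetical stand-in for the aggregator's per-turn buffers (not pipecat's API)."""

    # Segments already pushed to the context by earlier inferences in this turn.
    full_segments: list[str] = field(default_factory=list)
    # Text transcribed since the most recent inference.
    pending: str = ""

    def add_transcription(self, text: str) -> None:
        """Append newly transcribed text to the pending segment."""
        self.pending = f"{self.pending} {text}".strip()

    def on_inference_triggered(self) -> str:
        """Return the segment to push to the context, and remember it for the turn."""
        segment, self.pending = self.pending, ""
        if segment:
            # Accumulate instead of overwriting, so earlier segments survive.
            self.full_segments.append(segment)
        return segment

    def on_turn_stopped(self) -> str:
        """Return everything said during the turn, then reset for the next turn."""
        parts = [*self.full_segments, self.pending]
        text = " ".join(p for p in parts if p)
        self.full_segments.clear()
        self.pending = ""
        return text


buf = TurnTranscriptBuffer()
buf.add_transcription("I'd like to book")
first = buf.on_inference_triggered()   # first inference in the turn
buf.add_transcription("a table for two")
second = buf.on_inference_triggered()  # second inference, same turn
buf.add_transcription("at seven")      # spoken after the last inference
turn_text = buf.on_turn_stopped()
```

In this sketch each inference still receives only its own new segment, while the final turn text combines all earlier segments with whatever arrived after the last inference, which is the behavior the commit message describes for `on_user_turn_stopped`.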