pipecat

Author	SHA1	Message	Date
Paul Kompfner	ef226c8a8e	fix: silence _settings NotGiven leaks and tighten Google STT language method Six pyright errors followed the same pattern: a value flowed out of `self._settings.X` (typed `T \| _NotGiven`) into a context that wanted the plain `T`. Wrap each with `assert_given(...)` so the sentinel gets stripped at the boundary: - aws/nova_sonic/llm.py: `_settings.model` (in InvokeModel...Input) and `_settings.system_instruction` (passed to the adapter). - deepgram/flux/base.py: iterating `_settings.keyterm`. - google/stt.py: iterating `_settings.languages`. - google/tts.py: iterating `_settings.speaker_configs`. - openai/base_llm.py: `_settings.system_instruction` passed to the adapter. Also takes a deeper pass at the related Google STT issue: the override of `language_to_service_language` had been broadened to take `Language \| list[Language]` and return `str \| list[str]`, a Liskov violation against the base's `Language -> str \| None` contract. External callers always pass a single Language, and the only consumer of the list path was Google STT's own `_get_language_codes`. Restore the override to a single-Language signature and let `_get_language_codes` iterate. The override is also tightened to return `str` (narrower than the base's `str \| None`, which is LSP-compatible) since it always falls back to `"en-US"` rather than returning None. Net: -7 pyright errors (full-config run: 782 -> 775).	2026-05-01 09:36:14 -04:00
Paul Kompfner	2a731336be	fix: tighten language_to_<service>_language return types to plain str These provider-specific helpers are all thin wrappers around `resolve_language(...)`, which itself returns `str` — never `None`. The `str \| None` annotations were misleading and were producing spurious pyright errors at the call sites that assigned the result into a `str` field. Update each helper's signature to `str` and rewrite the `Returns:` docstring to describe the actual fallback behaviour (resolve to base or full code, with a warning). Importantly, the per-class `language_to_service_language(...)` methods on `STTService`/`TTSService` subclasses keep `str \| None` as their return type. That signature is an extension hook for future and/or third-party subclasses that may genuinely not be able to produce a code for some languages, even though all in-tree first- party services currently return a string. Also includes one small unrelated tightening in azure/stt.py: wrap `self._settings.language` with `assert_given(...)` so the truthy fallback to `language_to_azure_language(Language.EN_US)` doesn't silently swallow a NotGiven sentinel. Net: -3 pyright errors (full-config run: 785 -> 782).	2026-05-01 09:36:14 -04:00
Paul Kompfner	bec407ce3a	fix: handle Optional websocket/client receivers across services Pyright flagged 19 sites where `await self._<connection>.send/recv/...` was called on a receiver typed `X \| None`. Each kind of call site needed a slightly different fix to be both type-safe and behaviour- preserving: Streaming/user-facing paths (early return + warn — drop and warn is the right runtime fail-safe when reconnect didn't succeed): - cartesia/stt.py (run_stt) - soniox/stt.py (_send_keepalive) - elevenlabs/tts.py (run_tts — yields ErrorFrame and returns) - deepgram/sagemaker/tts.py (run_tts) - transports/lemonslice/transport.py (send_message) - transports/tavus/transport.py (send_message) "Should never happen" cases (early return with comment, no warn — caller already gated on a separate `_is_*` check, so a warn would be noise): - deepgram/flux/stt.py (transport methods, gated by _transport_is_active) - deepgram/flux/sagemaker/stt.py (same) - stt_service.py (_send_keepalive, gated by _is_keepalive_ready) - elevenlabs/stt.py (_send_keepalive, same) - llm_service.py (_ws_recv — raises ConnectionError to match _ensure_connected's contract) - heygen/client.py (receive loop, gated by self._connected) Just-assigned-above (use a local variable so pyright keeps the narrowing across statements): - lmnt/tts.py - gradium/stt.py - fish/tts.py Other: - transports/websocket/server.py — used the existing local `websocket` parameter in scope instead of `self._websocket` for the close call. - websocket_service.py — `send_with_retry` raises ConnectionError when `self._websocket` is None inside the existing try-block, so the broad `except Exception` triggers reconnect just as it would on a real send failure (preserving the prior behaviour where None silently fell through to the AttributeError-driven reconnect path). Drops three now-clean files from the pyright ignore list: cartesia/stt.py, elevenlabs/stt.py, and soniox/stt.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	1cd73b1ef8	refactor: give TAdapter a default to restore precise typing for unparameterized LLMService subclasses After making LLMService generic, an unparameterized subclass (`class MyService(LLMService):` with no bracket — the third-party provider pattern) saw `get_llm_adapter()` return `Unknown` rather than `BaseLLMAdapter` as it did before the refactor. Add `default=BaseLLMAdapter` (PEP 696) on the TypeVar — via `typing_extensions.TypeVar` so older Python targets keep working — so unparameterized callers get `LLMService[BaseLLMAdapter]` and `get_llm_adapter()` returns `BaseLLMAdapter`, matching the pre-refactor type precision. Two internal fallouts of having a default (where the default makes unannotated `LLMService` resolve invariantly to `LLMService[BaseLLMAdapter]`): - `FunctionCallParams.llm` is now `LLMService[Any]` so concrete parameterizations like `LLMService[OpenAILLMAdapter]` can be passed where the field is set. - The explicit `LLMService.__init__(self, **kwargs)` in `WebsocketLLMService.__init__` gets a `pyright: ignore[reportArgumentType]` comment — pyright's invariance handling can't see through the multi-inheritance + generic + default combination, but the runtime call is correct (generics are erased).	2026-05-01 09:36:14 -04:00
Paul Kompfner	c4f5f1ebbb	test, refactor: follow-ups to LLMService generic refactor Two follow-ups now that LLMService is generic over its adapter: - Add an explicit backward-compat test verifying that an LLMService subclass with no generic parameter (the third-party-provider pattern) instantiates and returns a usable adapter. The existing MockLLMService (declared without brackets) already exercised this implicitly, but it's worth a named assertion. - Drop the now-redundant `params: SomeLLMInvocationParams = ...` variable annotations on `adapter.get_llm_invocation_params()` results. Since `get_llm_adapter()` now returns the precise adapter type, and `BaseLLMAdapter` is generic in its invocation-params type, the call already infers the right TypedDict.	2026-05-01 09:36:14 -04:00
Paul Kompfner	49068ff557	refactor: make LLMService generic over its adapter type Previously, `LLMService.get_llm_adapter()` returned `BaseLLMAdapter`, which forced every caller that wanted the precise adapter type to write `adapter: SomeAdapter = self.get_llm_adapter()` and accept pyright's complaint that the assignment doesn't match the declared type. That pattern existed in 17 places across the LLM services. Make `LLMService` generic over its adapter type — `LLMService(..., Generic[TAdapter])` with `TAdapter = TypeVar("TAdapter", bound=BaseLLMAdapter)` — so subclasses opt in via `LLMService[XAdapter]` and callers get the precise type back from `get_llm_adapter()` automatically. Backward-compatible for third-party providers: code that says `class MyService(LLMService):` (no bracket) still type-checks, with TAdapter resolving to BaseLLMAdapter from the bound — identical to the pre-refactor behavior. The `adapter_class` attribute keeps its loose `type[BaseLLMAdapter] = OpenAILLMAdapter` typing so the default remains usable; one localized cast in `__init__` bridges the loose class attr to the precise instance attr. In-tree subclasses opted in: - AnthropicLLMService -> LLMService[AnthropicLLMAdapter] - AWSBedrockLLMService -> LLMService[AWSBedrockLLMAdapter] - AWSNovaSonicLLMService -> LLMService[AWSNovaSonicLLMAdapter] - BaseOpenAILLMService -> LLMService[OpenAILLMAdapter] (propagates to ~15 OpenAI-compatible providers like Cerebras, Groq, Together) - GeminiLiveLLMService -> LLMService[GeminiLLMAdapter] - GoogleLLMService -> LLMService[GeminiLLMAdapter] - GrokRealtimeLLMService -> LLMService[GrokRealtimeLLMAdapter] - InworldRealtimeLLMService -> LLMService[InworldRealtimeLLMAdapter] - OpenAIRealtimeLLMService -> LLMService[OpenAIRealtimeLLMAdapter] - _BaseOpenAIResponsesLLMService -> LLMService[OpenAIResponsesLLMAdapter] - WebsocketLLMService is also generic so the multi-inheritance case (OpenAIResponsesLLMService) can keep both bases agreeing on TAdapter. All 17 redundant `adapter: SomeAdapter = self.get_llm_adapter()` annotations are now plain `adapter = self.get_llm_adapter()`.	2026-05-01 09:36:14 -04:00
Paul Kompfner	d23bdaaacd	fix: handle NotGiven from from_standard_tools in Nova Sonic connect Same pattern as the earlier get_setup_params fix: when context tools are absent, the fallback `adapter.from_standard_tools(self._tools)` can return the NotGiven sentinel, and `_send_prompt_start_event` expects a list. Coerce via `or []` so the NotGiven case becomes an empty list.	2026-05-01 09:36:14 -04:00
Paul Kompfner	53ce57b7fa	fix: tighten _process_completed_function_calls in AWS Nova Sonic Three small changes that resolve pyright errors and sharpen the logic: - Guard `self._context` with the codebase's "should never happen" early-return pattern, so we don't blindly call `.get_messages()` on None. - Skip `LLMSpecificMessage` items in the iteration. They're opaque provider-specific payloads with no `.get()`, and the surrounding logic only applies to standard tool-result messages. - Match `role == "tool"` explicitly. The previous truthy-only check was working by accident — the `tool_call_id` filter further down was effectively narrowing to tool messages, but the intent is clearer when stated upfront.	2026-05-01 09:36:14 -04:00
Paul Kompfner	dabca70744	fix: warn and bail in reset_conversation when no context exists reset_conversation is part of the public AWSNovaSonicLLMService API and is also called internally from the receive-task error handler. Previously it captured `self._context` (typed `LLMContext \| None`) and unconditionally passed it to `_handle_context`, which expects a real context — silently doing the wrong thing if no initial context had been received yet. Treat that as developer error: log a warning and return early. Nothing to preserve means nothing to reset.	2026-05-01 09:36:14 -04:00
Paul Kompfner	191bdc733f	fix: conform AWSNovaSonicLLMService.get_setup_params to its protocol The service implements the NovaSonicSessionSender protocol so the session-continuation helper can target either the current or next session. The protocol declares `get_setup_params(self) -> tuple[str \| None, list]`, but the implementation was unannotated and could return NotGiven in the tools position when from_standard_tools fell through to its NotGiven sentinel. Add the matching return annotation and coerce the NotGiven case to an empty list.	2026-05-01 09:36:14 -04:00
Paul Kompfner	5e1bb4cbe5	chore: remove anthropic_adapter.py from pyright ignore list The file is now clean under pyright's basic type checking, so it can move out of the ignore list and be enforced on every run.	2026-05-01 09:36:14 -04:00
Paul Kompfner	9ee123bf33	fix: resolve final pyright error in Anthropic cache control marker Same MessageParam content-typing issue as the consecutive-message merge fix: pyright doesn't carry the str-to-list narrowing forward, and Iterable has no `[-1]` access. Cast to `list[Any]` and document the chain of assumptions (list, non-empty, dict-typed last item) and where each is upheld upstream. This brings anthropic_adapter.py to 0 pyright errors (down from 115).	2026-05-01 09:36:14 -04:00
Paul Kompfner	66f43baf8f	fix: resolve pyright errors in Anthropic _from_standard_message The function takes an OpenAI ChatCompletionMessageParam (a union of TypedDicts) and returns an Anthropic MessageParam (a different TypedDict). It does the conversion via dict-level mutations that don't type-check against either side's TypedDict schema. Work with the deepcopied message as a plain dict and cast to MessageParam at the return sites — matching the boundary-cast convention noted in llm_context.py. Drops anthropic_adapter.py from 20 to 2 pyright errors.	2026-05-01 09:36:14 -04:00
Paul Kompfner	252bb493af	fix: cast Anthropic-format passthrough message to MessageParam The fallback path in `_from_universal_context_message` returns `message.message` from an `LLMSpecificMessage`, which is typed loosely (`Any \| dict`). The surrounding comment already documents the assumption that the message is already in Anthropic format — make that assumption explicit to pyright with a cast.	2026-05-01 09:36:14 -04:00
Paul Kompfner	c517b67bad	fix: resolve pyright error when merging consecutive Anthropic messages MessageParam types content as `str \| Iterable[...]`, and Iterable has no `.extend()`. After the str-to-list conversions, pyright re-reads the TypedDict field as the original wide type rather than carrying the narrowing forward. Cast to `list[Any]` to express the codebase's existing str-or-list assumption. Drops anthropic_adapter.py from 23 to 21 pyright errors.	2026-05-01 09:36:14 -04:00
Paul Kompfner	70aeb5c7c2	fix: resolve pyright errors in Anthropic get_messages_for_logging Content items in MessageParam have a heterogeneous union type (Pydantic ContentBlock variants and TypedDict *BlockParam variants), neither of which supports the dict-style access and mutation this sanitizer does. Treat the deepcopied message as a plain dict and guard each content item with isinstance(item, dict) — matches the runtime shape produced by _from_standard_message and avoids crashing if a non-dict ever flows through the LLMSpecificMessage path. Drops anthropic_adapter.py from 115 to 23 pyright errors.	2026-05-01 09:36:14 -04:00
Mark Backman	f1eef9ba0a	Merge pull request #4400 from pipecat-ai/mb/deepgram-tts-mip-opt-out feat(deepgram): add mip_opt_out to TTS services	2026-05-01 09:12:03 -04:00
Mark Backman	132b9b1002	Add changelog for #4400	2026-05-01 08:58:38 -04:00
Mark Backman	eb4e56d2d9	feat(deepgram): expose mip_opt_out on TTS services Adds a `mip_opt_out` init parameter to both `DeepgramTTSService` (WebSocket) and `DeepgramHttpTTSService` so callers can opt out of the Deepgram Model Improvement Program. When set, the value is forwarded as a query parameter on the request, matching the pattern used by the Deepgram STT services.	2026-05-01 08:55:23 -04:00
kompfner	6d66bbceeb	Merge pull request #4395 from pipecat-ai/pk/app-resources-api-updates Broaden tool_resources to app_resources	2026-04-30 21:19:05 -04:00
Aleix Conchillo Flaqué	a27d9fc30b	Merge pull request #4396 from pipecat-ai/aleix/remove-unused-user-mute-reset refactor(user_mute): remove unused reset() method from strategies	2026-04-30 17:27:54 -07:00
Aleix Conchillo Flaqué	2a8f4734e0	refactor(user_mute): remove unused reset() method from strategies The reset() method on BaseUserMuteStrategy and its subclasses was never called anywhere in the codebase.	2026-04-30 16:31:29 -07:00
Mark Backman	48ac68e3c8	Merge pull request #4393 from pipecat-ai/mb/fix-smart-turn-import fix(turns): defer LocalSmartTurnAnalyzerV3 import to fix transformers warning	2026-04-30 16:40:17 -04:00
Paul Kompfner	c3ef199efa	Add changelog for #4395	2026-04-30 16:19:35 -04:00
Paul Kompfner	1b5c4cfa2a	feat: broaden tool_resources to app_resources Broaden `tool_resources` to `app_resources` for easy access not just in tool handlers but in other places like custom `FrameProcessor`s. Involves 3 changes: - A rename: `tool_resources` -> `app_resources` - A new property on `PipelineTask`: `app_resources` - A new property on `FrameProcessor`: `pipeline_task` Usage in tool handler: async def get_weather(params: FunctionCallParams): resources = cast(MyAppResources, params.app_resources) ... Usage in custom `FrameProcessor`: class MyProcessor(FrameProcessor): async def process_frame(self, frame, direction): await super().process_frame(frame, direction) if self.pipeline_task is not None: resources = cast(MyAppResources, self.pipeline_task.app_resources) ... The previous `tool_resources` aliases (on `PipelineTask`, `FunctionCallParams`, and `FrameProcessorSetup`) keep working but are deprecated as of 1.2.0 and emit `DeprecationWarning`s.	2026-04-30 16:16:17 -04:00
Mark Backman	6e9dd1dbcc	Merge pull request #4390 from pipecat-ai/mb/cartesia-tts-api-updates feat(cartesia): align TTS services with latest API and buffering guidance	2026-04-30 15:59:15 -04:00
Mark Backman	6487f895b3	Setting use_normalized_timestamps to False so that input and output text match	2026-04-30 14:21:14 -04:00
Mark Backman	351105a975	test(krisp): scope importlib.metadata.version mock to imports only The four krisp test files installed a process-wide mock of importlib.metadata.version with `patch(...).start()` at module level and never called .stop(). Once any of these files was collected, the mock leaked across the rest of the test session, returning '0.0.0-dev' for every version check. This corrupted unrelated tests that triggered transformers' import-time dependency check (e.g. lazy imports of LocalSmartTurnAnalyzerV3) — transformers saw tqdm=='0.0.0-dev' and refused to load. Wrap the pipecat imports in `with patch(...)` so the mock is active during import (where pipecat's krisp version check needs it) and torn down before any tests run.	2026-04-30 14:16:54 -04:00
Mark Backman	8ea963852d	Add changelog for #4393	2026-04-30 14:16:46 -04:00
Mark Backman	6f4458f21d	fix(turns): defer LocalSmartTurnAnalyzerV3 import to avoid loading transformers at module load Importing pipecat.turns.user_turn_strategies pulled in LocalSmartTurnAnalyzerV3 → transformers → onnxruntime at module load time. Since this module is imported by llm_response_universal (and therefore most LLM services), any LLM service import paid the cost of loading transformers and triggered its missing-backend warning in environments without PyTorch/TF/Flax. Move the LocalSmartTurnAnalyzerV3 import into default_user_turn_stop_strategies() so it only loads when the default smart-turn strategy is actually constructed. Fixes #4392	2026-04-30 14:16:41 -04:00
Mark Backman	fb42a7dcf3	Add changelog for #4390	2026-04-30 09:45:16 -04:00
Mark Backman	21547c8680	fix(cartesia): stop double-yielding ErrorFrame on HTTP non-200 The non-200 branch yielded an ErrorFrame and then raised, which the outer except caught and yielded a second, less informative "Unknown error" frame. Return after the yield and fold the status code into the message.	2026-04-30 09:41:43 -04:00
Mark Backman	3e5aabc5f2	fix(cartesia): guard HTTP session before use Pyright flagged the .post() call on a possibly-None _session. Raise a clear RuntimeError if start() wasn't called instead of crashing on the attribute access.	2026-04-30 09:34:02 -04:00
Mark Backman	e508642b0a	refactor(cartesia): mark tag helpers as @staticmethod SPELL/EMOTION_TAG/PAUSE_TAG/VOLUME_TAG/SPEED_TAG are stateless and worked only via class-level access. Decorating them lets instance access work too and silences the missing-self lint warning.	2026-04-30 09:31:22 -04:00
Mark Backman	e546541e20	feat(cartesia): align WebSocket TTS with latest API and buffering guidance - Bump default cartesia_version to 2026-03-01. - Replace deprecated use_original_timestamps with use_normalized_timestamps so word timestamps match what was actually spoken. - Add max_buffer_delay_ms init arg; auto-derive 0 in SENTENCE mode to avoid the doc-warned "middle ground" of client + server buffering, leave unset in TOKEN mode for managed buffering. - Silently consume flush_done messages now emitted per transcript when server-side buffering is disabled.	2026-04-30 09:25:31 -04:00
Mark Backman	bfdd19464f	Merge pull request #4385 from pipecat-ai/mb/runner-session-id feat(runner): add session_id to RunnerArguments	2026-04-29 13:17:47 -04:00
Mark Backman	1a93ff52f1	Merge pull request #4386 from pipecat-ai/mb/update-soniox-model feat(soniox): update default TTS model to tts-rt-v1	2026-04-29 13:17:09 -04:00
Mark Backman	6e2008a7a6	Add changelog for #4386	2026-04-29 11:09:38 -04:00
Mark Backman	da8d3a2d80	feat(soniox): update default TTS model to tts-rt-v1 Promotes the Soniox TTS default model from `tts-rt-v1-preview` to the generally available `tts-rt-v1`.	2026-04-29 11:05:12 -04:00
Mark Backman	6b608e7e22	Add changelog for #4385	2026-04-29 09:53:42 -04:00
Mark Backman	924b9a9d8c	feat(runner): add session_id to RunnerArguments Adds a `session_id: str \| None` field to `RunnerArguments` so bots can log/trace a per-session identifier in local development the same way they can in Pipecat Cloud (where it is provided via the `x-daily-session-id` header). The local runner now mints a UUID at every `RunnerArguments` construction site. For paths that already returned a `sessionId` to the caller (Daily `/start`, dial-in webhook), a single UUID is now generated and shared between `runner_args.session_id` and the response body instead of being thrown away. The SmallWebRTC `/api/offer` endpoint accepts an optional `session_id` so the `/sessions/{session_id}/...` proxy can thread it through. This is the prerequisite step for collapsing pipecat-cloud's `SessionArguments` / `SessionArguments` hierarchy onto the upstream runner types.	2026-04-29 09:45:55 -04:00
Aleix Conchillo Flaqué	9411c4b67e	Merge pull request #4382 from pipecat-ai/aleix/unfill-changelog-script chore(changelog): add release-changelog.py and fix (PR line indentation in towncrier template	2026-04-28 13:18:49 -07:00
Mark Backman	ac5eb97670	Merge pull request #4384 from pipecat-ai/mb/nvidia-remove-riva-ref Update README to remove NVIDIA references to RIVA	2026-04-28 13:18:36 -04:00
Mark Backman	3034f8bb3b	Update README to remove NVIDIA references to RIVA	2026-04-28 12:42:58 -04:00
Aleix Conchillo Flaqué	60c66eda48	chore(towncrier): indent (PR ref line by two spaces in template So the rendered changelog has the (PR [...]) line aligned as a list continuation under its bullet. Verified with both short and wrapped entries via `towncrier build --draft`.	2026-04-27 15:07:53 -07:00
Aleix Conchillo Flaqué	ea3585146c	chore(scripts): add release-changelog.py Adds a script to unfill (single-line) entry paragraphs in CHANGELOG.md while keeping `(PR [...])` on its own continuation line.	2026-04-27 15:07:53 -07:00
Aleix Conchillo Flaqué	9697abe559	Merge pull request #4381 from pipecat-ai/changelog-1.1.0 Release 1.1.0 - Changelog Update v1.1.0	2026-04-27 14:02:20 -07:00
aconchillo	cb0335c82a	Update changelog for version 1.1.0	2026-04-27 13:59:17 -07:00
Aleix Conchillo Flaqué	f560614af9	Merge pull request #4379 from pipecat-ai/aleix/bump-daily-python-0.28 chore(daily): bump daily-python to ~=0.28.0	2026-04-27 13:46:00 -07:00
Aleix Conchillo Flaqué	d7a196a3f4	docs(changelog): add entry for daily-python 0.28.0 bump	2026-04-27 13:35:14 -07:00

1 2 3 4 5 ...

9262 Commits