pipecat

Author	SHA1	Message	Date
Paul Kompfner	4703df8686	fix: clear 8 more services from pyright ignore list A fourth pass over low-error-count files. Drops 8 files (57 → 49) and full-pyright errors from 525 → 496. Default pyright stays clean. Optional access on transport/client receivers (4 files). Same fix shape as #4359 — a receiver typed `X \| None` accessed without a guard. For "should never happen" cases (caller's lifecycle ensures the field is non-None when the method runs), used `assert` rather than silent early-return so an invariant violation surfaces loudly: - `transports/whatsapp/client.py` (5 errors): `_validate_whatsapp_webhook_request` was typed `bytes` / `str` but called with `bytes \| None` / `str \| None`. Widened the helper signature and pushed the explicit None-check inside (matching its existing empty-string check). Also handled `pipecat_connection.get_answer()` returning `None` — would have crashed at `.get("sdp")` before. - `transports/websocket/client.py` (5 errors): four are the deprecated `websockets.WebSocketClientProtocol` alias (same `# pyright: ignore[reportAttributeAccessIssue]` as the `services/websocket_service.py` fix from earlier in this PR). The fifth was `async for message in self._websocket` — traced the call chain and confirmed `_client_task` is created only after `self._websocket` is assigned and cancelled before it's cleared, so the field is never None when `_client_task_handler` runs. Used `assert`. - `services/openai/stt.py` (4 errors): same pattern. `_receive_messages` is started by `_connect()` only when `self._websocket` is set, and the reconnect loop in `WebsocketService._receive_task_handler` re-establishes it before each retry. `assert` at entry. Plus L478/L483: the `try`/`except ModuleNotFoundError` import-guard makes `websocket_connect` and `State` `<type> \| None`; `__init__` already raises `ImportError` if either is None, so an `assert` at the `_connect_websocket` use site is honest. Plus an L538 `Language \| str` cast (same shape as last batch). - `services/deepgram/flux/base.py` (2 errors): `event = data.get("event")` flowed into `_handle_turn_resumed(event: str)` as `Any \| None`. Tightened with an `isinstance(event, str)` guard before the `FluxEventType(event)` lookup. The other error (`average_confidence > min_confidence` where `min_confidence: float \| None`) was a latent crash on missing confidence data — restored the original `not min_confidence` (which treats both `None` and `0.0` as "no filter") and added an explicit drop-on-missing-confidence-data branch. `gemini_live` Settings/InputParams (vertex). The deprecated `InputParams` declares `modalities: GeminiModalities \| None` and `media_resolution: GeminiMediaResolution \| None`, but their downstream usage at `services/google/gemini_live/llm.py:952,959` calls `.value` on each — `None` would crash. Rather than touching the deprecated input model, translate `None` to the canonical defaults (`GeminiModalities.AUDIO`, `GeminiMediaResolution.UNSPECIFIED`) at the assignment site in `vertex/llm.py`. Also fixed an unrelated annotation bug: `_get_credentials` was annotated `-> str` but actually returns `service_account.Credentials` (used correctly by the caller — only the annotation was wrong). `moondream/vision.py` (3 errors). `frame.format` is `str \| None` but `Image.frombytes(mode, ...)` requires `str`; raise instead of crashing on missing format. The other two errors are pyright thinking the moondream2-custom `encode_image` and `query` methods are `Tensor` (rather than callables) — those are provided by the model code via `trust_remote_code=True` and aren't visible to pyright on the base `AutoModelForCausalLM` type. Scoped `# pyright: ignore[reportCallIssue]` on the two call sites. `transports/base_output.py` (3 errors). Two are `self._mixer.mix(...)` calls in `with_mixer`, a closure invoked only when `self._mixer` is truthy at the call site — captured the mixer to a local variable inside the closure with an `assert`, then used that. Third is the PIL `frombytes(mode, ...)` shape — `frame.format is None` early- return guard at the top of `resize_frame` so the main resize logic reads cleanly. `elevenlabs/tts.py` (4 errors). The payload-building dict at L1271 was typed `dict[str, str \| dict[str, float \| bool]]` — an aspirational shape that matched only the first two assignments. Subsequent code assigned `list[dict[...]]` (pronunciation locators) and bools, all violating the annotation. Same pattern at L926 (the WebSocket-init `msg`). Both widened to `dict[str, Any]`, which is the honest shape for a JSON request payload and what similar code uses elsewhere. Files dropped from the ignore list (57 → 49): services/deepgram/flux/base.py, services/elevenlabs/tts.py, services/google/gemini_live/vertex/llm.py, services/moondream/vision.py, services/openai/stt.py, transports/base_output.py, transports/websocket/client.py, transports/whatsapp/client.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	26a40e2e62	fix: clear 10 more services from pyright ignore list A third pass over low-error-count files in the ignore list. Drops 10 files (67 → 57) and full-pyright errors from 555 → 525. Default pyright stays clean. Optional access guards (4 files). The same fix shape as 9e9b1f39e: a receiver typed `X \| None` accessed without a guard, fixed with a local-var capture or an early return. - `mistral/stt.py`: `_connection.send_audio` could crash if `_connect()` swallowed an exception and left `_connection` unset; drop the audio chunk with a warning instead. `_receive_events` iterating `_connection.events()` got the same defensive narrowing. - `deepgram/flux/stt.py`: `_websocket_url` is set in `_connect` before `_connect_websocket` is called, but pyright doesn't track that across methods — assert at the use site. `websocket.response` is `Response \| None` in the websockets stubs even though it's always populated post-handshake; guarded with a fallback. - `audio/filters/rnnoise_filter.py`: the module-level import sets `RNNoise` to `None` if `pyrnnoise` isn't installed; raise `ImportError` explicitly instead of relying on the existing try- block to catch the `None(...)` call. Also gated `filter()` with `or self._rnnoise is None` so pyright sees the narrowing. - `transports/smallwebrtc/request_handler.py`: `get_answer()` legitimately returns `None`; raise instead of crashing on three subscript accesses. `TTSService` `audio-context` API tightening. Mirroring the `append_to_audio_context` fix from the previous batch: `remove_audio_context` was typed `str` but is called with `str \| None` from `get_active_audio_context_id()` results. Widened to `str \| None` and the `None` handling lives in the function body (early debug log + return) — matching `append_to_audio_context`'s shape. `audio_context_available` keeps its narrow `str` signature; asking "is `None` available?" isn't a meaningful question (`_audio_contexts` is `dict[str, asyncio.Queue]`). The internal call site in `on_turn_context_completed` narrows `_turn_context_id` explicitly before passing it. Side effect: deepgram/tts.py's L307 error clears without local changes. `deepgram/tts.py` (4 errors → 0): the same `push_error(ErrorFrame(...))` latent bug we fixed in resembleai earlier in this PR — `push_error` takes a string; there's a separate `push_error_frame` for frames. Two sites switched. The Optional `_websocket.response` access is guarded the same way as deepgram/flux/stt.py. The `remove_audio_context` error was cleared by the tightening above. `aws/utils.py` (3 errors → 0): `AWSTranscribePresignedURL` declared `session_token: str` but the dict source is `str \| None` (AWS supports long-term IAM creds without a session token). Same for `vocabulary_name`/`vocabulary_filter_name` on `get_request_url`, which were typed `str = ""` even though the body uses truthy checks to skip them. Widened to `str \| None = None` — matches actual runtime semantics. `audio/dtmf/utils.py` (2 errors → 0): `files("...").joinpath(...)` returns a `Traversable`, but `aiofiles.open` wants a real path. For regular pip installs this worked in practice (Traversable was a `Path`), but it would fail for zipped distributions (zipapp, zipimport) where the resource isn't on disk. Wrapped in `importlib.resources.as_file(...)` — the canonical bridge that extracts to a temp file when the resource isn't already on the filesystem. Validated end-to-end: regular install still reads bytes; ad-hoc zipapp test confirmed `as_file` extracts the resource and returns a real Path. `openai/image.py` (2 errors → 0): the `size` arg to `images.generate` is `Literal[...] \| None` in the SDK but our settings field is `str \| None`. Mirrored the `groq/tts.py` hint-not-constraint pattern from the previous batch: defined a module-level `OpenAIImageSize = Literal[...]` alias with a comment attributing the upstream symbol and documenting the cast contract (callers can pass any string; invalid values surface as an OpenAI API error). Also guarded `image.data[0]` (response.data is `list[Image] \| None`). `processors/frameworks/{langchain,strands_agents}.py` (4 + 4 → 0): both processors do `messages[-1]["content"]` on a value typed `LLMStandardMessage \| LLMSpecificMessage` (the latter is a dataclass, not a dict, so `__getitem__` errors). Historically these only handled plain-text user messages, so the fix is two explicit guards (skip if the last message isn't a dict; skip if `content` isn't a string) plus a TODO noting that other shapes (multi-modal content, provider-specific messages) aren't supported yet. langchain's `__get_token_value` also got a small fix where `AIMessageChunk.content` is `str \| list[parts]` but the function declares `-> str`; stringify the list case. strands_agents' surfaced two unrelated narrows: a `graph_exit_node: str \| None` arg gated by an `__init__`-time assert, and `agent.stream_async` reached only when we're not in graph mode. Files dropped from the ignore list (67 → 57): audio/dtmf/utils.py, audio/filters/rnnoise_filter.py, processors/frameworks/langchain.py, processors/frameworks/strands_agents.py, services/aws/utils.py, services/deepgram/flux/stt.py, services/deepgram/tts.py, services/mistral/stt.py, services/openai/image.py, transports/smallwebrtc/request_handler.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	31ff07916f	fix: clear 10 more services from pyright ignore list A second pass over the low-error-count files in the ignore list. Drops 10 files (77 → 67) and full-pyright errors from 580 → 555. Default pyright stays clean. Three coherent shapes plus a handful of one-offs: `Language \| str \| None` → `Language \| None` at STT frame boundaries. `assert_given(self._settings.language)` returns `Language \| str \| None` (strips `_NotGiven`, keeps the rest), but `TranscriptionFrame.language` expects `Language \| None`. In practice both `_settings.language` and SDK-supplied codes resolve to a `Language` enum value, but technically they could be raw strings — and `Language` is a StrEnum, so downstream consumers (which mostly compare/serialize as strings) handle either. Used `cast("Language \| None", ...)` at each call site rather than a runtime-validating helper, so an unrecognised code (e.g. one we haven't added to the enum yet) still flows through unchanged. Cleared azure/stt.py, aws/stt.py, gradium/stt.py; mistral/stt.py keeps the cast at the SDK boundary (storing under `_detected_language: Language \| None`) but stays in the ignore list because of two unrelated Optional-access errors. aiobotocore `async with` stub gap. `aioboto3.Session().client(...)` is an async context manager at runtime but its stubs don't advertise `__aenter__`/`__aexit__` to pyright. Scoped `# pyright: ignore[reportGeneralTypeIssues]` on the two affected sites: aws/agent_core.py and aws/tts.py. aws/tts.py also had a latent bug on the no-`AudioStream` path: the original code set `audio_data = None` and then crashed in `resample(...)` and `len(audio_data)` below; replaced with an early `return` after logging — matches the convention elsewhere (OpenAI TTS, etc.) of not recording usage metrics on the error path. heygen `event_id: str \| None` → `str` at transport→client boundary. Three call sites in transports/heygen/transport.py passed `self._event_id` (`str \| None`) into client methods that take `str`. Added a guard at each: `agent_speak_end` and `interrupt` only fire when `_event_id` is set; `write_audio_frame` warn-and-drops when there's no active bot event rather than sending a malformed message. `OpenAIResponsesLLMInvocationParams` TypedDict. `get_llm_invocation_params` always sets both `input` and `tools` in the same dict literal, but the TypedDict was `total=False` so direct subscript access (`invocation_params["input"]`) tripped `reportTypedDictNotRequiredAccess` in services/openai/responses/llm.py. Marked both keys `Required[...]`; `instructions` stays non-required since it's only added when a system instruction is present. Latent bug in heygen/api_interactive_avatar.py: the code accessed `request_data.voice.voiceId` and `request_data.voice.elevenlabsSettings`, but those names are Pydantic aliases; the actual attribute names (used for attribute access) are `voice_id` and `elevenlabs_settings`. Switched to the field names — those camelCase accesses would have raised AttributeError at runtime if `voice` was set. Other small fixes: - assemblyai/stt.py: the deprecated `connection_params=` init path was reading `formatted_finals` and `word_finalization_max_wait_time` off `AssemblyAIConnectionParams`, but those fields were never on the deprecated input model — they were added to Settings later. Removed the reads (with a comment noting they're only available via the canonical `settings=...` API); the deprecated input model is unchanged. - rtvi/processor.py: two `about: Mapping[str, Any] = None` parameter signatures — declared `Mapping`, defaulted to `None`, and both function bodies already handled the None case. Widened to `Mapping[str, Any] \| None = None`. - aws/stt.py: `subprotocols=["mqtt"]` failed against websockets' `Sequence[Subprotocol] \| None` (Subprotocol is a NewType wrapper). Wrapped: `subprotocols=[Subprotocol("mqtt")]`. Files dropped from the ignore list (77 → 67): processors/frameworks/rtvi/processor.py, services/assemblyai/stt.py, services/aws/agent_core.py, services/aws/stt.py, services/aws/tts.py, services/azure/stt.py, services/gradium/stt.py, services/heygen/api_interactive_avatar.py, services/openai/responses/llm.py, transports/heygen/transport.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	814f00ce41	fix: clear 19 TTS/STT/etc. services from pyright ignore list Several adjacent fix shapes that together drop 19 files from the pyrightconfig.json ignore list (96 → 77) and full-pyright errors from 605 → 580. Default pyright stays clean. TTS voice/context_id None handling — most files in this batch had a single error of the shape "value typed `T \| None` passed where `T` is required" coming out of `assert_given(self._settings.voice)` (which strips `_NotGiven` but not `None`) or `get_active_audio_context_id()`. Two patterns: - For services where a missing voice means the request can't proceed (hume, openai, xtts, groq, kokoro, piper), added an explicit None check. Inside `run_tts` we yield an `ErrorFrame` and return — matching each service's existing error-emission style (a few wrap `Exception` broadly and were fine; openai/hume/xtts had narrower or no try blocks so a bare `raise ValueError` would have escaped uncaught). Piper validates in `__init__`, where failing fast at construction is the right shape. OpenAI also gained a `voice not in VALID_VOICES` guard with a clear message listing supported voices. - For services where a missing audio context just means "skip this message" (fish, lmnt, smallest, sarvam, neuphonic), widened `TTSService.append_to_audio_context`'s `context_id` signature to `str \| None`. The function body already explicitly handled the None case with a debug log + early return, so the prior `str` annotation was a lie; making it honest cleared call sites without local guards. inworld's `_close_context` got the same treatment. google.genai imports — switched `from google import genai` to `import google.genai as genai` in google/image.py and google/llm.py. The dotted form sidesteps a PEP 420 namespace-package stub gap (the `google` namespace stubs come from a different distribution and don't declare `genai`), which means pyright now resolves `genai` to the real module rather than `Unknown`. IDE autocomplete on `genai.<x>` works for the first time. In image.py this surfaced three latent bugs that the `Unknown` resolution had been hiding (model was `str \| _NotGiven \| None` not narrowed before passing to the SDK; two spots accessed `.image_bytes` on an `Image \| None` without a guard) — all fixed. llm.py's dotted import surfaced 8 errors (Content-list typing nuances, internal `_api_client` access, a few small Optionals); deferred to a future pass since they're outside this commit's scope, so the file stays in the ignore list with the dotted import. Latent bug fixes spotted along the way: - resembleai/tts.py was calling `push_error(ErrorFrame(...))`, but `push_error` takes a string — there's a separate `push_error_frame` for the frame case. Switched to the right method. - openai/base_llm.py: `max_completion_tokens` was the only sibling field on `OpenAILLMSettings` missing `\| None` in its type, which caused the assignment in openai/llm.py from `params.max_completion_tokens` (`int \| None`) to fail. Added `\| None` for consistency with `max_tokens` etc. - heygen/base_api.py: `livekit_url: str = None` and `ws_url: str = None` declared `str` while defaulting to `None`. Removed the bogus defaults — both fields are required at construction in every in-tree call site, and the previous `str = None` was a Pydantic footgun. Other small ones: gladia/stt.py needed a None guard on `_session_url` before `websocket_connect`; openrouter/llm.py's `build_chat_completion_params` override widened to `dict[str, Any]` diverging from the parent's `OpenAILLMInvocationParams` — restored the parent's type; neuphonic/tts.py guarded the receive loop's `async for message in self._websocket` with a local-variable narrowing matching the pattern from 9e9b1f39e. groq/tts.py: tightened `output_format`'s typing to `Literal["flac","mp3","mulaw","ogg","wav"] \| str = "wav"`. The literal side gives IDE autocomplete hints for the currently-supported set; the `\| str` side keeps callers unblocked if groq adds a new format before this list is updated. A `cast` at the API boundary satisfies groq's stricter `Literal` parameter type. The literal alias mirrors the inlined Literal on `groq.resources.audio.speech.AsyncSpeech.create`'s `response_format` (the SDK doesn't export it as a named symbol). websocket_service.py: scoped `# pyright: ignore[reportAttributeAccessIssue]` on `websockets.WebSocketClientProtocol`. That alias is now a deprecated re-export from the legacy submodule and pyright doesn't surface it on the top-level `websockets` namespace; runtime is fine. Migrating to `websockets.ClientConnection` is a separate piece of work (transports/websocket/client.py uses the same alias four times) and left for a future commit. Files dropped from the ignore list: fish/tts.py, gladia/stt.py, google/image.py, groq/tts.py, heygen/base_api.py, hume/tts.py, inworld/tts.py, kokoro/tts.py, lmnt/tts.py, neuphonic/tts.py, openai/llm.py, openai/tts.py, openrouter/llm.py, piper/tts.py, resembleai/tts.py, sarvam/tts.py, smallest/tts.py, websocket_service.py, xtts/tts.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	96756bc1f6	fix: clean up TypedDict / Optional patterns in 6 more LLM adapters Same approach as the previous round — apply boundary casts where the code does dict-style mutation on TypedDict-typed values, narrow at return sites, and document the LLMSpecificMessage limitation in realtime adapters that pack history into a single text message. aws_nova_sonic_adapter.py — pure typing + small narrowing fixes: - Filter LLMSpecific items in `_from_universal_context_messages` (documented). - `_from_universal_context_message` now declared `-> AWSNovaSonicConversationHistoryMessage \| None` (it already had paths returning None implicitly). - `get_messages_for_logging` returns `dict[str, Any]` per element via `dataclasses.asdict`, matching the declared return type. - Use a local `role` variable so pyright keeps the narrowing across the truthy-content guard. grok_realtime_adapter.py / inworld_realtime_adapter.py — same shape of fix as `open_ai_realtime_adapter.py` from the previous batch. The two files are essentially copies of the OpenAI Realtime adapter, so the same template applies: cast at the boundary, filter LLMSpecificMessage with a documented note, replace the implicit-None fallthrough with `raise ValueError`, and switch the `text_content +=` pattern (which fails when one of the parts is None) to a `text_parts.append(...)` + `" ".join(...)` pattern. open_ai_adapter.py — pure typing. Cast at the `OpenAILLMInvocationParams` return, narrow the system-instruction warning's `initial_content` to `str \| None`, and cast the custom-tools list to `list[ChatCompletionToolParam]`. open_ai_responses_adapter.py — pure typing. Same shape: narrow `first_content` to `str \| None` for the warning resolver, cast the constructed dict literals at append sites where the target is `ResponseInputItemParam`, and cast `get_messages_for_logging`'s return to the declared `list[dict[str, Any]]`. processors/aggregators/llm_context.py — pure typing. Cast the deepcopied message in the redaction loop in `get_messages` to `dict[str, Any]` and the create_image/audio_message return-dict literals to `LLMContextMessage`. Removes 6 newly-clean files from the pyright ignore list. Net: -77 pyright errors (full-config: 680 -> 603).	2026-05-01 09:36:14 -04:00
Paul Kompfner	5e24027fd5	fix: type fixes (and a few latent bug fixes) in 4 LLM adapters Same shape of fix we applied to anthropic_adapter.py earlier — these adapters do dict-style mutation on values typed as ChatCompletionMessageParam (a union of TypedDicts) or against Optional fields. Apply boundary casts (`cast(dict[str, Any], ...)` for the mutation block, cast back to the TypedDict at return sites). Most changes are pure typing (rename + cast); a handful in gemini and openai_realtime are small defensive bug fixes for code paths that were latently broken by Optional fields slipping through: perplexity_adapter.py — pure typing. Cast the deepcopied messages to `list[dict[str, Any]]` for the role-merging / system-conversion / trailing-assistant-removal transformations and cast back to ChatCompletionMessageParam at the return. bedrock_adapter.py — pure typing. Cast the message to `dict[str, Any]` at the top of `_from_standard_message` for the tool-result / tool-use / image-content transformations. Cast the constructed dict at the return site of `get_llm_invocation_params`. gemini_adapter.py — typing + several None guards on Content.parts and related Optional fields. Each guard turns a latent `TypeError`/`AttributeError` (when the type-system-allowed None showed up at runtime) into a defensive skip — the type annotations say these can be None and we now handle that. open_ai_realtime_adapter.py: - Typing: cast the deepcopied messages, cast back where needed. - LLMSpecificMessage handling: previously the function would crash on the first `.get()` call if any LLMSpecificMessage was in the list. Filter them out and document the limitation — this adapter's pack-into-single-text-message strategy doesn't compose with opaque per-provider payloads. - Real bug fix: `events.ConversationItem` is a Pydantic BaseModel, not a TypedDict. The bulk-packing path was constructing a raw dict where a ConversationItem was expected. Replaced with proper constructor calls (matches what the single-user-message path already does). - Real bug fix: `_from_universal_context_message` was declared `-> events.ConversationItem` but on the unhandled-message fallthrough it logged and returned None implicitly. Raise ValueError so the violation is loud, not silent. Removes 4 newly-clean files from the pyright ignore list: adapters/services/{perplexity,bedrock,gemini,open_ai_realtime}_adapter.py. Net: -95 pyright errors (full-config: 775 -> 680).	2026-05-01 09:36:14 -04:00
Paul Kompfner	bec407ce3a	fix: handle Optional websocket/client receivers across services Pyright flagged 19 sites where `await self._<connection>.send/recv/...` was called on a receiver typed `X \| None`. Each kind of call site needed a slightly different fix to be both type-safe and behaviour- preserving: Streaming/user-facing paths (early return + warn — drop and warn is the right runtime fail-safe when reconnect didn't succeed): - cartesia/stt.py (run_stt) - soniox/stt.py (_send_keepalive) - elevenlabs/tts.py (run_tts — yields ErrorFrame and returns) - deepgram/sagemaker/tts.py (run_tts) - transports/lemonslice/transport.py (send_message) - transports/tavus/transport.py (send_message) "Should never happen" cases (early return with comment, no warn — caller already gated on a separate `_is_*` check, so a warn would be noise): - deepgram/flux/stt.py (transport methods, gated by _transport_is_active) - deepgram/flux/sagemaker/stt.py (same) - stt_service.py (_send_keepalive, gated by _is_keepalive_ready) - elevenlabs/stt.py (_send_keepalive, same) - llm_service.py (_ws_recv — raises ConnectionError to match _ensure_connected's contract) - heygen/client.py (receive loop, gated by self._connected) Just-assigned-above (use a local variable so pyright keeps the narrowing across statements): - lmnt/tts.py - gradium/stt.py - fish/tts.py Other: - transports/websocket/server.py — used the existing local `websocket` parameter in scope instead of `self._websocket` for the close call. - websocket_service.py — `send_with_retry` raises ConnectionError when `self._websocket` is None inside the existing try-block, so the broad `except Exception` triggers reconnect just as it would on a real send failure (preserving the prior behaviour where None silently fell through to the AttributeError-driven reconnect path). Drops three now-clean files from the pyright ignore list: cartesia/stt.py, elevenlabs/stt.py, and soniox/stt.py.	2026-05-01 09:36:14 -04:00
Paul Kompfner	5e1bb4cbe5	chore: remove anthropic_adapter.py from pyright ignore list The file is now clean under pyright's basic type checking, so it can move out of the ignore list and be enforced on every run.	2026-05-01 09:36:14 -04:00
Mark Backman	3e5aabc5f2	fix(cartesia): guard HTTP session before use Pyright flagged the .post() call on a possibly-None _session. Raise a clear RuntimeError if start() wasn't called instead of crashing on the attribute access.	2026-04-30 09:34:02 -04:00
Paul Kompfner	0302f6d05c	chore(pyright): drop newly-clean files from ignore list asyncai/tts and google/vertex/llm are now clean after the missing-None sweep (both benefited from the TTSSettings.voice / LLMSettings cascades). - src/pipecat/services/asyncai/tts.py - src/pipecat/services/google/vertex/llm.py	2026-04-23 18:18:00 -04:00
Paul Kompfner	92610944af	chore(pyright): drop newly-clean files from ignore list Three files no longer have pyright errors after the is_given / assert_given sweep — remove them from the ignore list (which serves as a live todo of files with remaining type errors). - src/pipecat/processors/gstreamer/pipeline_source.py - src/pipecat/services/camb/tts.py - src/pipecat/services/speechmatics/tts.py	2026-04-23 17:44:17 -04:00
Mark Backman	b0962861c8	Acknowledge Tkinter's GC-reference idiom with a scoped type ignore Tkinter's `Label` only stores `PhotoImage` references at the C level, so Python GC eats them unless something on the Python side keeps a reference. The canonical fix is to stash the reference on the widget itself: `label.image = photo`. Tkinter widgets are plain Python objects, so the assignment works at runtime, but the stub declares no `image` attribute (correctly — there isn't one; we're adding it). Narrow the suppression to `# type: ignore[attr-defined]` on the one line. The existing comment above the assignment already documents why.	2026-04-22 12:19:16 -04:00
Mark Backman	ec7c35fe98	Move Mistral message fixups into MistralLLMAdapter Mistral imposes three conversation-history quirks on top of the OpenAI-compatible wire format: tool messages must be followed by an assistant message; non-initial system messages are rejected; trailing assistant messages require `prefix=True`. These rules were applied inline in `MistralLLMService.build_chat_completion_params`, which is the wrong layer — every other provider with OpenAI-compatible-but-quirky shape (Perplexity, etc.) owns its transformations in a `BaseLLMAdapter` subclass that runs during `get_llm_invocation_params`. Create `MistralLLMAdapter(OpenAILLMAdapter)` on the Perplexity template and wire it in via the existing `adapter_class` dispatch. The service now only handles Mistral-specific request-level mapping (`random_seed` in place of `seed`), and the message shape concerns live with other provider format logic. No behavior change. The transform function casts to `list[dict[str, Any]]` internally because mutating `role` and attaching Mistral's non-standard `prefix` field both step outside OpenAI's TypedDict contract; the cast at the return boundary encodes that we're emitting Mistral's extended schema, not OpenAI's.	2026-04-22 12:17:46 -04:00
Mark Backman	10b86b4bbe	Coerce inspect.getdoc() None to empty string before parsing `inspect.getdoc()` returns `str \| None`, but `docstring_parser.parse()` requires `str`. Functions without a docstring produced `None`, which the type checker correctly flagged. Coerce to `""` at the call site. `docstring_parser.parse("")` returns an empty docstring whose `.description` and `.params` are already handled by the surrounding `or ""` fallbacks, so runtime behavior is unchanged.	2026-04-22 12:01:00 -04:00
Mark Backman	8ec56092c0	Remove duplicate ResponseCreated type	2026-04-22 11:58:15 -04:00
Mark Backman	0c3c5e5c7d	Widen ToolsSchema.standard_tools to Sequence for covariance `ToolsSchema.__init__` declared `standard_tools: list[FunctionSchema \| DirectFunction]`. Callers (`BaseLLMAdapter`, `MCPService`) pass in `list[FunctionSchema]`, which is not assignable to the union list because `list` is invariant in its element type. Widen the parameter to `Sequence[...]` (covariant) so `list[X]` and `list[X \| Y]` both fit. A narrower `list[FunctionSchema]` is still accepted, and nothing in this class mutates the argument — the constructor immediately copies it via `_map_standard_tools`. Also correct the `custom_tools` property return type to include `None`, matching the stored `_custom_tools` field. This single edit clears the pyright errors for three ignore-list entries: `tools_schema.py`, `base_llm_adapter.py`, and `mcp_service.py`.	2026-04-22 11:54:20 -04:00
Mark Backman	b64ed3f9e2	Narrow settings.model at service boundaries, not via truthiness Two services were reading `_settings.model` (typed `str \| _NotGiven \| None` because NOT_GIVEN is the default) and coercing it with `or ""` or similar. `_NotGiven.__bool__` returns False, so the runtime behavior happened to work, but the type was a lie — pyright saw `str \| _NotGiven` flowing into APIs that required `str` or `str \| None`. - `AIService._sync_model_name_to_metrics`: use `isinstance(model, str)` narrowing with an empty-string fallback. Equivalent runtime behavior, honest type, no truthiness dependency on a sentinel. - `SarvamLLMService.__init__`: validate the model is a real string before handing it to `_validate_model(str)`. A non-string model at this point is a configuration bug; raise `ValueError` so the error is clear and survives `python -O` (unlike an assert).	2026-04-22 11:52:20 -04:00
Mark Backman	5872006d6b	Encode lazy-init invariants at the right site, not at read sites Three spots had the same shape: a field starts None, a later method populates it, a read site later reads it. Pyright can't track the cross-method invariant. Rather than spray assertions at the read sites, fix each site at the structural level: - `FastAPIWebsocketInputTransport._monitor_websocket` now takes the session timeout as an argument. The task-creation site already guards on truthiness, so the call can pass the non-None value directly and the method's signature tells the truth. - `FrameProcessorMetrics.task_manager` raises `RuntimeError` instead of asserting. Asserts are stripped under `python -O`; a real raise keeps the runtime safety net and still narrows the type for pyright. - `SOXRStreamAudioResampler._maybe_initialize_sox_stream` returns the initialized stream. Callers use the return value and never touch the Optional `_soxr_stream` attribute, so narrowing stays inside the init method where the invariant is established.	2026-04-22 11:45:18 -04:00
Mark Backman	457eb7aa92	Mark abstract image/vision generators as real async generators `ImageGenService.run_image_gen` and `VisionService.run_vision` were declared `async def ... -> AsyncGenerator[Frame, None]` with `pass` bodies. Without a `yield` anywhere in the body, Python treats the function as a coroutine returning an `AsyncGenerator`, not as an async generator itself, so callers got a coroutine where they expected an iterator. Add `raise NotImplementedError; yield` so the body contains a yield (making this a real async generator) while still raising cleanly if a subclass ever calls `super().run_*` by mistake.	2026-04-22 11:19:23 -04:00
Mark Backman	14cd476b20	Drop pyright ignores for services fixed by run_stt/run_tts widening Deepgram STT, Gradium TTS, Smallest STT, and xAI STT/TTS had exactly one pyright error each, all of them the AsyncGenerator return-type mismatch resolved in `08fe9157c`. Remove them from the ignore list.	2026-04-22 11:09:27 -04:00
Mark Backman	7bba74ebd6	Expand pyright coverage to full src/pipecat with per-file ignores Previously, six modules (adapters, audio, processors, serializers, services, transports) were ignored wholesale. Many files in those modules already pass type checking, but we had no way to protect them from regressions or make the remaining work visible. Switch the include list to src/pipecat so any new module is checked by default, and replace directory-level ignores with the 140 specific files that still fail. This puts 189 previously-untyped files under type checking immediately and turns the remaining work into a concrete, shrinking TODO list.	2026-04-22 09:45:31 -04:00
Mark Backman	df9642eb5a	Fix type errors in serializers and add to pyright checked set Moves src/pipecat/serializers into pyright's include list. Narrows self._params to each subclass's InputParams in exotel, vonage, plivo, twilio, genesys, and telnyx. In protobuf.py, renames the reassigned frame local to avoid clobbering its Frame type and silences two dynamic attribute accesses on the generated frames_pb2 module. Also aligns telnyx and plivo hangup validation with twilio: if auto_hang_up=True (the default) but required credentials are missing, __init__ now raises ValueError instead of silently logging a warning at call-end time. Previously a misconfigured serializer would construct fine and fail to hang up the call later, leaving a phantom billable session.	2026-04-21 18:12:54 -04:00
Mark Backman	21f5cfe21a	Fix type errors in utils and add to pyright checked set	2026-04-21 16:47:12 -04:00
Mark Backman	c244a950eb	Add src/pipecat/tests to include list, alphabetize list	2026-04-21 16:24:53 -04:00
Mark Backman	847bd8af4b	Remove src/pipecat/sync which doesn't exist	2026-04-21 16:21:46 -04:00
Mark Backman	10e58d6e42	Fix type errors in scripts and add to pyright checked set	2026-04-21 16:17:49 -04:00
Mark Backman	58a17c7b1b	Include examples in type checking Remove `examples/` from the `pyrightconfig.json` ignore list and fix the resulting type errors across all example files. Common fixes: - Required API keys: `os.getenv("X")` -> `os.environ["X"]` so the return type is `str` rather than `str \| None`, and misconfiguration fails fast. - Narrow `LLMContextMessage` union members with `isinstance(..., dict)` before dict-style access. - `assert isinstance(params.llm, ...)` before calling service-specific methods that aren't on the base `LLMService`. - Guard optional frame fields (e.g. `LLMSearchResponseFrame.search_result`) before use.	2026-04-21 15:43:31 -04:00
Mark Backman	74d11dc0aa	Silence pyright diagnostics for unchecked modules in IDE Pylance analyzes open files even when they're outside the `include` set, producing noise in the editor. Adding these paths to `ignore` suppresses diagnostics without affecting import resolution.	2026-04-19 09:19:15 -04:00
Mark Backman	cb1463f9f1	Fix type errors in runner and add to pyright checked set Make required parameters non-optional: LiveKitRunnerArguments.token, _create_telephony_transport args. Use os.environ[] instead of os.getenv() for required WhatsApp env vars. Guard spec/loader None in module loading. Tighten sip_caller_phone guard in daily.py.	2026-04-17 09:39:55 -04:00
Mark Backman	ab91047300	Fix type errors in pipeline and add to pyright checked set Use Sequence[FrameProcessor] instead of list[FrameProcessor] in Pipeline, ServiceSwitcher, and ServiceSwitcherStrategy parameters to accept subtype lists. Add cast() in LLMSwitcher for narrowed return types. Guard against None in task_observer._send_to_proxy and replace hasattr with truthiness check in task._cleanup.	2026-04-16 21:47:11 -04:00
Mark Backman	3127cc6161	Fix type errors in turns and add to pyright checked set Widen base strategy process_frame return types to ProcessFrameResult \| None to match actual behavior (None treated as CONTINUE). Give UserTurnCompletionLLMServiceMixin a FrameProcessor base class so pyright can see create_task, cancel_task, process_frame, and push_frame.	2026-04-16 21:33:43 -04:00
Mark Backman	c6a1837844	Fix type errors in extensions and add to pyright checked set Tighten LLMMessagesAppendFrame and LLMMessagesUpdateFrame message fields from list[dict] to list[LLMContextMessage] to match actual usage. Add type annotations on inline message lists in IVR navigator and voicemail detector.	2026-04-16 21:22:46 -04:00
Mark Backman	aa355e3d32	Fix type errors in observers and add to pyright checked set Group three co-assigned fields (_start_frame_id, _start_frame_arrival_ns, _start_wall_clock) into a single _StartFrameInfo dataclass. This makes the "always set together" invariant structural rather than implicit, and fixes the incorrect str \| None annotation on _start_frame_id (Frame.id is int).	2026-04-16 18:25:10 -04:00
Mark Backman	9bd51cd88c	Add incremental pyright type checking with CI enforcement Add pyrightconfig.json with basic type checking for zero-error modules (clocks, metrics, transcriptions, frames) and enforce via CI. The include list will expand as modules are fixed.	2026-04-16 18:04:42 -04:00

34 Commits