LLMUserAggregator: use queue_task_frame() to push user speaking frames

FrameProcessor: add queue_task_frame() and queue_task_frames()
PipelineTask: use QueueTaskFrame
2025-12-30 15:18:38 -08:00 · 2025-12-30 15:18:38 -08:00 · 2025-12-30 15:16:52 -08:00 · 2025-12-30 15:16:52 -08:00
736 changed files with 9892 additions and 25714 deletions
--- a/.claude/skills/changelog/SKILL.md
+++ b/.claude/skills/changelog/SKILL.md
@@ -1,47 +0,0 @@
---
-name: changelog
-description: Create changelog files for important commits in a PR
---
-
-Create changelog files for the important commits in this PR. The PR number is provided as an argument.
-
-## Instructions
-
-1. Skip changelog for: documentation-only, internal refactoring, test-only, CI changes.
-
-2. First, check what commits are on the current branch compared to main:
-   ```
-   git log main..HEAD --oneline
-   ```
-
-3. For each significant change, create a changelog file in the `changelog/` folder using the format:
-   Allowed types: `added`, `changed`, `deprecated`, `removed`, `fixed`, `security`, `performance`, `other`
-   - `{PR_NUMBER}.added.md` - for new features
-   - `{PR_NUMBER}.added.2.md`, `{PR_NUMBER}.added.3.md` - for additional entries of the same type
-   - `{PR_NUMBER}.changed.md` - for changes to existing functionality
-   - `{PR_NUMBER}.fixed.md` - for bug fixes
-   - `{PR_NUMBER}.deprecated.md` - for deprecations
-   - `{PR_NUMBER}.removed.md` - for removed features
-   - `{PR_NUMBER}.security.md` - for security fixes
-   - `{PR_NUMBER}.performance.md` - for performance improvements
-   - `{PR_NUMBER}.other.md` - for other changes
-
-4. Each changelog file should at least contain a main single line starting with `- ` followed by a clear description of the change.
-
-5. If the change is complicated, changelog files can have indented lines after the main line with additional details or code samples.
-
-6. Use ⚠️ emoji prefix for breaking changes.
-
-## Example
-
-For PR #3519 with a new feature and a bug fix:
-
-`changelog/3519.added.md`:
-```
- Added `SomeNewFeature` for doing something useful.
-```
-
-`changelog/3519.fixed.md`:
-```
- Fixed an issue where something was not working correctly.
-```
--- a/.claude/skills/docstring/SKILL.md
+++ b/.claude/skills/docstring/SKILL.md
@@ -1,257 +0,0 @@
---
-name: docstring
-description: Document a Python module and its classes using Google style
---
-
-Document a Python module and its classes using Google-style docstrings following project conventions. The class name is provided as an argument.
-
-## Instructions
-
-1. First, find the class in the codebase:
-   ```
-   Search for "class ClassName" in src/pipecat/
-   ```
-
-2. If multiple files contain that class name:
-   - List all matches with their file paths
-   - Ask the user which one they want to document
-   - Wait for confirmation before proceeding
-
-3. Once the file is identified, read the module to understand its structure:
-   - Identify all classes, functions, and important type aliases
-   - Understand the purpose of each component
-
-4. Apply documentation in this order:
-   - Module docstring (at top, after imports)
-   - Class docstrings
-   - `__init__` methods (always document constructor parameters)
-   - Public methods (not starting with `_`)
-   - Dataclass/config classes with field descriptions
-
-5. Skip documentation for:
-   - Private methods (starting with `_`)
-   - Simple dunder methods (`__str__`, `__repr__`, `__post_init__`)
-   - Very simple pass-through properties
-   - **Already documented code** - If a class, method, or function already has a complete docstring that follows the project style, do not modify it. A docstring is complete if it has:
-     - A one-line summary
-     - Args section (if it has parameters)
-     - Returns section (if it returns something meaningful)
-   - Only add or improve documentation where it is missing or incomplete
-
-## Module Docstring Format
-
-```python
-"""[One-line description of module purpose].
-
-[Optional: Longer explanation of functionality, key classes, or use cases.]
-"""
-```
-
-Example:
-```python
-"""Neuphonic text-to-speech service implementations.
-
-This module provides WebSocket and HTTP-based integrations with Neuphonic's
-text-to-speech API for real-time audio synthesis.
-"""
-```
-
-## Class Docstring Format
-
-```python
-class ClassName:
-    """One-line summary describing what the class does.
-
-    [Longer description explaining purpose, behavior, and key features.
-    Use action-oriented language.]
-
-    [Optional: Event handlers, usage notes, or important caveats.]
-    """
-```
-
-Example:
-```python
-class FrameProcessor(BaseObject):
-    """Base class for all frame processors in the pipeline.
-
-    Frame processors are the building blocks of Pipecat pipelines, they can be
-    linked to form complex processing pipelines. They receive frames, process
-    them, and pass them to the next or previous processor in the chain.
-
-    Event handlers available:
-
-    - on_before_process_frame: Called before a frame is processed
-    - on_after_process_frame: Called after a frame is processed
-
-    Example::
-
-        @processor.event_handler("on_before_process_frame")
-        async def on_before_process_frame(processor, frame):
-            ...
-
-        @processor.event_handler("on_after_process_frame")
-        async def on_after_process_frame(processor, frame):
-            ...
-    """
-```
-
-Note: When listing event handlers, do NOT use backticks. Include an `Example::` section (with double colon for Sphinx) showing the decorator pattern and function signature for each event.
-
-## Constructor (`__init__`) Format
-
-```python
-def __init__(self, *, param1: Type, param2: Type = default, **kwargs):
-    """Initialize the [ClassName].
-
-    Args:
-        param1: Description of param1 and its purpose.
-        param2: Description of param2. Defaults to [default].
-        **kwargs: Additional arguments passed to parent class.
-    """
-```
-
-Example:
-```python
-def __init__(
-    self,
-    *,
-    api_key: str,
-    voice_id: Optional[str] = None,
-    sample_rate: Optional[int] = 22050,
-    **kwargs,
-):
-    """Initialize the Neuphonic TTS service.
-
-    Args:
-        api_key: Neuphonic API key for authentication.
-        voice_id: ID of the voice to use for synthesis.
-        sample_rate: Audio sample rate in Hz. Defaults to 22050.
-        **kwargs: Additional arguments passed to parent InterruptibleTTSService.
-    """
-```
-
-## Method Docstring Format
-
-```python
-async def method_name(self, param1: Type) -> ReturnType:
-    """One-line summary of what method does.
-
-    [Longer description if behavior isn't obvious.]
-
-    Args:
-        param1: Description of param1.
-
-    Returns:
-        Description of return value.
-
-    Raises:
-        ExceptionType: When this exception is raised.
-    """
-```
-
-Example:
-```python
-async def put(self, item: Tuple[Frame, FrameDirection, FrameCallback]):
-    """Put an item into the priority queue.
-
-    System frames (`SystemFrame`) have higher priority than any other
-    frames. If a non-frame item is provided it will have the highest priority.
-
-    Args:
-        item: The item to enqueue.
-    """
-```
-
-## Dataclass/Config Format
-
-```python
-@dataclass
-class ConfigName:
-    """One-line description of configuration.
-
-    [Explanation of when/how to use this config.]
-
-    Parameters:
-        field1: Description of field1.
-        field2: Description of field2. Defaults to [default].
-    """
-
-    field1: Type
-    field2: Type = default_value
-```
-
-Example:
-```python
-@dataclass
-class FrameProcessorSetup:
-    """Configuration parameters for frame processor initialization.
-
-    Parameters:
-        clock: The clock instance for timing operations.
-        task_manager: The task manager for handling async operations.
-        observer: Optional observer for monitoring frame processing events.
-    """
-
-    clock: BaseClock
-    task_manager: BaseTaskManager
-    observer: Optional[BaseObserver] = None
-```
-
-## Enum Documentation Format
-
-```python
-class EnumName(Enum):
-    """One-line description of the enum purpose.
-
-    [Longer description of how the enum is used.]
-
-    Parameters:
-        VALUE1: Description of VALUE1.
-        VALUE2: Description of VALUE2.
-    """
-
-    VALUE1 = 1
-    VALUE2 = 2
-```
-
-## Writing Style Guidelines
-
- **Concise and professional** - No casual language or filler words
- **Action-oriented** - Start with verbs: "Processes...", "Manages...", "Converts..."
- **Purpose before implementation** - Explain WHY before HOW
- **Clear parameter descriptions** - Include type hints, defaults, and purpose
- **No redundant type info** - Type hints are in the signature, don't repeat in description
- **Use backticks for code references** - Wrap class names, method names, event names, parameter names, and code snippets in backticks
-
-Good: "Neuphonic API key for authentication."
-Bad: "str: The API key (string) that is used for authenticating with Neuphonic."
-
-Good: "Triggers `on_speech_started` when the `VADAnalyzer` detects speech."
-Bad: "Triggers on_speech_started when the VADAnalyzer detects speech."
-
-## Deprecation Notice Format
-
-When documenting deprecated code:
-
-```python
-"""[Description].
-
-.. deprecated:: X.X.X
-    `ClassName` is deprecated and will be removed in a future version.
-    Use `NewClassName` instead.
-"""
-```
-
-## Checklist
-
-Before finishing, verify:
-
- [ ] Module has a docstring at the top (after copyright header and imports)
- [ ] All public classes have docstrings
- [ ] All `__init__` methods document their parameters
- [ ] All public methods have docstrings with Args/Returns/Raises as needed
- [ ] Dataclasses use "Parameters:" section for field descriptions
- [ ] Enums document each value in "Parameters:" section
- [ ] Writing is concise and action-oriented
- [ ] No documentation added to private methods (starting with `_`)
- [ ] Existing complete docstrings were left unchanged
--- a/.claude/skills/pr-description/SKILL.md
+++ b/.claude/skills/pr-description/SKILL.md
@@ -1,128 +0,0 @@
---
-name: pr-description
-description: Update a GitHub PR description with a summary of changes
---
-
-Update a GitHub pull request description based on the changes in the PR.
-
-## Arguments
-
-```
-/pr-description <PR_NUMBER> [--fixes <ISSUE_NUMBERS>]
-```
-
- `PR_NUMBER` (required): The pull request number to update
- `--fixes` (optional): Comma-separated issue numbers that this PR fixes (e.g., `--fixes 123,456`)
-
-Examples:
- `/pr-description 3534`
- `/pr-description 3534 --fixes 123`
- `/pr-description 3534 --fixes 123,456,789`
-
-## Instructions
-
-1. First, gather information about the PR:
-   - Use GitHub plugin to get PR details (title, current description, base branch)
-   - Use local git to get commits: `git log main..HEAD --oneline`
-   - Use local git to get the diff: `git diff main..HEAD`
-   - Parse any `--fixes` argument for issue numbers
-
-2. Check the existing PR description:
-   - If it already has a complete, accurate description that reflects the changes, do nothing
-   - If it's missing sections, incomplete, or outdated compared to the actual changes, proceed to update
-   - If it only has the template placeholder text, generate a full description
-
-3. Analyze the changes:
-   - Understand the purpose of each commit
-   - Identify any breaking changes (API changes, removed features, behavior changes)
-   - Look for new features, bug fixes, refactoring, or documentation changes
-   - Collect issue numbers from:
-     - The `--fixes` argument (if provided)
-     - Commit messages (patterns like "Fixes #123", "Closes #456", "Resolves #789")
-
-4. Generate or update the PR description with these sections:
-
-## PR Description Format
-
-### Summary (always include)
-
-Brief bullet points describing what changed and why. Focus on the *purpose* and *impact*, not implementation details.
-
-```markdown
-## Summary
-
- Added X to enable Y
- Fixed bug where Z would happen
- Refactored W for better maintainability
-```
-
-### Breaking Changes (include only if applicable)
-
-Document any changes that affect existing users or APIs.
-
-```markdown
-## Breaking Changes
-
- `ClassName.method()` now requires a `param` argument
- Removed deprecated `old_function()` - use `new_function()` instead
-```
-
-### Testing (include when non-obvious)
-
-How to verify the changes work. Skip for trivial changes.
-
-```markdown
-## Testing
-
- Run `uv run pytest tests/test_feature.py` to verify the fix
- Example usage: `uv run examples/new_feature.py`
-```
-
-### Fixes (include if issues are provided or found in commits)
-
-List issues this PR fixes. GitHub will automatically close these issues when the PR is merged.
-
-```markdown
-## Fixes
-
- Fixes #123
- Fixes #456
-```
-
-Note: Use "Fixes #X" format (not "Closes" or "Resolves") for consistency. Each issue should be on its own line with "Fixes" to ensure GitHub auto-closes them.
-
-## Guidelines
-
- **Be concise** - Reviewers should understand the PR in 30 seconds
- **Focus on why** - The diff shows *what* changed, explain *why*
- **Skip empty sections** - Only include sections that have content
- **Use bullet points** - Easier to scan than paragraphs
- **Don't duplicate the diff** - Avoid listing every file or line changed
-
-## Example Output
-
-```markdown
-## Summary
-
- Added `/docstring` skill for documenting Python modules with Google-style docstrings
- Skill finds classes by name and handles conflicts when multiple matches exist
- Skips already-documented code to avoid unnecessary changes
-
-## Testing
-
-/docstring ClassName
-
-## Fixes
-
- Fixes #123
-```
-
-## Checklist
-
-Before updating the PR:
-
- [ ] Verified existing description needs updating (not already complete)
- [ ] Summary accurately reflects the changes
- [ ] Breaking changes are clearly documented (if any)
- [ ] No unnecessary sections included
- [ ] Description is concise and scannable
--- a/.github/workflows/coverage.yaml
+++ b/.github/workflows/coverage.yaml
@@ -33,14 +33,7 @@ jobs:

      - name: Install dependencies
        run: |
-          uv sync --group dev \
-            --extra anthropic \
-            --extra aws \
-            --extra google \
-            --extra langchain \
-            --extra livekit \
-            --extra piper \
-            --extra websocket
+          uv sync --group dev --extra anthropic --extra aws --extra google --extra langchain

      - name: Run tests with coverage
        run: |
--- a/.github/workflows/tests.yaml
+++ b/.github/workflows/tests.yaml
@@ -37,14 +37,7 @@ jobs:

      - name: Install dependencies
        run: |
-          uv sync --group dev \
-            --extra anthropic \
-            --extra aws \
-            --extra google \
-            --extra langchain \
-            --extra livekit \
-            --extra piper \
-            --extra websocket
+          uv sync --group dev --extra anthropic --extra aws --extra google --extra langchain

      - name: Test with pytest
        run: |
--- a/.gitignore
+++ b/.gitignore
@@ -4,14 +4,7 @@ __pycache__/
 *~
 venv
 .venv
-.idea
-.gradle
-.next
-next-env.d.ts
-local.properties
-*.log
-*.lock
-smart_turn_audio_log
+/.idea
 #*#

 # Distribution / Packaging
@@ -34,7 +27,7 @@ share/python-wheels/
 *.egg
 MANIFEST
 .DS_Store
-.env*
+.env
 fly.toml

 # Examples
@@ -58,7 +51,4 @@ docs/api/_build/
 docs/api/api

 # uv
-.python-version
-
-# Pipecat
-whisker_setup.py
+.python-version
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -7,916 +7,6 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

 <!-- towncrier release notes start -->

-## [0.0.101] - 2026-01-30
-
-### Added
-
- Additions for `AICFilter` and `AICVADAnalyzer`:
-    - Added model downloading support to `AICFilter` with `model_id` and
-      `model_download_dir` parameters.
-    - Added `model_path` parameter to `AICFilter` for loading local `.aicmodel`
-      files.
-    - Added unit tests for `AICFilter` and `AICVADAnalyzer`.
-  (PR [#3408](https://github.com/pipecat-ai/pipecat/pull/3408))
-
- Added handling for `server_content.interrupted` signal in the Gemini Live
-  service for faster interruption response in the case where there isn't
-  already turn tracking in the pipeline, e.g. local VAD + context aggregators.
-  When there is already turn tracking in the pipeline, the additional
-  interruption does no harm.
-  (PR [#3429](https://github.com/pipecat-ai/pipecat/pull/3429))
-
- Added new `GenesysFrameSerializer` for the Genesys AudioHook WebSocket
-  protocol, enabling bidirectional audio streaming between Pipecat pipelines
-  and Genesys Cloud contact center.
-  (PR [#3500](https://github.com/pipecat-ai/pipecat/pull/3500))
-
- Added `reached_upstream_types` and `reached_downstream_types` read-only
-  properties to `PipelineTask` for inspecting current frame filters.
-  (PR [#3510](https://github.com/pipecat-ai/pipecat/pull/3510))
-
- Added `add_reached_upstream_filter()` and `add_reached_downstream_filter()`
-  methods to `PipelineTask` for appending frame types.
-  (PR [#3510](https://github.com/pipecat-ai/pipecat/pull/3510))
-
- Added `UserTurnCompletionLLMServiceMixin` for LLM services to detect and
-  filter incomplete user turns. When enabled via `filter_incomplete_user_turns`
-  in `LLMUserAggregatorParams`, the LLM outputs a turn completion marker at the
-  start of each response: ✓ (complete), ○ (incomplete short), or ◐ (incomplete
-  long). Incomplete turns are suppressed, and configurable timeouts
-  automatically re-prompt the user.
-  (PR [#3518](https://github.com/pipecat-ai/pipecat/pull/3518))
-
- Added `FrameProcessor.broadcast_frame_instance(frame)` method to broadcast a
-  frame instance by extracting its fields and creating new instances for each
-  direction.
-  (PR [#3519](https://github.com/pipecat-ai/pipecat/pull/3519))
-
- `PipelineTask` now automatically adds `RTVIProcessor` and registers
-  `RTVIObserver` when `enable_rtvi=True` (default), simplifying pipeline setup.
-  (PR [#3519](https://github.com/pipecat-ai/pipecat/pull/3519))
-
- Added `RTVIProcessor.create_rtvi_observer()` factory method for creating RTVI
-  observers.
-  (PR [#3519](https://github.com/pipecat-ai/pipecat/pull/3519))
-
- Added `video_out_codec` parameter to `TransportParams` allowing configuration
-  of the preferred video codec (e.g., `"VP8"`, `"H264"`, `"H265"`) for video
-  output in `DailyTransport`.
-  (PR [#3520](https://github.com/pipecat-ai/pipecat/pull/3520))
-
- Added `location` parameter to Google TTS services (`GoogleHttpTTSService`,
-  `GoogleTTSService`, `GeminiTTSService`) for regional endpoint support.
-  (PR [#3523](https://github.com/pipecat-ai/pipecat/pull/3523))
-
- Added new `PIPECAT_SMART_TURN_LOG_DATA` environment variable, which causes
-  Smart Turn input data to be saved to disk
-  (PR [#3525](https://github.com/pipecat-ai/pipecat/pull/3525))
-
- Added `result_callback` parameter to `UserImageRequestFrame` to support
-  deferred function call results.
-  (PR [#3571](https://github.com/pipecat-ai/pipecat/pull/3571))
-
- Added `function_call_timeout_secs` parameter to `LLMService` to configure
-  timeout for deferred function calls (defaults to 10.0 seconds).
-  (PR [#3571](https://github.com/pipecat-ai/pipecat/pull/3571))
-
- Added `vad_analyzer` parameter to `LLMUserAggregatorParams`. VAD analysis is
-  now handled inside the `LLMUserAggregator` rather than in the transport,
-  keeping voice activity detection closer to where it is consumed. The
-  `vad_analyzer` on `BaseInputTransport` is now deprecated.
-
-    ```python
-    context_aggregator = LLMContextAggregatorPair(
-        context,
-        user_params=LLMUserAggregatorParams(
-            vad_analyzer=SileroVADAnalyzer(),
-        ),
-    )
-    ```
-  (PR [#3583](https://github.com/pipecat-ai/pipecat/pull/3583))
-
- Added `VADProcessor` for detecting speech in audio streams within a pipeline.
-  Pushes `VADUserStartedSpeakingFrame`, `VADUserStoppedSpeakingFrame`, and
-  `UserSpeakingFrame` downstream based on VAD state changes.
-  (PR [#3583](https://github.com/pipecat-ai/pipecat/pull/3583))
-
- Added `VADController` for managing voice activity detection state and
-  emitting speech events independently of transport or pipeline processors.
-  (PR [#3583](https://github.com/pipecat-ai/pipecat/pull/3583))
-
- Added local `PiperTTSService` for offline text-to-speech using Piper voice
-  models. The existing HTTP-based service has been renamed to
-  `PiperHttpTTSService`.
-  (PR [#3585](https://github.com/pipecat-ai/pipecat/pull/3585))
-
- `main()` in `pipecat.runner.run` now accepts an optional
-  `argparse.ArgumentParser`, allowing bots to define custom CLI arguments
-  accessible via `runner_args.cli_args`.
-  (PR [#3590](https://github.com/pipecat-ai/pipecat/pull/3590))
-
- Added `KokoroTTSService` for local text-to-speech synthesis using the
-  Kokoro-82M model.
-  (PR [#3595](https://github.com/pipecat-ai/pipecat/pull/3595))
-
-### Changed
-
- Updated `AICFilter` and `AICVADAnalyzer` to use aic-sdk ~= 2.0.1.
-  (PR [#3408](https://github.com/pipecat-ai/pipecat/pull/3408))
-
- Improved the STT TTFB (Time To First Byte) measurement, reporting the delay
-  between when the user stops speaking and when the final transcription is
-  received. Note: Unlike traditional TTFB which measures from a discrete
-  request, STT services receive continuous audio input—so we measure from
-  speech end to final transcript, which captures the latency that matters for
-  voice AI applications. In support of this change, added `finalized` field to
-  `TranscriptionFrame` to indicate when a transcript is the final result for an
-  utterance.
-  (PR [#3495](https://github.com/pipecat-ai/pipecat/pull/3495))
-
- `SarvamSTTService` now defaults `vad_signals` and `high_vad_sensitivity` to
-  `None` (omitted from connection parameters), improving latency by ~300ms
-  compared to the previous defaults.
-  (PR [#3495](https://github.com/pipecat-ai/pipecat/pull/3495))
-
- Changed frame filter storage from tuples to sets in `PipelineTask`.
-  (PR [#3510](https://github.com/pipecat-ai/pipecat/pull/3510))
-
- Changed default Inworld TTS model from `inworld-tts-1` to
-  `inworld-tts-1.5-max`.
-  (PR [#3531](https://github.com/pipecat-ai/pipecat/pull/3531))
-
- `FrameSerializer` now subclasses from `BaseObject` to enable event support.
-  (PR [#3560](https://github.com/pipecat-ai/pipecat/pull/3560))
-
- Added support for TTFS in `SpeechmaticsSTTService` and set the default mode
-  to `EXTERNAL` to support Pipecat-controlled VAD.
-  - Changed dependency to `speechmatics-voice[smart]>=0.2.8`
-  (PR [#3562](https://github.com/pipecat-ai/pipecat/pull/3562))
-
- ⚠️ Changed function call handling to use timeout-based completion instead of
-  immediate callback execution.
-    - Function calls that defer their results (e.g., `UserImageRequestFrame`)
-      now use a timeout mechanism
-    - The `result_callback` is invoked automatically when the deferred
-      operation completes or after timeout
-    - This change affects examples using `UserImageRequestFrame` - the
-      `result_callback` should now be passed to the frame instead of being called
-      immediately
-  (PR [#3571](https://github.com/pipecat-ai/pipecat/pull/3571))
-
- Pipecat runner now uses `DAILY_ROOM_URL` instead of `DAILY_SAMPLE_ROOM_URL`.
-  (PR [#3582](https://github.com/pipecat-ai/pipecat/pull/3582))
-
- Updates to `GradiumSTTService`:
-    - Now flushes pending transcriptions when VAD detects the user stopped
-      speaking, improving response latency.
-    - `GradiumSTTService` now supports `InputParams` for configuring `language`
-      and `delay_in_frames` settings.
-  (PR [#3587](https://github.com/pipecat-ai/pipecat/pull/3587))
-
-### Deprecated
-
- ⚠️ Deprecated `vad_analyzer` parameter on `BaseInputTransport`. Pass
-  `vad_analyzer` to `LLMUserAggregatorParams` instead or use `VADProcessor` in
-  the pipeline.
-  (PR [#3583](https://github.com/pipecat-ai/pipecat/pull/3583))
-
-### Removed
-
- Removed deprecated `AICFilter` parameters: `enhancement_level`, `voice_gain`,
-  `noise_gate_enable`.
-  (PR [#3408](https://github.com/pipecat-ai/pipecat/pull/3408))
-
-### Fixed
-
- Fixed an issue where if you were using `OpenRouterLLMService` with a Gemini
-  model, it wouldn't handle multiple `"system"` messages as expected (and as we
-  do in `GoogleLLMService`), which is to convert subsequent ones into `"user"`
-  messages. Instead, the latest `"system"` message would overwrite the previous
-  ones.
-  (PR [#3406](https://github.com/pipecat-ai/pipecat/pull/3406))
-
- Transports now properly broadcast `InputTransportMessageFrame` frames both
-  upstream and downstream instead of only pushing downstream.
-  (PR [#3519](https://github.com/pipecat-ai/pipecat/pull/3519))
-
- Fixed `FrameProcessor.broadcast_frame()` to deep copy kwargs, preventing
-  shared mutable references between the downstream and upstream frame
-  instances.
-  (PR [#3519](https://github.com/pipecat-ai/pipecat/pull/3519))
-
- Fixed OpenAI LLM services to emit `ErrorFrame` on completion timeout,
-  enabling proper error handling and LLMSwitcher failover.
-  (PR [#3529](https://github.com/pipecat-ai/pipecat/pull/3529))
-
- Fixed a logging issue where non-ASCII characters (e.g., Japanese, Chinese,
-  etc.) were being unnecessarily escaped to Unicode sequences when function
-  call occurred.
-  (PR [#3536](https://github.com/pipecat-ai/pipecat/pull/3536))
-
- Fixed how audio tracks are synchronized inside the `AudioBufferProcessor` to
-  fix timing issues where silence and audio were misaligned between user and
-  bot buffers.
-  (PR [#3541](https://github.com/pipecat-ai/pipecat/pull/3541))
-
- Fixed race condition in `OpenAIRealtimeBetaLLMService` that could cause an
-  error when truncating the conversation.
-  (PR [#3567](https://github.com/pipecat-ai/pipecat/pull/3567))
-
- Fixed an infinite loop in `WebsocketService` that blocked the event loop when
-  a remote server closed the connection gracefully.
-  (PR [#3574](https://github.com/pipecat-ai/pipecat/pull/3574))
-
- Fixed `LLMUserAggregator` and `LLMAssistantAggregator` not emitting pending
-  transcripts via `on_user_turn_stopped` and `on_assistant_turn_stopped` events
-  when the conversation ends (`EndFrame`) or is cancelled (`CancelFrame`).
-  (PR [#3575](https://github.com/pipecat-ai/pipecat/pull/3575))
-
- Added missing `LiveKitRunnerArguments` and `LiveKitTransport` support in
-  runner utilities to enable LiveKit transport configuration.
-  (PR [#3580](https://github.com/pipecat-ai/pipecat/pull/3580))
-
- Fixed race condition in `OpenAIRealtimeLLMService` that could cause an error
-  when truncating the conversation.
-  (PR [#3581](https://github.com/pipecat-ai/pipecat/pull/3581))
-
- Fixed `PiperHttpTTSService` (olf `PiperTTSService`) to resample audio output
-  based on the model's sample rate parsed from the WAV header.
-  (PR [#3585](https://github.com/pipecat-ai/pipecat/pull/3585))
-
- Fixed `UserTurnController` to reset user turn timeout when interim
-  transcriptions are received.
-  (PR [#3594](https://github.com/pipecat-ai/pipecat/pull/3594))
-
- Fixed an issue in the `IVRNavigator` where the `TextFrame`s pushed had
-  incorrect spacing. Now, the internal `IVRProcessor` pushes
-  `AggregatedTextFrame`s when in conversation mode. This allows for controlling
-  spacing of the outputted, aggregated text.
-  (PR [#3604](https://github.com/pipecat-ai/pipecat/pull/3604))
-
- Fixed `GeminiLiveLLMService` transcription timeout handler not being
-  scheduled by yielding to the event loop after task creation.
-  (PR [#3605](https://github.com/pipecat-ai/pipecat/pull/3605))
-
-## [0.0.100] - 2026-01-20
-
-### Added
-
- Added Hathora service to support Hathora-hosted TTS and STT models (only
-  non-streaming)
-  (PR [#3169](https://github.com/pipecat-ai/pipecat/pull/3169))
-
- Added `CambTTSService`, using Camb.ai's TTS integration with MARS models
-  (mars-flash, mars-pro, mars-instruct) for high-quality text-to-speech
-  synthesis.
-  (PR [#3349](https://github.com/pipecat-ai/pipecat/pull/3349))
-
- Added the `additional_headers` param to `WebsocketClientParams`, allowing
-  `WebsocketClientTransport` to send custom headers on connect, for cases such
-  as authentication.
-  (PR [#3461](https://github.com/pipecat-ai/pipecat/pull/3461))
-
- Added `UserIdleController` for detecting user idle state, integrated into
-  `LLMUserAggregator` and `UserTurnProcessor` via optional `user_idle_timeout`
-  parameter. Emits `on_user_turn_idle` event for application-level handling.
-  Deprecated `UserIdleProcessor` in favor of the new compositional approach.
-  (PR [#3482](https://github.com/pipecat-ai/pipecat/pull/3482))
-
- Added `on_user_mute_started` and `on_user_mute_stopped` event handlers to
-  `LLMUserAggregator` for tracking user mute state changes.
-  (PR [#3490](https://github.com/pipecat-ai/pipecat/pull/3490))
-
-### Changed
-
- Enhanced interruption handling in `AsyncAITTSService` by supporting
-  multi-context WebSocket sessions for more robust context management.
-  (PR [#3287](https://github.com/pipecat-ai/pipecat/pull/3287))
-
- Throttle `UserSpeakingFrame` to broadcast at most every 200ms instead of on
-  every audio chunk, reducing frame processing overhead during user speech.
-  (PR [#3483](https://github.com/pipecat-ai/pipecat/pull/3483))
-
-### Deprecated
-
- For consistency with other package names, we just deprecated
-  `pipecat.turns.mute` (introduced in Pipecat 0.0.99) in favor of
-  `pipecat.turns.user_mute`.
-  (PR [#3479](https://github.com/pipecat-ai/pipecat/pull/3479))
-
-### Fixed
-
- Corrected TTFB metric calculation in `AsyncAIHttpTTSService`.
-  (PR [#3287](https://github.com/pipecat-ai/pipecat/pull/3287))
-
- Fixed an issue where the "bot-llm-text" RTVI event would not fire for
-  realtime (speech-to-speech) services:
-
-    - `AWSNovaSonicLLMService`
-    - `GeminiLiveLLMService`
-    - `OpenAIRealtimeLLMService`
-    - `GrokRealtimeLLMService`
-
-  The issue was that these services weren't pushing `LLMTextFrame`s. Now
-  they  do.
-  (PR [#3446](https://github.com/pipecat-ai/pipecat/pull/3446))
-
- Fixed an issue where `on_user_turn_stop_timeout` could fire while a user is
-  talking when using `ExternalUserTurnStrategies`.
-  (PR [#3454](https://github.com/pipecat-ai/pipecat/pull/3454))
-
- Fixed an issue where user turn start strategies were not being reset after a
-  user turn started, causing incorrect strategy behavior.
-  (PR [#3455](https://github.com/pipecat-ai/pipecat/pull/3455))
-
- Fixed `MinWordsUserTurnStartStrategy` to not aggregate transcriptions,
-  preventing incorrect turn starts when words are spoken with pauses between
-  them.
-  (PR [#3462](https://github.com/pipecat-ai/pipecat/pull/3462))
-
- Fixed an issue where Grok Realtime would error out when running with
-  SmallWebRTC transport.
-  (PR [#3480](https://github.com/pipecat-ai/pipecat/pull/3480))
-
- Fixed a `Mem0MemoryService` issue where passing `async_mode: true` was
-  causing an error. See
-  https://docs.mem0.ai/platform/features/async-mode-default-change.
-  (PR [#3484](https://github.com/pipecat-ai/pipecat/pull/3484))
-
- Fixed `AWSNovaSonicLLMService.reset_conversation()`, which would previously
-  error out. Now it successfully reconnects and "rehydrates" from the context
-  object.
-  (PR [#3486](https://github.com/pipecat-ai/pipecat/pull/3486))
-
- Fixed `AzureTTSService` transcript formatting issues:
-    - Punctuation now appears without extra spaces (e.g., "Hello!" instead of
-      "Hello !")
-    - CJK languages (Chinese, Japanese, Korean) no longer have unwanted spaces
-      between characters
-  (PR [#3489](https://github.com/pipecat-ai/pipecat/pull/3489))
-
- Fixed an issue where `UninterruptibleFrame` frames would not be preserved in
-  some cases.
-  (PR [#3494](https://github.com/pipecat-ai/pipecat/pull/3494))
-
- Fixed memory leak in `LiveKitTransport` when `video_in_enabled` is `False`.
-  (PR [#3499](https://github.com/pipecat-ai/pipecat/pull/3499))
-
- Fixed an issue in `AIService` where unhandled exceptions in `start()`,
-  `stop()`, or `cancel()` implementations would prevent `process_frame()` to
-  continue and therefore `StartFrame`, `EndFrame`, or `CancelFrame` from being
-  pushed downstream, causing the pipeline to not start or stop properly.
-  (PR [#3503](https://github.com/pipecat-ai/pipecat/pull/3503))
-
- Moved `NVIDIATTSService` and `NVIDIASTTService` client initialization from
-  constructor to `start()` for better error handling.
-  (PR [#3504](https://github.com/pipecat-ai/pipecat/pull/3504))
-
- Optimized `NVIDIATTSService` to process incoming audio frames immediately.
-  (PR [#3509](https://github.com/pipecat-ai/pipecat/pull/3509))
-
- Optimized `NVIDIASTTService` by removing unnecessary queue and task.
-  (PR [#3509](https://github.com/pipecat-ai/pipecat/pull/3509))
-
- Fixed a `CambTTSService` issue where client was being initialized in the
-  constructor which wouldn't allow for proper Pipeline error handling.
-  (PR [#3511](https://github.com/pipecat-ai/pipecat/pull/3511))
-
-## [0.0.99] - 2026-01-13
-
-### Added
-
- Introducing user turn strategies. User turn strategies indicate when the user
-  turn starts or stops. In conversational agents, these are often referred to
-  as start/stop speaking or turn-taking plans or policies.
-
-  User turn start strategies indicate when the user starts speaking (e.g.
-  using VAD events or when a user says one or more words).
-
-  User turn stop strategies indicate when the user stops speaking (e.g. using
-  an end-of-turn detection model or by observing incoming transcriptions).
-
-  A list of strategies can be specified for both strategies; strategies are
-  evaluated in order until one evaluates to true.
-
-  Available user turn start strategies:
-
-  - VADUserTurnStartStrategy
-  - TranscriptionUserTurnStartStrategy
-  - MinWordsUserTurnStartStrategy
-  - ExternalUserTurnStartStrategy
-
-  Available user turn stop strategies:
-
-  - TranscriptionUserTurnStopStrategy
-  - TurnAnalyzerUserTurnStopStrategy
-  - ExternalUserTurnStopStrategy
-
-  The default strategies are:
-
-  - start: [VADUserTurnStartStrategy, TranscriptionUserTurnStartStrategy]
-  - stop: [TranscriptionUserTurnStopStrategy]
-
-  Turn strategies are configured when setting up `LLMContextAggregatorPair`.
-  For example:
-
-  ```python
-  context_aggregator = LLMContextAggregatorPair(
-      context,
-      user_params=LLMUserAggregatorParams(
-          user_turn_strategies=UserTurnStrategies(
-              stop=[
-                  TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3(params=SmartTurnParams())
-                  )
-              ],
-          )
-      ),
-  )
-  ```
-
-  In order to use the user turn strategies you must update to the new
-  universal `LLMContext` and `LLMContextAggregatorPair`.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- Added `RNNoiseFilter` for real-time noise suppression using RNNoise neural
-  network via pyrnnoise library.
-  (PR [#3205](https://github.com/pipecat-ai/pipecat/pull/3205))
-
- Added `GrokRealtimeLLMService` for xAI's Grok Voice Agent API with real-time
-  voice conversations:
-
-  - Support for real-time audio streaming with WebSocket connection
-  - Built-in server-side VAD (Voice Activity Detection)
-  - Multiple voice options: Ara, Rex, Sal, Eve, Leo
-  - Built-in tools support: web_search, x_search, file_search
-  - Custom function calling with standard Pipecat tools schema
-  - Configurable audio formats (PCM at 8kHz-48kHz)
-    (PR [#3267](https://github.com/pipecat-ai/pipecat/pull/3267))
-
- Added an approximation of TTFB for Ultravox.
-  (PR [#3268](https://github.com/pipecat-ai/pipecat/pull/3268))
-
- Added a new `AudioContextTTSService` to the TTS service base classes. The
-  `AudioContextWordTTSService` now inherits from `AudioContextTTSService` and
-  `WebsocketWordTTSService`.
-  (PR [#3289](https://github.com/pipecat-ai/pipecat/pull/3289))
-
- `LLMUserAggregator` now exposes the following events:
-
-  - `on_user_turn_started`: triggered when a user turn starts
-  - `on_user_turn_stopped`: triggered when a user turn ends
-  - `on_user_turn_stop_timeout`: triggered when a user turn does not stop
-    and times out
-    (PR [#3291](https://github.com/pipecat-ai/pipecat/pull/3291))
-
- Introducing user mute strategies. User mute strategies indicate when user
-  input should be muted based on the current system state.
-
-  In conversational agents, user mute strategies are used to prevent user
-  input from interrupting bot speech, tool execution, or other critical system
-  operations.
-
-  A list of strategies can be specified; all strategies are evaluated for
-  every frame so that each strategy can maintain its internal state. A user
-  frame is muted if any of the configured strategies indicates it should be
-  muted.
-
-  Available user mute strategies:
-
-  - `FirstSpeechUserMuteStrategy`
-  - `MuteUntilFirstBotCompleteUserMuteStrategy`
-  - `AlwaysUserMuteStrategy`
-  - `FunctionCallUserMuteStrategy`
-
-  User mute strategies replace the legacy `STTMuteFilter` and provide a more
-  flexible and composable approach to muting user input.
-
-  User mute strategies are configured when setting up the
-  `LLMContextAggregatorPair`. For example:
-
-  ```python
-  context_aggregator = LLMContextAggregatorPair(
-      context,
-      user_params=LLMUserAggregatorParams(
-          user_mute_strategies=[
-              FirstSpeechUserMuteStrategy(),
-          ]
-      ),
-  )
-  ```
-
-  In order to use user mute strategies you should update to the new universal
-  `LLMContext` and `LLMContextAggregatorPair`.
-  (PR [#3292](https://github.com/pipecat-ai/pipecat/pull/3292))
-
- Added `use_ssl` parameter to `NvidiaSTTService`, `NvidiaSegmentedSTTService`
-  and `NvidiaTTSService`.
-  (PR [#3300](https://github.com/pipecat-ai/pipecat/pull/3300))
-
- Added `enable_interruptions` constructor argument to all user turn
-  strategies. This tells the `LLMUserAggregator` to push or not push an
-  `InterruptionFrame`.
-  (PR [#3316](https://github.com/pipecat-ai/pipecat/pull/3316))
-
- Added `split_sentences` parameter to `SpeechmaticsSTTService` to control
-  sentence splitting behavior for finals on sentence boundaries.
-  (PR [#3328](https://github.com/pipecat-ai/pipecat/pull/3328))
-
- Added word-level timestamp support to `AzureTTSService` for accurate
-  text-to-audio synchronization.
-  (PR [#3334](https://github.com/pipecat-ai/pipecat/pull/3334))
-
- Added `pronunciation_dict_id` parameter to `CartesiaTTSService.InputParams`
-  and `CartesiaHttpTTSService.InputParams` to support Cartesia's pronunciation
-  dictionary feature for custom pronunciations.
-  (PR [#3346](https://github.com/pipecat-ai/pipecat/pull/3346))
-
- Added support for using the HeyGen LiveAvatar API with the `HeyGenTransport`
-  (see https://www.liveavatar.com/).
-  (PR [#3357](https://github.com/pipecat-ai/pipecat/pull/3357))
-
- Added image support to `OpenAIRealtimeLLMService` via `InputImageRawFrame`:
-
-  - New `start_video_paused` parameter to control initial video input state
-  - New `video_frame_detail` parameter to set image processing quality
-    ("auto",
-    "low", or "high"). This corresponds to OpenAI Realtime's `image_detail`
-    parameter.
-  - `set_video_input_paused()` method to pause/resume video input at runtime
-  - `set_video_frame_detail()` method to adjust video frame quality
-    dynamically
-  - Automatic rate limiting (1 frame per second) to prevent API overload
-    (PR [#3360](https://github.com/pipecat-ai/pipecat/pull/3360))
-
- Added `UserTurnProcessor`, a frame processor built on `UserTurnController`
-  that pushes `UserStartedSpeakingFrame` and `UserStoppedSpeakingFrame` frames
-  and interruptions based on the controller's user turn strategies.
-  (PR [#3372](https://github.com/pipecat-ai/pipecat/pull/3372))
-
- Added `UserTurnController` to manage user turns. It emits
-  `on_user_turn_started`, `on_user_turn_stopped`, and
-  `on_user_turn_stop_timeout` events, and can be integrated into processors to
-  detect and handle user turns. `LLMUserAggregator` and `UserTurnProcessor` are
-  implemented using this controller.
-  (PR [#3372](https://github.com/pipecat-ai/pipecat/pull/3372))
-
- Added `should_interrupt` property to `DeepgramFluxSTTService`,
-  `DeepgramSTTService`, and `SpeechmaticsSTTService` to configure whether the
-  bot should be interrupted when the external service detects user speech.
-  (PR [#3374](https://github.com/pipecat-ai/pipecat/pull/3374))
-
- `LLMAssistantAggregator` now exposes the following events:
-
-  - `on_assistant_turn_started`: triggered when the assistant turn starts
-  - `on_assistant_turn_stopped`: triggered when the assistant turn ends
-  - `on_assistant_thought`: triggered when there's an assistant thought
-    available
-    (PR [#3385](https://github.com/pipecat-ai/pipecat/pull/3385))
-
- Added `KrispVivaTurn` analyzer for end of turn detection using the Krisp VIVA
-  SDK (requires `krisp_audio`).
-  (PR [#3391](https://github.com/pipecat-ai/pipecat/pull/3391))
-
- Added support for setting up a pipeline task from external files. You can now
-  register custom pipeline task setup files by setting the
-  `PIPECAT_SETUP_FILES` environment variable. This variable should contain a
-  colon-separated list of Python files (e.g. `export
-PIPECAT_SETUP_FILES="setup1.py:setup.py:..."`). Each file must define a
-  function with the following signature:
-
-  ```python
-  async def setup_pipeline_task(task: PipelineTask):
-      ...
-  ```
-
-  (PR [#3397](https://github.com/pipecat-ai/pipecat/pull/3397))
-
- Added a keepalive task for `InworldTTSService` to keep the service connected
-  in the event of no generations for longer periods of time.
-  (PR [#3403](https://github.com/pipecat-ai/pipecat/pull/3403))
-
- Added `enable_vad` to `Params` for use in the `GladiaSTTService`. When
-  enabled, `GladiaSTTService` acts as the turn controller, emitting
-  `UserStartedSpeakingFrame`, `UserStoppedSpeakingFrame`, and optionally
-  `InterruptionFrame`.
-  (PR [#3404](https://github.com/pipecat-ai/pipecat/pull/3404))
-
- Added `should_interrupt` property to `GladiaSTTService` to configure whether
-  the bot should be interrupted when the external service detects user speech.
-  (PR [#3404](https://github.com/pipecat-ai/pipecat/pull/3404))
-
- Added `VonageFrameSerializer` for the Vonage Video API Audio Connector
-  WebSocket protocol.
-  (PR [#3410](https://github.com/pipecat-ai/pipecat/pull/3410))
-
- Added `append_trailing_space` parameter to `TTSService` to automatically
-  append a trailing space to text before sending to TTS, helping prevent some
-  services from vocalizing trailing punctuation.
-  (PR [#3424](https://github.com/pipecat-ai/pipecat/pull/3424))
-
-### Changed
-
- Updated `ElevenLabsRealtimeSTTService` to accept the
-  `include_language_detection` parameter to detect language.
-
-  ```python
-    stt = ElevenLabsRealtimeSTTService(
-        api_key=os.getenv("ELEVENLABS_API_KEY"),
-        include_language_detection=True
-    )
-  ```
-
-  (PR [#3216](https://github.com/pipecat-ai/pipecat/pull/3216))
-
- Updated `SpeechmaticsSTTService` to use new Python Voice SDK with improved
-  VAD, Smart Turn capabilities, and brings dramatic improvements to latency
-  without any impact on accuracy. Use the `turn_detection_mode` parameter to control
-  the endpointing of speech, with `TurnDetectionMode.EXTERNAL` (default),
-  `TurnDetectionMode.ADAPTIVE`, or `TurnDetectionMode.SMART_TURN`.
-
-  ```python
-      stt = SpeechmaticsSTTService(
-          api_key=os.getenv("SPEECHMATICS_API_KEY"),
-          params=SpeechmaticsSTTService.InputParams(
-              language=Language.EN,
-              turn_detection_mode=SpeechmaticsSTTService.TurnDetectionMode.ADAPTIVE,
-              speaker_active_format="<{speaker_id}>{text}</{speaker_id}>",
-          ),
-      )
-  ```
-
-  (PR [#3225](https://github.com/pipecat-ai/pipecat/pull/3225))
-
- `daily-python` updated to 0.23.0.
-  (PR [#3257](https://github.com/pipecat-ai/pipecat/pull/3257))
-
- `TranscriptionFrame` and `InterimTranscriptionFrame` produced by
-  `DailyTransport` now include the transport source (i.e., the originating
-  audio track).
-  (PR [#3257](https://github.com/pipecat-ai/pipecat/pull/3257))
-
- Updates to Inworld TTS services:
-
-  - Improved `InworldTTSService`'s websocket implementation to better flush
-    and close context to better handle long inputs.
-  - Improved docstrings for `InworldTTSService` and `InworldHttpTTSService`.
-    (PR [#3288](https://github.com/pipecat-ai/pipecat/pull/3288))
-
- Improved the error handling and reconnection logic for `WebsocketServer` by
-  distinguishing between errors when disconnecting and websocket communication
-  errors.
-  (PR [#3392](https://github.com/pipecat-ai/pipecat/pull/3392))
-
- Updated `DeepgramSTTService` to push user started/stopped speaking and
-  interruption frames when `vad_enabled` is set to true. This centralizes the
-  frames into the service, removing the need to have your application code
-  handle Deepgram's events and push these frames.
-  (PR [#3314](https://github.com/pipecat-ai/pipecat/pull/3314))
-
- Added encoding validation to `DeepgramTTSService` to prevent unsupported
-  encodings from reaching the API. The service now raises `ValueError` at
-  initialization with a clear error message.
-  (PR [#3329](https://github.com/pipecat-ai/pipecat/pull/3329))
-
- Updated `read_audio_frame` & `read_video_frame` methods in
-  `SmallWebRTCClient` to check if the track is enabled before logging a
-  warning.
-  (PR [#3336](https://github.com/pipecat-ai/pipecat/pull/3336))
-
- Updated `CartesiaTTSService` to support setting `language=None`, resulting in
-  Cartesia auto-detecting the language of the conversation.
-  (PR [#3366](https://github.com/pipecat-ai/pipecat/pull/3366))
-
- The bundled Smart Turn weights are now updated to v3.2, which has better
-  handling of short utterances, and is more robust against background noise.
-  (PR [#3367](https://github.com/pipecat-ai/pipecat/pull/3367))
-
- Updated `SpeechmaticsSTTService` dependency to `speechmatics-voice[smart]>=0.2.6`
-  (PR [#3371](https://github.com/pipecat-ai/pipecat/pull/3371))
-
- Smart Turn now takes into account `vad_start_seconds` when buffering audio,
-  meaning that the start of the turn audio is not cut off. This improves
-  accuracy for short utterances.
-
- The default value of `pre_speech_ms` is now set to 500ms for Smart Turn.
-  (PR [#3377](https://github.com/pipecat-ai/pipecat/pull/3377))
-
- Improved Krisp SDK management to allow `KrispVivaTurn` and `KrispVivaFilter`
-  to share a single SDK instance within the same process.
-  (PR [#3391](https://github.com/pipecat-ai/pipecat/pull/3391))
-
- Updated default model for `GroqTTSService` to `canopylabs/orpheus-v1-english`
-  and voice ID to `autumn`.
-  (PR [#3399](https://github.com/pipecat-ai/pipecat/pull/3399))
-
- Enhanced `FastAPIWebsocketTransport` with optional protocol-level audio
-  packetization via the `fixed_audio_packet_size` parameter to support media
-  endpoints requiring strict framing and real-time pacing.
-  (PR [#3410](https://github.com/pipecat-ai/pipecat/pull/3410))
-
- `DeepgramTTSService` and `RimeTTSService` now set `append_trailing_space` to
-  `True` to prevent punctuation (e.g., “dot”) from being pronounced.
-  (PR [#3424](https://github.com/pipecat-ai/pipecat/pull/3424))
-
- Updated `GeminiLiveLLMService` to push `LLMThoughtStartFrame`,
-  `LLMThoughtTextFrame`, and `LLMThoughtEndFrame` when the model returns
-  thought content.
-  (PR [#3431](https://github.com/pipecat-ai/pipecat/pull/3431))
-
-### Deprecated
-
- `pipecat.audio.interruptions.MinWordsInterruptionStrategy` is deprecated. Use
-  `pipecat.turns.user_start.MinWordsUserTurnStartStrategy` with
-  `LLMUserAggregator`'s new `user_turn_strategies` parameter instead.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- `FrameProcessor.interruption_strategies` is deprecated, use
-  `LLMUserAggregator`'s new `user_turn_strategies` parameter instead.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- The `LLMUserAggregatorParams` and `LLMAssistantAggregatorParams` classes in
-  `pipecat.processors.aggregators.llm_response` are now deprecated. Use the new
-  universal `LLMContext` and `LLMContextAggregatorPair` instead.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- Deprecated the `emulated` field in the `UserStartedSpeakingFrame` and
-  `UserStoppedSpeakingFrame` frames.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- `EmulateUserStartedSpeakingFrame` and `EmulateUserStoppedSpeakingFrame`
-  frames are deprecated.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- ⚠️ `TransportParams.turn_analyzer` is deprecated and might result in
-  unexpected behavior, use `LLMUserAggregator`'s new `user_turn_strategies`
-  parameter instead.
-  (PR [#3045](https://github.com/pipecat-ai/pipecat/pull/3045))
-
- For `SpeechmaticsSTTService`, the `end_of_utterance_mode` parameter is
-  deprecated. Use the new `turn_detection_mode` parameter instead, with
-  `TurnDetectionMode.EXTERNAL`,`TurnDetectionMode.ADAPTIVE`, or
-  `TurnDetectionMode.SMART_TURN`. The `enable_vad` parameter is also
-  deprecated and is inferred from the `turn_detection_mode`.
-  (PR [#3225](https://github.com/pipecat-ai/pipecat/pull/3225))
-
- `OpenAILLMContext` and its associated things (context aggregators, etc.) are
-  now deprecated in favor of the universal `LLMContext` and its associated
-  things.
-
-  From the developer's point of view, switching to using `LLMContext`
-  machinery will usually be a matter of going from this:
-
-  ```python
-  context = OpenAILLMContext(messages, tools)
-  context_aggregator = llm.create_context_aggregator(context)
-  ```
-
-  To this:
-
-  ```
-  context = LLMContext(messages, tools)
-  context_aggregator = LLMContextAggregatorPair(context)
-  ```
-
-  (PR [#3263](https://github.com/pipecat-ai/pipecat/pull/3263))
-
- `STTMuteFilter` is deprecated and will be removed in a future version. Use
-  `LLMUserAggregator`'s new `user_mute_strategies` instead.
-  (PR [#3292](https://github.com/pipecat-ai/pipecat/pull/3292))
-
- `FrameProcessor.interruptions_allowed` is now deprecated, use
-  `LLMUserAggregator`'s new parameter `user_mute_strategies` instead.
-  (PR [#3297](https://github.com/pipecat-ai/pipecat/pull/3297))
-
- `PipelineParams.allow_interruptions` is now deprecated, use
-  `LLMUserAggregator`'s new parameter `user_turn_strategies` instead. For
-  example, to disable interruptions but still get user turns you can do:
-
-  ```python
-  context_aggregator = LLMContextAggregatorPair(
-      context,
-      user_params=LLMUserAggregatorParams(
-          user_turn_strategies=UserTurnStrategies(
-              start=[TranscriptionUserTurnStartStrategy(enable_interruptions=False)],
-          ),
-      ),
-  )
-  ```
-
-  (PR [#3297](https://github.com/pipecat-ai/pipecat/pull/3297))
-
- `TranscriptProcessor` and related data classes and frames
-  (`TranscriptionMessage`, `ThoughtTranscriptionMessage`,
-  `TranscriptionUpdateFrame`) are deprecated. Use `LLMUserAggregator`'s and
-  `LLMAssistantAggregator`'s new events (`on_user_turn_stopped` and
-  `on_assistant_turn_stopped`) instead.
-  (PR [#3385](https://github.com/pipecat-ai/pipecat/pull/3385))
-
- Deprecated support for the `vad_events` `LiveOptions` in
-  `DeepgramSTTService`. Instead, use a local Silero VAD for VAD events.
-  Additionally, deprecated `should_interrupt` which will be removed along with
-  `vad_events` support in a future release.
-  (PR [#3386](https://github.com/pipecat-ai/pipecat/pull/3386))
-
- Loading external observers from files is deprecated, use the new pipeline
-  task setup files and `PIPECAT_SETUP_FILES` environment variable instead.
-  (PR [#3397](https://github.com/pipecat-ai/pipecat/pull/3397))
-
-### Fixed
-
- Improved error handling in `ElevenLabsRealtimeSTTService`
-  (PR [#3233](https://github.com/pipecat-ai/pipecat/pull/3233))
-
- Fixed an issue in `ElevenLabsRealtimeSTTService` causing an infinite loop
-  that blocks the process if the websocket disconnects due to an error
-  (PR [#3233](https://github.com/pipecat-ai/pipecat/pull/3233))
-
- Fixed a bug in `STTMuteFilter` where the user was not always muted during
-  function calls, especially when there were multiple simultaneous calls.
-  (PR [#3292](https://github.com/pipecat-ai/pipecat/pull/3292))
-
- Fixed a `RNNoiseFilter` issue that would cause a "[Errno 12] Cannot allocate
-  memory" error when processing silence audio frames.
-  (PR [#3322](https://github.com/pipecat-ai/pipecat/pull/3322))
-
- Updated `SpeechmaticsSTTService` for version `0.0.99+`:
-
-  - Fixed `SpeechmaticsSTTService` to listen for `VADUserStoppedSpeakingFrame`
-    in order to finalize transcription.
-  - Default to `TurnDetectionMode.FIXED` for Pipecat-controlled end of turn
-    detection.
-  - Only emit VAD + interruption frames if VAD is enabled within the plugin
-    (modes other than `TurnDetectionMode.FIXED` or `TurnDetectionMode.EXTERNAL`).
-    (PR [#3328](https://github.com/pipecat-ai/pipecat/pull/3328))
-
- Fixed an issue with function calling where a handler failing to invoke its
-  result callback could leave the context stuck in IN_PROGRESS, causing LLM
-  inference for subsequent function call results to block while waiting on the
-  unresolved call.
-  (PR [#3343](https://github.com/pipecat-ai/pipecat/pull/3343))
-
- Fixed an issue with DeepgramTTSService where the model would output "Dot"
-  instead of a period in some circumstances.
-  (PR [#3345](https://github.com/pipecat-ai/pipecat/pull/3345))
-
- Fixed an issue in `traced_stt` where `model_name` in OpenTelemetry appears as
-  `unknown`.
-  (PR [#3351](https://github.com/pipecat-ai/pipecat/pull/3351))
-
- Fixed an issue in GeminiLiveLLMService where TranscriptionFrames were
-  occasionally not pushed.
-  (PR [#3356](https://github.com/pipecat-ai/pipecat/pull/3356))
-
- Fixed potential memory leaks and initialization issues in `KrispVivaFilter`
-  by improving SDK lifecycle management.
-  (PR [#3391](https://github.com/pipecat-ai/pipecat/pull/3391))
-
- Fixed timing issue in `BaseOutputTransport` where the bot speaking flag was
-  set after awaiting, allowing the event loop to re-enter the method before the
-  guard was set.
-  (PR [#3400](https://github.com/pipecat-ai/pipecat/pull/3400))
-
- Fixed parallel function calling when using Gemini thinking.
-  (PR [3420](https://github.com/pipecat-ai/pipecat/pull/3420))
-
- Fixed an issue in `traced_llm` where `model_name` in OpenTelemetry appears as
-  `unknown`.
-  (PR [#3422](https://github.com/pipecat-ai/pipecat/pull/3422))
-
- Fixed an issue in `traced_tts`, `traced_gemini_live`, and
-  `traced_openai_realtime` where `model_name` in OpenTelemetry appears as
-  `unknown`.
-  (PR [#3428](https://github.com/pipecat-ai/pipecat/pull/3428))
-
- Fixed `request_image_frame` (for backwards compatibility) and restored
-  function-call–related fields in `UserImageRequestFrame` and
-  `UserImageRawFrame`, preventing a case where adding a non-LLM message to the
-  context could trigger duplicate LLM inferences (on image arrival and on
-  function-call result), potentially causing an infinite inference loop.
-  (PR [#3430](https://github.com/pipecat-ai/pipecat/pull/3430))
-
- Fixed `LLMContext.create_audio_message()` by correcting an internal helper
-  that was incorrectly declared async while being run in `asyncio.to_thread()`.
-  (PR [#3435](https://github.com/pipecat-ai/pipecat/pull/3435))
-
-### Other
-
- Added `52-live-transcription.py` foundational example demonstrating live
-  transcription and translation from English to Spanish. In this example, the
-  bot is not interruptible: as the user continues speaking, English
-  transcriptions are queued, and the bot continuously translates and speaks
-  each queued sentence in Spanish without being interrupted by new user speech.
-  (PR [#3316](https://github.com/pipecat-ai/pipecat/pull/3316))
-
- Added a new foundational example `53-concurrent-llm-evaluation.py` that shows
-  how to use `UserTurnProcessor`.
-  (PR [#3372](https://github.com/pipecat-ai/pipecat/pull/3372))
-
- Added a new foundational example `28-user-assistant-turns.py` that shows how
-  to use the new `LLMUserAggregator` and `LLMAssistantAggregator` events to
-  gather a conversation transcript.
-  (PR [#3385](https://github.com/pipecat-ai/pipecat/pull/3385))
-
 ## [0.0.98] - 2025-12-17

 ### Added
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -1,143 +0,0 @@
-# CLAUDE.md
-
-This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
-
-## Project Overview
-
-Pipecat is an open-source Python framework for building real-time voice and multimodal conversational AI agents. It orchestrates audio/video, AI services, transports, and conversation pipelines using a frame-based architecture.
-
-## Common Commands
-
-```bash
-# Setup development environment
-uv sync --group dev --all-extras --no-extra gstreamer --no-extra krisp
-
-# Install pre-commit hooks
-uv run pre-commit install
-
-# Run all tests
-uv run pytest
-
-# Run a single test file
-uv run pytest tests/test_name.py
-
-# Run a specific test
-uv run pytest tests/test_name.py::test_function_name
-
-# Preview changelog
-towncrier build --draft --version Unreleased
-
-# Lint and format check
-uv run ruff check
-uv run ruff format --check
-
-# Update dependencies (after editing pyproject.toml)
-uv lock && uv sync
-```
-
-## Architecture
-
-### Frame-Based Pipeline Processing
-
-All data flows as **Frame** objects through a pipeline of **FrameProcessors**:
-
-```
-Transport Input → Pipeline Source → [Processor1] → [Processor2] → ... → Pipeline Sink → Transport Output
-```
-
-**Key components:**
-
- **Frames** (`src/pipecat/frames/frames.py`): Data units (audio, text, video) and control signals. Flow DOWNSTREAM (input→output) or UPSTREAM (acknowledgments/errors).
-
- **FrameProcessor** (`src/pipecat/processors/frame_processor.py`): Base processing unit. Each processor receives frames, processes them, and pushes results downstream.
-
- **Pipeline** (`src/pipecat/pipeline/pipeline.py`): Chains processors together.
-
- **ParallelPipeline** (`src/pipecat/pipeline/parallel_pipeline.py`): Runs multiple pipelines in parallel.
-
- **Transports** (`src/pipecat/transports/`): External I/O layer (Daily WebRTC, LiveKit WebRTC, WebSocket, Local). Abstract interface via `BaseTransport`.
-
- **Services** (`src/pipecat/services/`): 60+ AI provider integrations (STT, TTS, LLM, etc.). Extend base classes: `AIService`, `LLMService`, `STTService`, `TTSService`, `VisionService`.
-
- **Serializers** (`src/pipecat/serializers/`): Convert frames to/from wire formats for WebSocket transports. `FrameSerializer` base class defines `serialize()` and `deserialize()`. Telephony serializers (Twilio, Plivo, Vonage, Telnyx, Exotel, Genesys) handle provider-specific protocols and audio encoding (e.g., μ-law).
-
- **RTVI** (`src/pipecat/processors/frameworks/rtvi.py`): Real-Time Voice Interface protocol bridging clients and the pipeline. `RTVIProcessor` handles incoming client messages (text input, audio, function call results). `RTVIObserver` converts pipeline frames to outgoing messages: user/bot speaking events, transcriptions, LLM/TTS lifecycle, function calls, metrics, and audio levels.
-
-### Important Patterns
-
- **Context Aggregation**: `LLMContext` accumulates messages for LLM calls; `UserResponse` aggregates user input
-
- **Turn Management**: Turn management is done through `LLMUserAggregator` and
-`LLMAssistantAggregator`, created with `LLMContextAggregatorPair`
-
- **User turn strategies**: Detection of when the user starts and stops speaking is done via user turn start/stop strategies. They push `UserStartedSpeakingFrame` and `UserStoppedSpeakingFrame` respectively.
-
- **Interruptions**: Interruptions are usually triggered by a user turn start strategy (e.g. `VADUserTurnStartStrategy`) but they can be triggered by other processors as well, in which case the user turn start strategies don't need to. An `InterruptionFrame` carries an optional `asyncio.Event` that is set when the frame reaches the pipeline sink. If a processor stops an `InterruptionFrame` from propagating downstream (i.e., doesn't push it), it **must** call `frame.complete()` to avoid stalling `push_interruption_task_frame_and_wait()` callers.
-
- **Uninterruptible Frames**: These are frames that will not be removed from internal queues even if there's an interruption. For example, `EndFrame` and `StopFrame`.
-
- **Events**: Most classes in Pipecat have `BaseObject` as the very base class. `BaseObject` has support for events. Events can run in the background in an async task (default) or synchronously (`sync=True`) if we want immediate action. Synchronous event handlers need to exectue fast.
-
-### Key Directories
-
-| Directory                 | Purpose                                            |
-|---------------------------|----------------------------------------------------|
-| `src/pipecat/frames/`     | Frame definitions (100+ types)                     |
-| `src/pipecat/processors/` | FrameProcessor base + aggregators, filters, audio  |
-| `src/pipecat/pipeline/`   | Pipeline orchestration                             |
-| `src/pipecat/services/`   | AI service integrations (60+ providers)            |
-| `src/pipecat/transports/` | Transport layer (Daily, LiveKit, WebSocket, Local) |
-| `src/pipecat/serializers/`| Frame serialization for WebSocket protocols        |
-| `src/pipecat/audio/`      | VAD, filters, mixers, turn detection, DTMF         |
-| `src/pipecat/turns/`      | User turn management                               |
-
-## Code Style
-
- **Docstrings**: Google-style. Classes describe purpose; `__init__` has `Args:` section; dataclasses use `Parameters:` section.
- **Linting**: Ruff (line length 100). Pre-commit hooks enforce formatting.
- **Type hints**: Required for complex async code.
-
-### Docstring Example
-
-```python
-class MyService(LLMService):
-    """Description of what the service does.
-
-    More detailed description.
-
-    Event handlers available:
-
-    - on_connected: Called when we are connected
-
-    Example::
-
-        @service.event_handler("on_connected")
-        async def on_connected(service, frame):
-            ...
-    """
-
-    def __init__(self, param1: str, **kwargs):
-        """Initialize the service.
-
-        Args:
-            param1: Description of param1.
-            **kwargs: Additional arguments passed to parent.
-        """
-        super().__init__(**kwargs)
-```
-
-## Service Implementation
-
-When adding a new service:
-
-1. Extend the appropriate base class (`STTService`, `TTSService`, `LLMService`, etc.)
-2. Implement required abstract methods
-3. Handle necessary frames
-4. By default, all frames should be pushed in the direction they came
-5. Push `ErrorFrame` on failures
-6. Add metrics tracking via `MetricsData` if relevant
-7. Follow the pattern of existing services in `src/pipecat/services/`
-
-## Pull Requests
-
-After creating a PR, use `/changelog <pr_number>` to generate the changelog file and `/pr-description <pr_number>` to update the PR description.
--- a/2
+++ b/2
@@ -1,6 +1,6 @@
 BSD 2-Clause License

-Copyright (c) 2024–2026, Daily
+Copyright (c) 2024–2025, Daily

 Redistribution and use in source and binary forms, with or without
 modification, are permitted provided that the following conditions are met:
--- a/README.md
+++ b/README.md
@@ -73,15 +73,15 @@ Catch new features, interviews, and how-tos on our [Pipecat TV](https://www.yout

 | Category            | Services                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
 | ------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| Speech-to-Text      | [AssemblyAI](https://docs.pipecat.ai/server/services/stt/assemblyai), [AWS](https://docs.pipecat.ai/server/services/stt/aws), [Azure](https://docs.pipecat.ai/server/services/stt/azure), [Cartesia](https://docs.pipecat.ai/server/services/stt/cartesia), [Deepgram](https://docs.pipecat.ai/server/services/stt/deepgram), [ElevenLabs](https://docs.pipecat.ai/server/services/stt/elevenlabs), [Fal Wizper](https://docs.pipecat.ai/server/services/stt/fal), [Gladia](https://docs.pipecat.ai/server/services/stt/gladia), [Google](https://docs.pipecat.ai/server/services/stt/google), [Gradium](https://docs.pipecat.ai/server/services/stt/gradium), [Groq (Whisper)](https://docs.pipecat.ai/server/services/stt/groq), [Hathora](https://docs.pipecat.ai/server/services/stt/hathora), [NVIDIA Riva](https://docs.pipecat.ai/server/services/stt/riva), [OpenAI (Whisper)](https://docs.pipecat.ai/server/services/stt/openai), [SambaNova (Whisper)](https://docs.pipecat.ai/server/services/stt/sambanova), [Sarvam](https://docs.pipecat.ai/server/services/stt/sarvam), [Soniox](https://docs.pipecat.ai/server/services/stt/soniox), [Speechmatics](https://docs.pipecat.ai/server/services/stt/speechmatics), [Whisper](https://docs.pipecat.ai/server/services/stt/whisper)                                                                                                                                                                                                                                                            |
+| Speech-to-Text      | [AssemblyAI](https://docs.pipecat.ai/server/services/stt/assemblyai), [AWS](https://docs.pipecat.ai/server/services/stt/aws), [Azure](https://docs.pipecat.ai/server/services/stt/azure), [Cartesia](https://docs.pipecat.ai/server/services/stt/cartesia), [Deepgram](https://docs.pipecat.ai/server/services/stt/deepgram), [ElevenLabs](https://docs.pipecat.ai/server/services/stt/elevenlabs), [Fal Wizper](https://docs.pipecat.ai/server/services/stt/fal), [Gladia](https://docs.pipecat.ai/server/services/stt/gladia), [Google](https://docs.pipecat.ai/server/services/stt/google), [Gradium](https://docs.pipecat.ai/server/services/stt/gradium), [Groq (Whisper)](https://docs.pipecat.ai/server/services/stt/groq), [NVIDIA Riva](https://docs.pipecat.ai/server/services/stt/riva), [OpenAI (Whisper)](https://docs.pipecat.ai/server/services/stt/openai), [SambaNova (Whisper)](https://docs.pipecat.ai/server/services/stt/sambanova), [Sarvam](https://docs.pipecat.ai/server/services/stt/sarvam), [Soniox](https://docs.pipecat.ai/server/services/stt/soniox), [Speechmatics](https://docs.pipecat.ai/server/services/stt/speechmatics), [Whisper](https://docs.pipecat.ai/server/services/stt/whisper)                                                                                                                                                                                                                                                            |
 | LLMs                | [Anthropic](https://docs.pipecat.ai/server/services/llm/anthropic), [AWS](https://docs.pipecat.ai/server/services/llm/aws), [Azure](https://docs.pipecat.ai/server/services/llm/azure), [Cerebras](https://docs.pipecat.ai/server/services/llm/cerebras), [DeepSeek](https://docs.pipecat.ai/server/services/llm/deepseek), [Fireworks AI](https://docs.pipecat.ai/server/services/llm/fireworks), [Gemini](https://docs.pipecat.ai/server/services/llm/gemini), [Grok](https://docs.pipecat.ai/server/services/llm/grok), [Groq](https://docs.pipecat.ai/server/services/llm/groq), [Mistral](https://docs.pipecat.ai/server/services/llm/mistral), [NVIDIA NIM](https://docs.pipecat.ai/server/services/llm/nim), [Ollama](https://docs.pipecat.ai/server/services/llm/ollama), [OpenAI](https://docs.pipecat.ai/server/services/llm/openai), [OpenRouter](https://docs.pipecat.ai/server/services/llm/openrouter), [Perplexity](https://docs.pipecat.ai/server/services/llm/perplexity), [Qwen](https://docs.pipecat.ai/server/services/llm/qwen), [SambaNova](https://docs.pipecat.ai/server/services/llm/sambanova) [Together AI](https://docs.pipecat.ai/server/services/llm/together)                                                                                                                                                                                                                                                                                              |
-| Text-to-Speech      | [Async](https://docs.pipecat.ai/server/services/tts/asyncai), [AWS](https://docs.pipecat.ai/server/services/tts/aws), [Azure](https://docs.pipecat.ai/server/services/tts/azure), [Camb AI](https://docs.pipecat.ai/server/services/tts/camb), [Cartesia](https://docs.pipecat.ai/server/services/tts/cartesia), [Deepgram](https://docs.pipecat.ai/server/services/tts/deepgram), [ElevenLabs](https://docs.pipecat.ai/server/services/tts/elevenlabs), [Fish](https://docs.pipecat.ai/server/services/tts/fish), [Google](https://docs.pipecat.ai/server/services/tts/google), [Gradium](https://docs.pipecat.ai/server/services/tts/gradium), [Groq](https://docs.pipecat.ai/server/services/tts/groq), [Hathora](https://docs.pipecat.ai/server/services/tts/hathora), [Hume](https://docs.pipecat.ai/server/services/tts/hume), [Inworld](https://docs.pipecat.ai/server/services/tts/inworld), [LMNT](https://docs.pipecat.ai/server/services/tts/lmnt), [MiniMax](https://docs.pipecat.ai/server/services/tts/minimax), [Neuphonic](https://docs.pipecat.ai/server/services/tts/neuphonic), [NVIDIA Riva](https://docs.pipecat.ai/server/services/tts/riva), [OpenAI](https://docs.pipecat.ai/server/services/tts/openai), [Piper](https://docs.pipecat.ai/server/services/tts/piper), [PlayHT](https://docs.pipecat.ai/server/services/tts/playht), [Resemble](https://docs.pipecat.ai/server/services/tts/resemble), [Rime](https://docs.pipecat.ai/server/services/tts/rime), [Sarvam](https://docs.pipecat.ai/server/services/tts/sarvam), [Speechmatics](https://docs.pipecat.ai/server/services/tts/speechmatics), [XTTS](https://docs.pipecat.ai/server/services/tts/xtts) |
+| Text-to-Speech      | [Async](https://docs.pipecat.ai/server/services/tts/asyncai), [AWS](https://docs.pipecat.ai/server/services/tts/aws), [Azure](https://docs.pipecat.ai/server/services/tts/azure), [Cartesia](https://docs.pipecat.ai/server/services/tts/cartesia), [Deepgram](https://docs.pipecat.ai/server/services/tts/deepgram), [ElevenLabs](https://docs.pipecat.ai/server/services/tts/elevenlabs), [Fish](https://docs.pipecat.ai/server/services/tts/fish), [Google](https://docs.pipecat.ai/server/services/tts/google), [Gradium](https://docs.pipecat.ai/server/services/tts/gradium), [Groq](https://docs.pipecat.ai/server/services/tts/groq), [Hume](https://docs.pipecat.ai/server/services/tts/hume), [Inworld](https://docs.pipecat.ai/server/services/tts/inworld), [LMNT](https://docs.pipecat.ai/server/services/tts/lmnt), [MiniMax](https://docs.pipecat.ai/server/services/tts/minimax), [Neuphonic](https://docs.pipecat.ai/server/services/tts/neuphonic), [NVIDIA Riva](https://docs.pipecat.ai/server/services/tts/riva), [OpenAI](https://docs.pipecat.ai/server/services/tts/openai), [Piper](https://docs.pipecat.ai/server/services/tts/piper), [PlayHT](https://docs.pipecat.ai/server/services/tts/playht), [Rime](https://docs.pipecat.ai/server/services/tts/rime), [Sarvam](https://docs.pipecat.ai/server/services/tts/sarvam), [Speechmatics](https://docs.pipecat.ai/server/services/tts/speechmatics), [XTTS](https://docs.pipecat.ai/server/services/tts/xtts) |
 | Speech-to-Speech    | [AWS Nova Sonic](https://docs.pipecat.ai/server/services/s2s/aws), [Gemini Multimodal Live](https://docs.pipecat.ai/server/services/s2s/gemini), [Grok Voice Agent](https://docs.pipecat.ai/server/services/s2s/grok), [OpenAI Realtime](https://docs.pipecat.ai/server/services/s2s/openai), [Ultravox](https://docs.pipecat.ai/server/services/s2s/ultravox),                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
 | Transport           | [Daily (WebRTC)](https://docs.pipecat.ai/server/services/transport/daily), [FastAPI Websocket](https://docs.pipecat.ai/server/services/transport/fastapi-websocket), [SmallWebRTCTransport](https://docs.pipecat.ai/server/services/transport/small-webrtc), [WebSocket Server](https://docs.pipecat.ai/server/services/transport/websocket-server), Local                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
-| Serializers         | [Exotel](https://docs.pipecat.ai/server/utilities/serializers/exotel), [Plivo](https://docs.pipecat.ai/server/utilities/serializers/plivo), [Twilio](https://docs.pipecat.ai/server/utilities/serializers/twilio), [Telnyx](https://docs.pipecat.ai/server/utilities/serializers/telnyx), [Vonage](https://docs.pipecat.ai/server/utilities/serializers/vonage)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
+| Serializers         | [Plivo](https://docs.pipecat.ai/server/utilities/serializers/plivo), [Twilio](https://docs.pipecat.ai/server/utilities/serializers/twilio), [Telnyx](https://docs.pipecat.ai/server/utilities/serializers/telnyx)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
 | Video               | [HeyGen](https://docs.pipecat.ai/server/services/video/heygen), [Tavus](https://docs.pipecat.ai/server/services/video/tavus), [Simli](https://docs.pipecat.ai/server/services/video/simli)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
 | Memory              | [mem0](https://docs.pipecat.ai/server/services/memory/mem0)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
-| Vision & Image      | [fal](https://docs.pipecat.ai/server/services/image-generation/fal), [Google Imagen](https://docs.pipecat.ai/server/services/image-generation/google-imagen), [Moondream](https://docs.pipecat.ai/server/services/vision/moondream)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
+| Vision & Image      | [fal](https://docs.pipecat.ai/server/services/image-generation/fal), [Google Imagen](https://docs.pipecat.ai/server/services/image-generation/fal), [Moondream](https://docs.pipecat.ai/server/services/vision/moondream)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
 | Audio Processing    | [Silero VAD](https://docs.pipecat.ai/server/utilities/audio/silero-vad-analyzer), [Krisp](https://docs.pipecat.ai/server/utilities/audio/krisp-filter), [Koala](https://docs.pipecat.ai/server/utilities/audio/koala-filter), [ai-coustics](https://docs.pipecat.ai/server/utilities/audio/aic-filter)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
 | Analytics & Metrics | [OpenTelemetry](https://docs.pipecat.ai/server/utilities/opentelemetry), [Sentry](https://docs.pipecat.ai/server/services/analytics/sentry)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |

--- a/changelog/3045.added.md
+++ b/changelog/3045.added.md
@@ -0,0 +1,42 @@
+- Introducing user turn strategies. User turn strategies indicate when the user turn starts or stops. In conversational agents, these are often referred to as start/stop speaking or turn-taking plans or policies.
+
+  User turn start strategies indicate when the user starts speaking (e.g. using VAD events or when a user says one or more words).
+
+  User turn stop strategies indicate when the user stops speaking (e.g. using an end-of-turn detection model or by observing incoming transcriptions).
+
+  A list of strategies can be specified for both strategies; strategies are evaluated in order until one evaluates to true.
+
+  Available user turn start strategies:
+    - VADUserTurnStartStrategy
+    - TranscriptionUserTurnStartStrategy
+    - MinWordsUserTurnStartStrategy
+    - ExternalUserTurnStartStrategy
+
+  Available user turn stop strategies:
+    - TranscriptionUserTurnStopStrategy
+    - TurnAnalyzerUserTurnStopStrategy
+    - ExternalUserTurnStopStrategy
+
+  The default strategies are:
+
+    - start: [VADUserTurnStartStrategy, TranscriptionUserTurnStartStrategy]
+    - stop: [TranscriptionUserTurnStopStrategy]
+
+  Turn strategies are configured when setting up `LLMContextAggregatorPair`. For example:
+
+  ```python
+  context_aggregator = LLMContextAggregatorPair(
+      context,
+      user_params=LLMUserAggregatorParams(
+          user_turn_strategies=UserTurnStrategies(
+              stop=[
+                  TurnAnalyzerUserTurnStopStrategy(
+                      turn_analyzer=LocalSmartTurnAnalyzerV3(params=SmartTurnParams())
+                  )
+              ],
+          )
+      ),
+  )
+  ```
+
+  In order to use the user turn strategies you must update to the new universal `LLMContext` and `LLMContextAggregatorPair`.
--- a/changelog/3045.deprecated.2.md
+++ b/changelog/3045.deprecated.2.md
@@ -0,0 +1 @@
+- ⚠️ `TransportParams.turn_analyzer` is deprecated and might result in unexpected behavior, use `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
--- a/changelog/3045.deprecated.3.md
+++ b/changelog/3045.deprecated.3.md
@@ -0,0 +1 @@
+- `FrameProcessor.interruption_strategies` is deprecated, use `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
--- a/changelog/3045.deprecated.4.md
+++ b/changelog/3045.deprecated.4.md
@@ -0,0 +1 @@
+- `EmulateUserStartedSpeakingFrame` and `EmulateUserStoppedSpeakingFrame` frames are deprecated.
--- a/changelog/3045.deprecated.5.md
+++ b/changelog/3045.deprecated.5.md
@@ -0,0 +1 @@
+- Deprecated the `emulated` field in the `UserStartedSpeakingFrame` and `UserStoppedSpeakingFrame` frames.
--- a/changelog/3045.deprecated.6.md
+++ b/changelog/3045.deprecated.6.md
@@ -0,0 +1 @@
+- The `LLMUserAggregatorParams` and `LLMAssistantAggregatorParams` classes in `pipecat.processors.aggregators.llm_response` are now deprecated. Use the new universal `LLMContext` and `LLMContextAggregatorPair` instead.
--- a/changelog/3045.deprecated.md
+++ b/changelog/3045.deprecated.md
@@ -0,0 +1 @@
+- `pipecat.audio.interruptions.MinWordsInterruptionStrategy` is deprecated. Use `pipecat.turns.user_start.MinWordsUserTurnStartStrategy` with `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
--- a/changelog/3134.added.md
+++ b/changelog/3134.added.md
@@ -1 +0,0 @@
- Added `ResembleAITTSService` for text-to-speech using Resemble AI's streaming WebSocket API with word-level timestamps and jitter buffering for smooth audio playback.
--- a/changelog/3205.added.md
+++ b/changelog/3205.added.md
@@ -0,0 +1 @@
+- Added `RNNoiseFilter` for real-time noise suppression using RNNoise neural network via pyrnnoise library.
--- a/changelog/3225.changed.md
+++ b/changelog/3225.changed.md
@@ -0,0 +1,15 @@
+- Updated `SpeechmaticsSTTService` to use new Python Voice SDK with improved VAD,
+  Smart Turn capabilities, and brings dramatic improvements to latency without
+  any impact on accuracy. Use the `turn_detection_mode` parameter to control the
+  endpointing of speech, with `TurnDetectionMode.EXTERNAL` (default),
+  `TurnDetectionMode.ADAPTIVE`, or `TurnDetectionMode.SMART_TURN`.
+  ```python
+    stt = SpeechmaticsSTTService(
+        api_key=os.getenv("SPEECHMATICS_API_KEY"),
+        params=SpeechmaticsSTTService.InputParams(
+            language=Language.EN,
+            turn_detection_mode=SpeechmaticsSTTService.TurnDetectionMode.ADAPTIVE,
+            speaker_active_format="<{speaker_id}>{text}</{speaker_id}>",
+        ),
+    )
+  ```
--- a/changelog/3225.deprecated.md
+++ b/changelog/3225.deprecated.md
@@ -0,0 +1,4 @@
+- For `SpeechmaticsSTTService`, the `end_of_utterance_mode` parameter is deprecated.
+  Use the new `turn_detection_mode` parameter instead, with `TurnDetectionMode.EXTERNAL`,
+  `TurnDetectionMode.ADAPTIVE`, or `TurnDetectionMode.SMART_TURN`. The `enable_vad`
+  parameter is also deprecated and is inferred from the `turn_detection_mode`.
--- a/changelog/3233.fixed.md
+++ b/changelog/3233.fixed.md
@@ -0,0 +1,2 @@
+- Improved error handling in `ElevenLabsRealtimeSTTService`
+- Fixed an issue in `ElevenLabsRealtimeSTTService` causing an infinite loop that blocks the process if the websocket disconnects due to an error
--- a/changelog/3257.changed.2.md
+++ b/changelog/3257.changed.2.md
@@ -0,0 +1 @@
+- `TranscriptionFrame` and `InterimTranscriptionFrame` produced by `DailyTransport` now include the transport source (i.e., the originating audio track).
--- a/changelog/3257.changed.md
+++ b/changelog/3257.changed.md
@@ -0,0 +1 @@
+- `daily-python` updated to 0.23.0.
--- a/changelog/3263.deprecated.md
+++ b/changelog/3263.deprecated.md
@@ -0,0 +1,15 @@
+- `OpenAILLMContext` and its associated things (context aggregators, etc.) are now deprecated in favor of the universal `LLMContext` and its associated things.
+
+  From the developer's point of view, switching to using `LLMContext` machinery will usually be a matter of going from this:
+
+  ```python
+  context = OpenAILLMContext(messages, tools)
+  context_aggregator = llm.create_context_aggregator(context)
+  ```
+
+  To this:
+
+  ```
+  context = LLMContext(messages, tools)
+  context_aggregator = LLMContextAggregatorPair(context)
+  ```
--- a/changelog/3267.added.md
+++ b/changelog/3267.added.md
@@ -0,0 +1,8 @@
+- Added `GrokRealtimeLLMService` for xAI's Grok Voice Agent API with real-time voice conversations:
+
+  - Support for real-time audio streaming with WebSocket connection
+  - Built-in server-side VAD (Voice Activity Detection)
+  - Multiple voice options: Ara, Rex, Sal, Eve, Leo
+  - Built-in tools support: web_search, x_search, file_search
+  - Custom function calling with standard Pipecat tools schema
+  - Configurable audio formats (PCM at 8kHz-48kHz)
--- a/changelog/3268.added.md
+++ b/changelog/3268.added.md
@@ -0,0 +1 @@
+- Added an approximation of TTFB for Ultravox.
--- a/changelog/3288.changed.md
+++ b/changelog/3288.changed.md
@@ -0,0 +1,5 @@
+- Updates to Inworld TTS services:
+
+  - Improved `InworldTTSService`'s websocket implementation to better flush and
+    close context to better handle long inputs.
+  - Improved docstrings for `InworldTTSService` and `InworldHttpTTSService`.
--- a/changelog/3289.added.md
+++ b/changelog/3289.added.md
@@ -0,0 +1 @@
+- Added a new `AudioContextTTSService` to the TTS service base classes. The `AudioContextWordTTSService` now inherits from `AudioContextTTSService` and `WebsocketWordTTSService`.
--- a/changelog/3291.added.md
+++ b/changelog/3291.added.md
@@ -0,0 +1,4 @@
+- `LLMUserAggregator` now exposes the following events:
+   - `on_user_turn_started`: triggered when a user turn starts
+   - `on_user_turn_stopped`: triggered when a user turn ends
+   - `on_user_turn_stop_timeout`: triggered when a user turn does not stop and times out
--- a/changelog/3292.added.md
+++ b/changelog/3292.added.md
@@ -0,0 +1,29 @@
+- Introducing user mute strategies. User mute strategies indicate when user input should be muted based on the current system state.
+
+  In conversational agents, user mute strategies are used to prevent user input from interrupting bot speech, tool execution, or other critical system operations.
+
+  A list of strategies can be specified; all strategies are evaluated for every frame so that each strategy can maintain its internal state. A user frame is muted if any of the configured strategies indicates it should be muted.
+
+  Available user mute strategies:
+
+  * `FirstSpeechUserMuteStrategy`
+  * `MuteUntilFirstBotCompleteUserMuteStrategy`
+  * `AlwaysUserMuteStrategy`
+  * `FunctionCallUserMuteStrategy`
+
+  User mute strategies replace the legacy `STTMuteFilter` and provide a more flexible and composable approach to muting user input.
+
+  User mute strategies are configured when setting up the `LLMContextAggregatorPair`. For example:
+
+  ```python
+  context_aggregator = LLMContextAggregatorPair(
+      context,
+      user_params=LLMUserAggregatorParams(
+          user_mute_strategies=[
+              FirstSpeechUserMuteStrategy(),
+          ]
+      ),
+  )
+  ```
+
+  In order to use user mute strategies you should update to the new universal `LLMContext` and `LLMContextAggregatorPair`.
--- a/changelog/3292.deprecated.md
+++ b/changelog/3292.deprecated.md
@@ -0,0 +1 @@
+- `STTMuteFilter` is deprecated and will be removed in a future version. Use `LLMUserAggregator`'s new `user_mute_strategies` instead.
--- a/changelog/3292.fixed.md
+++ b/changelog/3292.fixed.md
@@ -0,0 +1 @@
+- Fixed a bug in `STTMuteFilter` where the user was not always muted during function calls, especially when there were multiple simultaneous calls.
--- a/changelog/3297.deprecated.2.md
+++ b/changelog/3297.deprecated.2.md
@@ -0,0 +1 @@
+- `FrameProcessor.interruptions_allowed` is now deprecated, use `LLMUserAggregator`'s new parameter `user_mute_strategies` instead.
--- a/changelog/3297.deprecated.md
+++ b/changelog/3297.deprecated.md
@@ -0,0 +1,12 @@
+- `PipelineParams.allow_interruptions` is now deprecated, use `LLMUserAggregator`'s new parameter `turn_start_strategies` instead. For example, to disable interruptions but still get user turns you can do:
+
+  ```python
+  context_aggregator = LLMContextAggregatorPair(
+      context,
+      user_params=LLMUserAggregatorParams(
+          user_turn_strategies=UserTurnStrategies(
+              start=[TranscriptionUserTurnStartStrategy(enable_interruptions=False)],
+          ),
+      ),
+  )
+  ```
--- a/changelog/3300.added.md
+++ b/changelog/3300.added.md
@@ -0,0 +1 @@
+- Added `use_ssl` parameter to `NvidiaSTTService`, `NvidiaSegmentedSTTService` and `NvidiaTTSService`.
--- a/changelog/3314.changed.md
+++ b/changelog/3314.changed.md
@@ -0,0 +1 @@
+- Updated `DeepgramSTTService` to push user started/stopped speaking and interruption frames when `vad_enabled` is set to true. This centralizes the frames into the service, removing the need to have your application code handle Deepgram's events and push these frames.
--- a/changelog/3316.added.md
+++ b/changelog/3316.added.md
@@ -0,0 +1 @@
+- Added `enable_interruptions` constructor argument to all user turn strategies. This tells the `LLMUserAggregator` to push or not push an `InterruptionFrame`.
--- a/changelog/3316.other.md
+++ b/changelog/3316.other.md
@@ -0,0 +1 @@
+- Added `52-live-transcription.py` foundational example demonstrating live transcription and translation from English to Spanish. In this example, the bot is not interruptible: as the user continues speaking, English transcriptions are queued, and the bot continuously translates and speaks each queued sentence in Spanish without being interrupted by new user speech.
--- a/changelog/3326.added.md
+++ b/changelog/3326.added.md
@@ -0,0 +1 @@
+- Frame processors can now push frames from the top of the pipeline using new methods `queue_task_frame()` and `queue_task_frames()`.
--- a/changelog/3355.added.md
+++ b/changelog/3355.added.md
@@ -1 +0,0 @@
- Added `UserBotLatencyObserver` for tracking user-to-bot response latency. When tracing is enabled, latency measurements are automatically recorded as `turn.user_bot_latency_seconds` attributes on OpenTelemetry turn spans.
--- a/changelog/3355.deprecated.md
+++ b/changelog/3355.deprecated.md
@@ -1 +0,0 @@
- Deprecated `UserBotLatencyLogObserver`. Use `UserBotLatencyObserver` directly with its `on_latency_measured` event handler instead.
--- a/changelog/3542.fixed.md
+++ b/changelog/3542.fixed.md
@@ -1 +0,0 @@
- Fixed pipeline freeze when `InterruptionFrame` discards `EndFrame` or `StopFrame` by making terminal frames uninterruptible.
--- a/changelog/3589.fixed.md
+++ b/changelog/3589.fixed.md
@@ -1 +0,0 @@
- Fixed OpenAI LLM stream not being closed on cancellation/exception, which could leak sockets.
--- a/changelog/3593.added.md
+++ b/changelog/3593.added.md
@@ -1 +0,0 @@
- Added support for Inworld TTS Websocket Auto Mode for improved latency
--- a/changelog/3593.changed.md
+++ b/changelog/3593.changed.md
@@ -1 +0,0 @@
- Updated timestamps to be cumulative within an agent turn, using flushCompleted message as an indication of when timestamps from the server are reset to 0
--- a/changelog/3610.fixed.md
+++ b/changelog/3610.fixed.md
@@ -1 +0,0 @@
- Fixed `PipelineTask` adding duplicate `RTVIProcessor` and `RTVIObserver` when they were already provided in the pipeline or observers list. They are now detected and skipped, with appropriate warnings and errors logged for mismatched configurations.
--- a/changelog/3612.changed.md
+++ b/changelog/3612.changed.md
@@ -1 +0,0 @@
-  Changed `KokoroTTSService` to use `kokoro-onnx` instead of `kokoro` as the underlying TTS engine.
--- a/changelog/3616.fixed.md
+++ b/changelog/3616.fixed.md
@@ -1 +0,0 @@
- Fixed function call timeout task not being cancelled when the handler completes without calling `result_callback` or is cancelled externally, which caused `RuntimeWarning: coroutine was never awaited`.
--- a/changelog/3617.fixed.md
+++ b/changelog/3617.fixed.md
@@ -1,5 +0,0 @@
- Fixed sentence splitting for Japanese, Chinese, Korean, and other non-Latin
-  languages in TTS pipeline. NLTK's sentence tokenizer does not support CJK
-  languages, causing text to accumulate until flush instead of being split at
-  sentence boundaries. Added fallback detection for unambiguous non-Latin
-  sentence-ending punctuation (e.g., `。`, `？`, `！`).
--- a/changelog/3623.fixed.md
+++ b/changelog/3623.fixed.md
@@ -1 +0,0 @@
- Fixed `PipelineTask` to also call `set_bot_ready()` when an external `RTVIProcessor` is provided.
--- a/changelog/3628.fixed.md
+++ b/changelog/3628.fixed.md
@@ -1 +0,0 @@
- Fixed `VADController` not broadcasting `SpeechControlParamsFrame` on startup, which prevented STT services from receiving VAD params needed for TTFB measurement.
--- a/changelog/3629.fixed.md
+++ b/changelog/3629.fixed.md
@@ -1 +0,0 @@
- Fixed `StopAsyncIteration` exceptions in `parse_telephony_websocket()` when WebSocket connections close before sending expected messages.
--- a/changelog/3630.added.md
+++ b/changelog/3630.added.md
@@ -1 +0,0 @@
- Added RTVI function call lifecycle events (`llm-function-call-started`, `llm-function-call-in-progress`, `llm-function-call-stopped`) with configurable security levels via `RTVIObserverParams.function_call_report_level`. Supports per-function control over what information is exposed (`DISABLED`, `NONE`, `NAME`, or `FULL`).
--- a/changelog/3630.deprecated.md
+++ b/changelog/3630.deprecated.md
@@ -1 +0,0 @@
- Deprecated `RTVILLMFunctionCallMessage`, `RTVILLMFunctionCallMessageData`, and `RTVIProcessor.handle_function_call()`. Use the new `llm-function-call-in-progress` event sent automatically by `RTVIObserver` instead.
--- a/changelog/3635.fixed.md
+++ b/changelog/3635.fixed.md
@@ -1 +0,0 @@
- Fixed WebSocket transport error when broadcasting `InputTransportMessageFrame` by correctly instantiating the frame with its message parameter.
--- a/changelog/3649.fixed.md
+++ b/changelog/3649.fixed.md
@@ -1 +0,0 @@
- Fixed orphan OpenTelemetry spans during flow initialization and transitions in tracing.
--- a/changelog/3652.changed.md
+++ b/changelog/3652.changed.md
@@ -1 +0,0 @@
- Upgraded the `pipecat-ai-small-webrtc-prebuilt` package to v2.1.0.
--- a/changelog/3656.added.md
+++ b/changelog/3656.added.md
@@ -1 +0,0 @@
- Added `OpenAIRealtimeSTTService` for real-time streaming speech-to-text using OpenAI's Realtime API WebSocket transcription sessions. Supports local VAD and server-side VAD modes, noise reduction, and automatic reconnection.
--- a/changelog/3659.changed.md
+++ b/changelog/3659.changed.md
@@ -1,10 +0,0 @@
- ⚠️ The default `VADParams` `stop_secs` default is changing from `0.8` seconds
-  to `0.2` seconds. This change both simplifies the developer experience and
-    improves the performance of STT services. With a shorter `stop_secs` value,
-    STT services using a local VAD can finalize sooner, resulting in faster
-    transcription.
-
-  - `SpeechTimeoutUserTurnStopStrategy`: control how long to wait for
-    additional user speech using `user_speech_timeout` (default: 0.6 sec).
-  - `TurnAnalyzerUserTurnStopStrategy`: the turn analyzer automatically adjusts
-    the user wait time based on the audio input.
--- a/changelog/3660.changed.md
+++ b/changelog/3660.changed.md
@@ -1 +0,0 @@
- Moved interruption wait event from per-processor instance state to `InterruptionFrame` itself. Added `InterruptionFrame.complete()` to signal when the interruption has fully traversed the pipeline. Custom processors that block or consume an `InterruptionFrame` before it reaches the pipeline sink must call `frame.complete()` to avoid stalling `push_interruption_task_frame_and_wait()`. A warning is logged if completion does not happen within 2 seconds.
--- a/changelog/3663.fixed.md
+++ b/changelog/3663.fixed.md
@@ -1 +0,0 @@
- Fixed `SambaNovaLLMService` and `GoogleLLMOpenAIBetaService` streams not being closed on cancellation/exception, which could leak sockets.
--- a/changelog/3664.changed.md
+++ b/changelog/3664.changed.md
@@ -1 +0,0 @@
- Update the default model to `scribe_v2` for `ElevenLabsSTTService`.
--- a/changelog/3666.changed.md
+++ b/changelog/3666.changed.md
@@ -1 +0,0 @@
- Changed the `DeepgramSTTService` default setting for `smart_format` to `False`, as agents don't need smart formatting. Disabling this setting provides a small performance improvement, as well.
--- a/changelog/3667.fixed.md
+++ b/changelog/3667.fixed.md
@@ -1 +0,0 @@
- Fixed an issue in `InworldTTSService` where punctuation was pronounced. Now, the `InworldTTSService` ensures proper spacing between sentences, resolving pronunciation issues.
--- a/changelog/3668.fixed.md
+++ b/changelog/3668.fixed.md
@@ -1 +0,0 @@
- Fixed `ParallelPipeline` allowing frames pushed by internal processors to escape during lifecycle frame (`StartFrame`/`EndFrame`/`CancelFrame`) synchronization. These frames are now buffered and flushed after all branches complete.
--- a/changelog/3678.added.md
+++ b/changelog/3678.added.md
@@ -1 +0,0 @@
- Added pyright basic type checking configuration for the core framework.
--- a/docs/api/conf.py
+++ b/docs/api/conf.py
@@ -91,25 +91,6 @@ autodoc_mock_imports = [
    # MLX dependencies (Apple Silicon specific)
    "mlx",
    "mlx_whisper",  # Note: might need underscore format too
-    # Pydantic v2 compatibility issues in third-party SDKs
-    "hume",
-    "hume.tts",
-    "hume.tts.types",
-    "cartesia",
-    "camb",
-    "sarvamai",
-    "openpipe",
-    "openai.types.beta.realtime",
-    "langchain_core",
-    "langchain_core.messages",
-    # FastAPI - Pydantic v2 compatibility issues during Sphinx autodoc
-    "fastapi",
-    "fastapi.applications",
-    "fastapi.routing",
-    "fastapi.params",
-    "fastapi.middleware",
-    "fastapi.responses",
-    "uvicorn",
 ]

 # HTML output settings
--- a/env.example
+++ b/env.example
@@ -31,9 +31,6 @@ AZURE_DALLE_API_KEY=...
 AZURE_DALLE_ENDPOINT=https://...
 AZURE_DALLE_MODEL=...

-# Camb.ai
-CAMB_API_KEY=...
-
 # Cartesia
 CARTESIA_API_KEY=...
 CARTESIA_VOICE_ID=...
@@ -43,7 +40,7 @@ CEREBRAS_API_KEY=...

 # Daily
 DAILY_API_KEY=...
-DAILY_ROOM_URL=https://...
+DAILY_SAMPLE_ROOM_URL=https://...

 # Deepgram
 DEEPGRAM_API_KEY=...
@@ -85,9 +82,6 @@ GROK_API_KEY=...
 # Groq
 GROQ_API_KEY=...

-# Hathora
-HATHORA_API_KEY=...
-
 # Heygen
 HEYGEN_API_KEY=...
 HEYGEN_LIVE_AVATAR_API_KEY=...
@@ -103,8 +97,7 @@ INWORLD_API_KEY=...
 KRISP_MODEL_PATH=...

 # Krisp Viva
-KRISP_VIVA_FILTER_MODEL_PATH=...
-KRISP_VIVA_TURN_MODEL_PATH=...
+KRISP_VIVA_MODEL_PATH=...

 # LiveKit
 LIVEKIT_API_KEY=...
@@ -156,10 +149,6 @@ PLIVO_AUTH_TOKEN=...
 # Qwen
 QWEN_API_KEY=...

-# Resemble AI
-RESEMBLE_API_KEY=
-RESEMBLE_VOICE_UUID=
-
 # Rime
 RIME_API_KEY=...
 RIME_VOICE_ID=...
--- a/examples/foundational/01-say-one-thing-piper.py
+++ b/examples/foundational/01-say-one-thing-piper.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -16,7 +16,7 @@ from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineTask
 from pipecat.runner.types import RunnerArguments
 from pipecat.runner.utils import create_transport
-from pipecat.services.piper.tts import PiperHttpTTSService
+from pipecat.services.piper.tts import PiperTTSService
 from pipecat.transports.base_transport import BaseTransport, TransportParams
 from pipecat.transports.daily.transport import DailyParams
 from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
@@ -24,8 +24,9 @@ from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(audio_out_enabled=True),
    "twilio": lambda: FastAPIWebsocketParams(audio_out_enabled=True),
@@ -38,7 +39,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):

    # Create an HTTP session
    async with aiohttp.ClientSession() as session:
-        tts = PiperHttpTTSService(
+        tts = PiperTTSService(
            base_url=os.getenv("PIPER_BASE_URL"), aiohttp_session=session, sample_rate=24000
        )

--- a/examples/foundational/01-say-one-thing-rime.py
+++ b/examples/foundational/01-say-one-thing-rime.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -23,8 +23,9 @@ from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams

 load_dotenv(override=True)

-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(audio_out_enabled=True),
    "twilio": lambda: FastAPIWebsocketParams(audio_out_enabled=True),
--- a/examples/foundational/01-say-one-thing.py
+++ b/examples/foundational/01-say-one-thing.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -23,8 +23,9 @@ from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(audio_out_enabled=True),
    "twilio": lambda: FastAPIWebsocketParams(audio_out_enabled=True),
--- a/examples/foundational/01a-local-audio.py
+++ b/examples/foundational/01a-local-audio.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
--- a/examples/foundational/01b-livekit-audio.py
+++ b/examples/foundational/01b-livekit-audio.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
--- a/examples/foundational/01c-nvidia-riva-tts.py
+++ b/examples/foundational/01c-nvidia-riva-tts.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -23,8 +23,9 @@ from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(audio_out_enabled=True),
    "twilio": lambda: FastAPIWebsocketParams(audio_out_enabled=True),
--- a/examples/foundational/02-llm-say-one-thing.py
+++ b/examples/foundational/02-llm-say-one-thing.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -25,8 +25,9 @@ from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(audio_out_enabled=True),
    "twilio": lambda: FastAPIWebsocketParams(audio_out_enabled=True),
--- a/examples/foundational/03-still-frame.py
+++ b/examples/foundational/03-still-frame.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -23,8 +23,9 @@ from pipecat.transports.daily.transport import DailyParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        video_out_enabled=True,
--- a/examples/foundational/03a-local-still-frame.py
+++ b/examples/foundational/03a-local-still-frame.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
--- a/examples/foundational/03b-still-frame-imagen.py
+++ b/examples/foundational/03b-still-frame-imagen.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -22,8 +22,9 @@ from pipecat.transports.daily.transport import DailyParams
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        video_out_enabled=True,
--- a/examples/foundational/04-transports-small-webrtc.py
+++ b/examples/foundational/04-transports-small-webrtc.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -19,6 +19,7 @@ from pipecat_ai_small_webrtc_prebuilt.frontend import SmallWebRTCPrebuiltUI

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -63,6 +64,7 @@ async def run_example(webrtc_connection: SmallWebRTCConnection):
        params=TransportParams(
            audio_in_enabled=True,
            audio_out_enabled=True,
+            vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
        ),
    )

@@ -83,13 +85,12 @@ async def run_example(webrtc_connection: SmallWebRTCConnection):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -97,11 +98,11 @@ async def run_example(webrtc_connection: SmallWebRTCConnection):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/04a-transports-daily.py
+++ b/examples/foundational/04a-transports-daily.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -14,6 +14,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -48,6 +49,7 @@ async def main():
                audio_in_enabled=True,
                audio_out_enabled=True,
                transcription_enabled=True,
+                vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
            ),
        )

@@ -66,7 +68,7 @@ async def main():
        ]

        context = LLMContext(messages)
-        user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+        context_aggregator = LLMContextAggregatorPair(
            context,
            user_params=LLMUserAggregatorParams(
                user_turn_strategies=UserTurnStrategies(
@@ -74,18 +76,17 @@ async def main():
                        TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())
                    ]
                ),
-                vad_analyzer=SileroVADAnalyzer(),
            ),
        )

        pipeline = Pipeline(
            [
                transport.input(),  # Transport user input
-                user_aggregator,  # User responses
+                context_aggregator.user(),  # User responses
                llm,  # LLM
                tts,  # TTS
                transport.output(),  # Transport bot output
-                assistant_aggregator,  # Assistant spoken responses
+                context_aggregator.assistant(),  # Assistant spoken responses
            ]
        )

--- a/examples/foundational/04b-transports-livekit.py
+++ b/examples/foundational/04b-transports-livekit.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -14,6 +14,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import (
    InterruptionFrame,
    TranscriptionFrame,
@@ -53,6 +54,7 @@ async def main():
        params=LiveKitParams(
            audio_in_enabled=True,
            audio_out_enabled=True,
+            vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
        ),
    )

@@ -76,13 +78,12 @@ async def main():
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -90,11 +91,11 @@ async def main():
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/05-sync-speech-and-image.py
+++ b/examples/foundational/05-sync-speech-and-image.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -65,8 +65,9 @@ class MonthPrepender(FrameProcessor):
            await self.push_frame(frame, direction)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_out_enabled=True,
--- a/examples/foundational/05a-local-sync-speech-and-image.py
+++ b/examples/foundational/05a-local-sync-speech-and-image.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
--- a/examples/foundational/06-listen-and-respond.py
+++ b/examples/foundational/06-listen-and-respond.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import Frame, LLMRunFrame, MetricsFrame
 from pipecat.metrics.metrics import (
    LLMUsageMetricsData,
@@ -61,20 +62,24 @@ class MetricsLogger(FrameProcessor):
        await self.push_frame(frame, direction)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -101,13 +106,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -115,12 +119,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),
            stt,
-            user_aggregator,
+            context_aggregator.user(),
            llm,
            tts,
            ml,
            transport.output(),
-            assistant_aggregator,
+            context_aggregator.assistant(),
        ]
    )

--- a/examples/foundational/06a-image-sync.py
+++ b/examples/foundational/06a-image-sync.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -12,6 +12,7 @@ from PIL import Image

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import (
    BotStartedSpeakingFrame,
    BotStoppedSpeakingFrame,
@@ -76,8 +77,9 @@ class ImageSyncAggregator(FrameProcessor):
        await self.push_frame(frame, direction)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
@@ -85,6 +87,7 @@ transport_params = {
        video_out_enabled=True,
        video_out_width=1024,
        video_out_height=1024,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
@@ -92,6 +95,7 @@ transport_params = {
        video_out_enabled=True,
        video_out_width=1024,
        video_out_height=1024,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -116,13 +120,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -135,12 +138,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),
            stt,
-            user_aggregator,
+            context_aggregator.user(),
            llm,
            tts,
            image_sync_aggregator,
            transport.output(),
-            assistant_aggregator,
+            context_aggregator.assistant(),
        ]
    )

--- a/examples/foundational/07-interruptible-cartesia-http.py
+++ b/examples/foundational/07-interruptible-cartesia-http.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -34,20 +35,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -72,13 +77,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -86,11 +90,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07-interruptible.py
+++ b/examples/foundational/07-interruptible.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -33,20 +34,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies

 load_dotenv(override=True)

-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -71,13 +76,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -85,11 +89,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07a-interruptible-speechmatics-vad.py
+++ b/examples/foundational/07a-interruptible-speechmatics-vad.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -33,8 +33,9 @@ from pipecat.turns.user_turn_strategies import ExternalUserTurnStrategies

 load_dotenv(override=True)

-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
@@ -130,7 +131,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ]

        context = LLMContext(messages)
-        user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+        context_aggregator = LLMContextAggregatorPair(
            context,
            user_params=LLMUserAggregatorParams(user_turn_strategies=ExternalUserTurnStrategies()),
        )
@@ -139,11 +140,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
            [
                transport.input(),  # Transport user input
                stt,
-                user_aggregator,  # User responses
+                context_aggregator.user(),  # User responses
                llm,  # LLM
                tts,  # TTS
                transport.output(),  # Transport bot output
-                assistant_aggregator,  # Assistant spoken responses
+                context_aggregator.assistant(),  # Assistant spoken responses
            ]
        )

--- a/examples/foundational/07a-interruptible-speechmatics.py
+++ b/examples/foundational/07a-interruptible-speechmatics.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -12,6 +12,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -36,20 +37,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies

 load_dotenv(override=True)

-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -112,7 +117,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ]

        context = LLMContext(messages)
-        user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+        context_aggregator = LLMContextAggregatorPair(
            context,
            user_params=LLMUserAggregatorParams(
                user_turn_strategies=UserTurnStrategies(
@@ -120,7 +125,6 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
                        TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())
                    ]
                ),
-                vad_analyzer=SileroVADAnalyzer(),
            ),
        )

@@ -128,11 +132,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
            [
                transport.input(),  # Transport user input
                stt,  # STT
-                user_aggregator,  # User responses
+                context_aggregator.user(),  # User responses
                llm,  # LLM
                tts,  # TTS
                transport.output(),  # Transport bot output
-                assistant_aggregator,  # Assistant spoken responses
+                context_aggregator.assistant(),  # Assistant spoken responses
            ]
        )

--- a/examples/foundational/07aa-interruptible-soniox.py
+++ b/examples/foundational/07aa-interruptible-soniox.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -12,6 +12,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -38,14 +39,17 @@ transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -72,13 +76,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -86,11 +89,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )
    task = PipelineTask(
--- a/examples/foundational/07ab-interruptible-inworld-http.py
+++ b/examples/foundational/07ab-interruptible-inworld-http.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -12,6 +12,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame, TTSTextFrame
 from pipecat.observers.loggers.debug_log_observer import DebugLogObserver, FrameEndpoint
 from pipecat.pipeline.pipeline import Pipeline
@@ -22,6 +23,7 @@ from pipecat.processors.aggregators.llm_response_universal import (
    LLMContextAggregatorPair,
    LLMUserAggregatorParams,
 )
+from pipecat.processors.frameworks.rtvi import RTVIObserver, RTVIProcessor
 from pipecat.runner.types import RunnerArguments
 from pipecat.runner.utils import create_transport
 from pipecat.services.deepgram.stt import DeepgramSTTService
@@ -40,14 +42,17 @@ transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -77,7 +82,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ]

        context = LLMContext(messages)
-        user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+        context_aggregator = LLMContextAggregatorPair(
            context,
            user_params=LLMUserAggregatorParams(
                user_turn_strategies=UserTurnStrategies(
@@ -85,19 +90,21 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
                        TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())
                    ]
                ),
-                vad_analyzer=SileroVADAnalyzer(),
            ),
        )

+        rtvi = RTVIProcessor()
+
        pipeline = Pipeline(
            [
                transport.input(),
+                rtvi,
                stt,
-                user_aggregator,
+                context_aggregator.user(),
                llm,
                tts,
                transport.output(),
-                assistant_aggregator,
+                context_aggregator.assistant(),
            ]
        )

@@ -108,6 +115,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
                enable_usage_metrics=True,
            ),
            observers=[
+                RTVIObserver(rtvi),
                DebugLogObserver(
                    frame_types={
                        TTSTextFrame: (BaseOutputTransport, FrameEndpoint.SOURCE),
--- a/examples/foundational/07ab-interruptible-inworld.py
+++ b/examples/foundational/07ab-interruptible-inworld.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame, TTSTextFrame
 from pipecat.observers.loggers.debug_log_observer import DebugLogObserver, FrameEndpoint
 from pipecat.pipeline.pipeline import Pipeline
@@ -21,6 +22,7 @@ from pipecat.processors.aggregators.llm_response_universal import (
    LLMContextAggregatorPair,
    LLMUserAggregatorParams,
 )
+from pipecat.processors.frameworks.rtvi import RTVIConfig, RTVIObserver, RTVIProcessor
 from pipecat.runner.types import RunnerArguments
 from pipecat.runner.utils import create_transport
 from pipecat.services.deepgram.stt import DeepgramSTTService
@@ -40,14 +42,17 @@ transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -74,25 +79,27 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

+    rtvi = RTVIProcessor(config=RTVIConfig(config=[]))
+
    pipeline = Pipeline(
        [
            transport.input(),
+            rtvi,
            stt,
-            user_aggregator,
+            context_aggregator.user(),
            llm,
            tts,
            transport.output(),
-            assistant_aggregator,
+            context_aggregator.assistant(),
        ]
    )

@@ -103,6 +110,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
            enable_usage_metrics=True,
        ),
        observers=[
+            RTVIObserver(rtvi),
            DebugLogObserver(
                frame_types={
                    TTSTextFrame: (BaseOutputTransport, FrameEndpoint.SOURCE),
--- a/examples/foundational/07ac-interruptible-asyncai-http.py
+++ b/examples/foundational/07ac-interruptible-asyncai-http.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -13,6 +13,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -36,20 +37,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -77,7 +82,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ]

        context = LLMContext(messages)
-        user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+        context_aggregator = LLMContextAggregatorPair(
            context,
            user_params=LLMUserAggregatorParams(
                user_turn_strategies=UserTurnStrategies(
@@ -85,7 +90,6 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
                        TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())
                    ]
                ),
-                vad_analyzer=SileroVADAnalyzer(),
            ),
        )

@@ -93,11 +97,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
            [
                transport.input(),  # Transport user input
                stt,
-                user_aggregator,  # User responses
+                context_aggregator.user(),  # User responses
                llm,  # LLM
                tts,  # TTS
                transport.output(),  # Transport bot output
-                assistant_aggregator,  # Assistant spoken responses
+                context_aggregator.assistant(),  # Assistant spoken responses
            ]
        )

--- a/examples/foundational/07ac-interruptible-asyncai.py
+++ b/examples/foundational/07ac-interruptible-asyncai.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -12,6 +12,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -35,20 +36,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -73,13 +78,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -87,11 +91,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07ad-interruptible-aicoustics.py
+++ b/examples/foundational/07ad-interruptible-aicoustics.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -38,38 +38,50 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies
 load_dotenv(override=True)


+# Create audio buffer processor so we can hear the audio fitler results.
+audiobuffer = AudioBufferProcessor(
+    num_channels=2,  # 1 for mono, 2 for stereo (user left, bot right)
+    enable_turn_audio=False,  # Enable per-turn audio recording
+)
+
+
 def _create_aic_filter() -> AICFilter:
    license_key = os.getenv("AICOUSTICS_LICENSE_KEY", "")

    return AICFilter(
        license_key=license_key,
-        model_id="quail-vf-l-16khz",
+        enhancement_level=0.5,
    )


-aic_filter = _create_aic_filter()
-aic_vad_analyzer = aic_filter.create_vad_analyzer(
-    speech_hold_duration=0.05, minimum_speech_duration=0.0, sensitivity=6.0
-)
-
-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
-    "daily": lambda: DailyParams(
-        audio_in_enabled=True,
-        audio_out_enabled=True,
-        audio_in_filter=aic_filter,
-    ),
-    "twilio": lambda: FastAPIWebsocketParams(
-        audio_in_enabled=True,
-        audio_out_enabled=True,
-        audio_in_filter=aic_filter,
-    ),
-    "webrtc": lambda: TransportParams(
-        audio_in_enabled=True,
-        audio_out_enabled=True,
-        audio_in_filter=aic_filter,
-    ),
+    "daily": lambda: (
+        lambda aic: DailyParams(
+            audio_in_enabled=True,
+            audio_out_enabled=True,
+            vad_analyzer=aic.create_vad_analyzer(lookback_buffer_size=6.0, sensitivity=6.0),
+            audio_in_filter=aic,
+        )
+    )(_create_aic_filter()),
+    "twilio": lambda: (
+        lambda aic: FastAPIWebsocketParams(
+            audio_in_enabled=True,
+            audio_out_enabled=True,
+            vad_analyzer=aic.create_vad_analyzer(lookback_buffer_size=6.0, sensitivity=6.0),
+            audio_in_filter=aic,
+        )
+    )(_create_aic_filter()),
+    "webrtc": lambda: (
+        lambda aic: TransportParams(
+            audio_in_enabled=True,
+            audio_out_enabled=True,
+            vad_analyzer=aic.create_vad_analyzer(lookback_buffer_size=6.0, sensitivity=6.0),
+            audio_in_filter=aic,
+        )
+    )(_create_aic_filter()),
 }


@@ -93,32 +105,25 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
-            vad_analyzer=aic_vad_analyzer,
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
        ),
    )

-    # Create audio buffer processor so we can hear the audio fitler results.
-    audiobuffer = AudioBufferProcessor(
-        num_channels=2,  # 1 for mono, 2 for stereo (user left, bot right)
-        enable_turn_audio=False,  # Enable per-turn audio recording
-    )
-
    pipeline = Pipeline(
        [
            transport.input(),  # Transport user input
            stt,  # STT
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
            audiobuffer,  # write audio data to a file
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07ae-interruptible-hume.py
+++ b/examples/foundational/07ae-interruptible-hume.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame, TTSTextFrame
 from pipecat.observers.loggers.debug_log_observer import DebugLogObserver, FrameEndpoint
 from pipecat.pipeline.pipeline import Pipeline
@@ -21,6 +22,7 @@ from pipecat.processors.aggregators.llm_response_universal import (
    LLMContextAggregatorPair,
    LLMUserAggregatorParams,
 )
+from pipecat.processors.frameworks.rtvi import RTVIConfig, RTVIObserver, RTVIProcessor
 from pipecat.runner.types import RunnerArguments
 from pipecat.runner.utils import create_transport
 from pipecat.services.deepgram.stt import DeepgramSTTService
@@ -36,20 +38,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -75,25 +81,27 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

+    rtvi = RTVIProcessor(config=RTVIConfig(config=[]))
+
    pipeline = Pipeline(
        [
            transport.input(),  # Transport user input
+            rtvi,
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS (HumeTTSService with word timestamps)
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

@@ -106,6 +114,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ),
        idle_timeout_secs=runner_args.pipeline_idle_timeout_secs,
        observers=[
+            RTVIObserver(rtvi),
            DebugLogObserver(
                frame_types={
                    TTSTextFrame: (BaseOutputTransport, FrameEndpoint.SOURCE),
@@ -114,6 +123,10 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        ],
    )

+    @rtvi.event_handler("on_client_ready")
+    async def on_client_ready(rtvi):
+        await rtvi.set_bot_ready()
+
    @transport.event_handler("on_client_connected")
    async def on_client_connected(transport, client):
        logger.info(f"Client connected")
--- a/examples/foundational/07af-interruptible-gradium.py
+++ b/examples/foundational/07af-interruptible-gradium.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -11,6 +11,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMRunFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -25,7 +26,6 @@ from pipecat.runner.utils import create_transport
 from pipecat.services.gradium.stt import GradiumSTTService
 from pipecat.services.gradium.tts import GradiumTTSService
 from pipecat.services.openai.llm import OpenAILLMService
-from pipecat.transcriptions.language import Language
 from pipecat.transports.base_transport import BaseTransport, TransportParams
 from pipecat.transports.daily.transport import DailyParams
 from pipecat.transports.websocket.fastapi import FastAPIWebsocketParams
@@ -34,20 +34,24 @@ from pipecat.turns.user_turn_strategies import UserTurnStrategies

 load_dotenv(override=True)

-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -55,18 +59,11 @@ transport_params = {
 async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    logger.info(f"Starting bot")

-    stt = GradiumSTTService(
-        api_key=os.getenv("GRADIUM_API_KEY"),
-        api_endpoint_base_url="wss://us.api.gradium.ai/api/speech/asr",
-        params=GradiumSTTService.InputParams(
-            language=Language.EN,
-        ),
-    )
+    stt = GradiumSTTService(api_key=os.getenv("GRADIUM_API_KEY"))

    tts = GradiumTTSService(
        api_key=os.getenv("GRADIUM_API_KEY"),
        voice_id="YTpq7expH9539ERJ",
-        url="wss://us.api.gradium.ai/api/speech/tts",
    )

    llm = OpenAILLMService(api_key=os.getenv("OPENAI_API_KEY"))
@@ -79,13 +76,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -93,11 +89,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07b-interruptible-langchain.py
+++ b/examples/foundational/07b-interruptible-langchain.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -17,6 +17,7 @@ from loguru import logger

 from pipecat.audio.turn.smart_turn.local_smart_turn_v3 import LocalSmartTurnAnalyzerV3
 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.audio.vad.vad_analyzer import VADParams
 from pipecat.frames.frames import LLMMessagesUpdateFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
@@ -49,20 +50,24 @@ def get_session_history(session_id: str) -> BaseChatMessageHistory:
    return message_store[session_id]


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "twilio": lambda: FastAPIWebsocketParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
    "webrtc": lambda: TransportParams(
        audio_in_enabled=True,
        audio_out_enabled=True,
+        vad_analyzer=SileroVADAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
 }

@@ -98,13 +103,12 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    lc = LangchainProcessor(history_chain)

    context = LLMContext()
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(
            user_turn_strategies=UserTurnStrategies(
                stop=[TurnAnalyzerUserTurnStopStrategy(turn_analyzer=LocalSmartTurnAnalyzerV3())]
            ),
-            vad_analyzer=SileroVADAnalyzer(),
        ),
    )

@@ -112,11 +116,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            lc,  # Langchain
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/examples/foundational/07c-interruptible-deepgram-flux.py
+++ b/examples/foundational/07c-interruptible-deepgram-flux.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024-2026, Daily
+# Copyright (c) 2024–2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -32,8 +32,9 @@ from pipecat.turns.user_turn_strategies import ExternalUserTurnStrategies
 load_dotenv(override=True)


-# We use lambdas to defer transport parameter creation until the transport
-# type is selected at runtime.
+# We store functions so objects (e.g. SileroVADAnalyzer) don't get
+# instantiated. The function will be called when the desired transport gets
+# selected.
 transport_params = {
    "daily": lambda: DailyParams(
        audio_in_enabled=True,
@@ -70,7 +71,7 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
    ]

    context = LLMContext(messages)
-    user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
+    context_aggregator = LLMContextAggregatorPair(
        context,
        user_params=LLMUserAggregatorParams(user_turn_strategies=ExternalUserTurnStrategies()),
    )
@@ -79,11 +80,11 @@ async def run_bot(transport: BaseTransport, runner_args: RunnerArguments):
        [
            transport.input(),  # Transport user input
            stt,  # STT
-            user_aggregator,  # User responses
+            context_aggregator.user(),  # User responses
            llm,  # LLM
            tts,  # TTS
            transport.output(),  # Transport bot output
-            assistant_aggregator,  # Assistant spoken responses
+            context_aggregator.assistant(),  # Assistant spoken responses
        ]
    )

--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
Aleix Conchillo Flaqué	ba86fc2f08	LLMUserAggregator: use queue_task_frame() to push user speaking frames	2025-12-30 15:18:38 -08:00
Aleix Conchillo Flaqué	d459465eb6	FrameProcessor: add queue_task_frame() and queue_task_frames()	2025-12-30 15:18:38 -08:00
Aleix Conchillo Flaqué	74aea65f17	PipelineTask: use QueueTaskFrame	2025-12-30 15:16:52 -08:00
Aleix Conchillo Flaqué	bd7b24596e	frames: add QueueTaskFrame	2025-12-30 15:16:52 -08:00
				`@@ -0,0 +1 @@`
				- ⚠️ `TransportParams.turn_analyzer` is deprecated and might result in unexpected behavior, use `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
				`@@ -0,0 +1 @@`
				- `FrameProcessor.interruption_strategies` is deprecated, use `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
				`@@ -0,0 +1 @@`
				- `EmulateUserStartedSpeakingFrame` and `EmulateUserStoppedSpeakingFrame` frames are deprecated.
				`@@ -0,0 +1 @@`
				- Deprecated the `emulated` field in the `UserStartedSpeakingFrame` and `UserStoppedSpeakingFrame` frames.
				`@@ -0,0 +1 @@`
				- The `LLMUserAggregatorParams` and `LLMAssistantAggregatorParams` classes in `pipecat.processors.aggregators.llm_response` are now deprecated. Use the new universal `LLMContext` and `LLMContextAggregatorPair` instead.
				`@@ -0,0 +1 @@`
				- `pipecat.audio.interruptions.MinWordsInterruptionStrategy` is deprecated. Use `pipecat.turns.user_start.MinWordsUserTurnStartStrategy` with `LLMUserAggregator`'s new `turn_start_strategies` parameter instead.
				`@@ -1 +0,0 @@`
				- Added `ResembleAITTSService` for text-to-speech using Resemble AI's streaming WebSocket API with word-level timestamps and jitter buffering for smooth audio playback.
				`@@ -0,0 +1 @@`
				- Added `RNNoiseFilter` for real-time noise suppression using RNNoise neural network via pyrnnoise library.
				`@@ -0,0 +1 @@`
				- `TranscriptionFrame` and `InterimTranscriptionFrame` produced by `DailyTransport` now include the transport source (i.e., the originating audio track).