example: realtime with transcripts

Merge pull request #1295 from pipecat-ai/aleix/pipelinetask-keyword-arguments
PipelineTask: force constructor keyword arguments
2025-02-26 16:29:07 +08:00 · 2025-02-25 19:00:58 -08:00 · 2025-02-25 18:58:47 -08:00 · 2025-02-25 18:57:52 -08:00 · 2025-02-25 18:56:41 -08:00 · 2025-02-25 18:55:45 -08:00
181 changed files with 13163 additions and 652 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -32,6 +32,21 @@ fly.toml

 # Example files
 pipecat/examples/twilio-chatbot/templates/streams.xml
+pipecat/examples/bot-ready-signalling/client/react-native/node_modules/
+pipecat/examples/bot-ready-signalling/client/react-native/.expo/
+pipecat/examples/bot-ready-signalling/client/react-native/dist/
+pipecat/examples/bot-ready-signalling/client/react-native/npm-debug.*
+pipecat/examples/bot-ready-signalling/client/react-native/*.jks
+pipecat/examples/bot-ready-signalling/client/react-native/*.p8
+pipecat/examples/bot-ready-signalling/client/react-native/*.p12
+pipecat/examples/bot-ready-signalling/client/react-native/*.key
+pipecat/examples/bot-ready-signalling/client/react-native/*.mobileprovision
+pipecat/examples/bot-ready-signalling/client/react-native/*.orig.*
+pipecat/examples/bot-ready-signalling/client/react-native/web-build/
+
+# macOS
+.DS_Store
+

 # Documentation
 docs/api/_build/
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -5,6 +5,125 @@ All notable changes to **Pipecat** will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

+## [Unreleased]
+
+### Added
+
+- Pipecat version will now be logged on every application startup. This will
+  help us identify what version we are running in case of any issues.
+
+- Added a new `StopFrame` which can be used to stop a pipeline task while
+  keeping the frame processors running. The frame processors could then be used
+  in a different pipeline. The difference between a `StopFrame` and a
+  `StopTaskFrame` is that, as with `EndFrame` and `EndTaskFrame`, the
+  `StopFrame` is pushed from the task and the `StopTaskFrame` is pushed upstream
+  inside the pipeline by any processor.
+
+- Added a new `PipelineTask` parameter `observers` that replaces the previous
+  `PipelineParams.observers`.
+
+- Added a new `PipelineTask` parameter `check_dangling_tasks` to enable or
+  disable checking for frame processors' dangling tasks when the Pipeline
+  finishes running.
+
+- Added new `on_completion_timeout` event for LLM services (all OpenAI-based
+  services, Anthropic and Google). Note that this event will only get triggered
+  if LLM timeouts are setup and if the timeout was reached. It can be useful to
+  retrigger another completion and see if the timeout was just a blip.
+
+- Added new log observers `LLMLogObserver` and `TranscriptionLogObserver` that
+  can be useful for debugging your pipelines.
+
+- Added `room_url` property to `DailyTransport`.
+
+- Added `addons` argument to `DeepgramSTTService`.
+
+- Added `exponential_backoff_time()` to `utils.network` module.
+
+### Changed
+
+- ⚠️ `PipelineTask` now requires keyword arguments (except for the first one for
+  the pipeline).
+
+- The base `TTSService` class now strips leading newlines before sending text
+  to the TTS provider. This change is to solve issues where some TTS providers,
+  like Azure, would not output text due to newlines.
+
+- `GrokLLMSService` now uses `grok-2` as the default model.
+
+- `AnthropicLLMService` now uses `claude-3-7-sonnet-20250219` as the default
+  model.
+
+- `RimeHttpTTSService` needs an `aiohttp.ClientSession` to be passed to the
+  constructor as all the other HTTP-based services.
+
+- `RimeHttpTTSService` doesn't use a default voice anymore.
+
+- `DeepgramSTTService` now uses the new `nova-3` model by default. If you want
+  to use the previous model you can pass `LiveOptions(model="nova-2-general")`.
+  (see https://deepgram.com/learn/introducing-nova-3-speech-to-text-api)
+
+```python
+stt = DeepgramSTTService(..., live_options=LiveOptions(model="nova-2-general"))
+```
+
+### Deprecated
+
+- `PipelineParams.observers` is now deprecated, you the new `PipelineTask`
+  parameter `observers`.
+
+### Removed
+
+- Remove `TransportParams.audio_out_is_live` since it was not being used at all.
+
+### Fixed
+
+- Fixed an `AudioContextWordTTSService` issue that would cause an `EndFrame` to
+  disconnect from the TTS service before audio from all the contexts was
+  received. This affected services like Cartesia and Rime.
+
+- Fixed an issue that was not allowing to pass an `OpenAILLMContext` to create
+  `GoogleLLMService`'s context aggregators.
+
+- Fixed a `ElevenLabsTTSService`, `FishAudioTTSService`, `LMNTTTSService` and
+  `PlayHTTTSService` issue that was resulting in audio requested before an
+  interruption being played after an interruption.
+
+- Fixed `match_endofsentence` support for ellipses.
+
+- Fixed an issue that would cause undesired interruptions via
+  `EmulateUserStartedSpeakingFrame` when only interim transcriptions (i.e. no
+  final transcriptions) where received.
+
+- Fixed an issue where `EndTaskFrame` was not triggering
+  `on_client_disconnected` or closing the WebSocket in FastAPI.
+
+- Fixed an issue in `DeepgramSTTService` where the `sample_rate` passed to the
+  `LiveOptions` was not being used, causing the service to use the default
+  sample rate of pipeline.
+
+- Fixed a context aggregator issue that would not append the LLM text response
+  to the context if a function call happened in the same LLM turn.
+
+- Fixed an issue that was causing HTTP TTS services to push `TTSStoppedFrame`
+  more than once.
+
+- Fixed a `FishAudioTTSService` issue where `TTSStoppedFrame` was not being
+  pushed.
+
+- Fixed an issue that `start_callback` was not invoked for some LLM services.
+
+- Fixed an issue that would cause `DeepgramSTTService` to stop working after an
+  error occurred (e.g. sudden network loss). If the network recovered we would
+  not reconnect.
+
+- Fixed a `STTMuteFilter` issue that would not mute user audio frames causing
+  transcriptions to be generated by the STT service.
+
+### Other
+
+- Added Gemini support to `examples/phone-chatbot`.
+
 ## [0.0.57] - 2025-02-14

 ### Added
--- a/dev-requirements.txt
+++ b/dev-requirements.txt
@@ -3,10 +3,10 @@ coverage~=7.6.12
 grpcio-tools~=1.67.1
 pip-tools~=7.4.1
 pre-commit~=4.0.1
-pyright~=1.1.393
+pyright~=1.1.394
 pytest~=8.3.4
-pytest-asyncio~=0.25.2
-ruff~=0.9.5
+pytest-asyncio~=0.25.3
+ruff~=0.9.7
 setuptools~=70.0.0
 setuptools_scm~=8.1.0
 python-dotenv~=1.0.1
--- a/dot-env.template
+++ b/dot-env.template
@@ -18,6 +18,9 @@ AZURE_DALLE_API_KEY=...
 AZURE_DALLE_ENDPOINT=https://...
 AZURE_DALLE_MODEL=...

+# Cartesia
+CARTESIA_API_KEY=...
+
 # Daily
 DAILY_API_KEY=...
 DAILY_SAMPLE_ROOM_URL=https://...
--- a/examples/bot-ready-signalling/client/react-native/.nvmrc
+++ b/examples/bot-ready-signalling/client/react-native/.nvmrc
@@ -0,0 +1 @@
+22.14
--- a/examples/bot-ready-signalling/client/react-native/README.md
+++ b/examples/bot-ready-signalling/client/react-native/README.md
@@ -0,0 +1,60 @@
+# React Native Implementation
+
+Basic implementation using the [Pipecat React Native SDK](https://docs.pipecat.ai/client/react-native/introduction).
+
+## Usage
+
+### Expo requirements
+
+This project cannot be used with an [Expo Go](https://docs.expo.dev/workflow/expo-go/) app because [it requires custom native code](https://docs.expo.io/workflow/customizing/).
+
+When a project requires custom native code or a config plugin, we need to transition from using [Expo Go](https://docs.expo.dev/workflow/expo-go/) 
+to a [development build](https://docs.expo.dev/development/introduction/).
+
+More details about the custom native code used by this demo can be found in [rn-daily-js-expo-config-plugin](https://github.com/daily-co/rn-daily-js-expo-config-plugin).
+
+### Building remotely
+
+If you do not have experience with Xcode and Android Studio builds or do not have them installed locally on your computer, you will need to follow [this guide from Expo to use EAS Build](https://docs.expo.dev/development/create-development-builds/#create-and-install-eas-build).
+
+### Building locally
+
+You will need to have installed locally on your computer:
+- [Xcode](https://developer.apple.com/xcode/) to build for iOS;
+- [Android Studio](https://developer.android.com/studio) to build for Android;
+
+#### Install the demo dependencies
+
+```bash
+# Use the version of node specified in .nvmrc
+nvm i
+
+# Install dependencies
+npm i
+
+# Before a native app can be compiled, the native source code must be generated.
+npx expo prebuild
+
+# Configure the environment variable to connect to the local server
+cp env.example .env
+# edit .env and add your local ip address, for example: http://192.168.1.16:7860
+```
+
+#### Running on Android
+
+After plugging in an Android device [configured for debugging](https://developer.android.com/studio/debug/dev-options), run the following command:
+
+```
+npm run android
+```
+
+#### Running on iOS
+
+Run the following command:
+
+```
+npm run ios
+```
+
+#### Connect to the server
+Use the http://localhost:5173 in your app.
--- a/examples/bot-ready-signalling/client/react-native/app.json
+++ b/examples/bot-ready-signalling/client/react-native/app.json
@@ -0,0 +1,75 @@
+{
+  "expo": {
+    "name": "bot-ready-rn",
+    "slug": "bot-ready-rn",
+    "version": "1.0.0",
+    "orientation": "portrait",
+    "icon": "./assets/icon.png",
+    "userInterfaceStyle": "light",
+    "splash": {
+      "image": "./assets/splash.png",
+      "resizeMode": "contain",
+      "backgroundColor": "#ffffff"
+    },
+    "updates": {
+      "fallbackToCacheTimeout": 0
+    },
+    "assetBundlePatterns": [
+      "**/*"
+    ],
+    "ios": {
+      "supportsTablet": true,
+      "bitcode": false,
+      "bundleIdentifier": "co.daily.expo.BotReady",
+      "infoPlist": {
+        "UIBackgroundModes": [
+          "voip"
+        ]
+      },
+      "appleTeamId": "EEBGKV9N3N"
+    },
+    "android": {
+      "adaptiveIcon": {
+        "foregroundImage": "./assets/adaptive-icon.png",
+        "backgroundColor": "#FFFFFF"
+      },
+      "package": "co.daily.expo.BotReady",
+      "permissions": [
+        "android.permission.ACCESS_NETWORK_STATE",
+        "android.permission.BLUETOOTH",
+        "android.permission.CAMERA",
+        "android.permission.INTERNET",
+        "android.permission.MODIFY_AUDIO_SETTINGS",
+        "android.permission.RECORD_AUDIO",
+        "android.permission.SYSTEM_ALERT_WINDOW",
+        "android.permission.WAKE_LOCK",
+        "android.permission.FOREGROUND_SERVICE",
+        "android.permission.FOREGROUND_SERVICE_CAMERA",
+        "android.permission.FOREGROUND_SERVICE_MICROPHONE",
+        "android.permission.FOREGROUND_SERVICE_MEDIA_PROJECTION",
+        "android.permission.POST_NOTIFICATIONS"
+      ]
+    },
+    "web": {
+      "favicon": "./assets/favicon.png"
+    },
+    "plugins": [
+      "@config-plugins/react-native-webrtc",
+      "@daily-co/config-plugin-rn-daily-js",
+      [
+        "expo-build-properties",
+        {
+          "android": {
+            "minSdkVersion": 24,
+            "compileSdkVersion": 35,
+            "targetSdkVersion": 34,
+            "buildToolsVersion": "35.0.0"
+          },
+          "ios": {
+            "deploymentTarget": "15.1"
+          }
+        }
+      ]
+    ]
+  }
+}
--- a/examples/bot-ready-signalling/client/react-native/assets/adaptive-icon.png
+++ b/examples/bot-ready-signalling/client/react-native/assets/adaptive-icon.png
--- a/examples/bot-ready-signalling/client/react-native/assets/favicon.png
+++ b/examples/bot-ready-signalling/client/react-native/assets/favicon.png
--- a/examples/bot-ready-signalling/client/react-native/assets/icon.png
+++ b/examples/bot-ready-signalling/client/react-native/assets/icon.png
--- a/examples/bot-ready-signalling/client/react-native/assets/splash.png
+++ b/examples/bot-ready-signalling/client/react-native/assets/splash.png
--- a/examples/bot-ready-signalling/client/react-native/babel.config.js
+++ b/examples/bot-ready-signalling/client/react-native/babel.config.js
@@ -0,0 +1,7 @@
+module.exports = function(api) {
+  api.cache(true);
+  return {
+    presets: ['babel-preset-expo'],
+    plugins: [["module:react-native-dotenv"]],
+  };
+};
--- a/examples/bot-ready-signalling/client/react-native/env.example
+++ b/examples/bot-ready-signalling/client/react-native/env.example
@@ -0,0 +1 @@
+API_BASE_URL=http://YOUR_LOCAL_IP:7860
--- a/examples/bot-ready-signalling/client/react-native/index.js
+++ b/examples/bot-ready-signalling/client/react-native/index.js
@@ -0,0 +1,7 @@
+import { registerRootComponent } from "expo";
+
+import App from "./src/App";
+
+// registerRootComponent calls AppRegistry.registerComponent('main', () => App);
+// It also ensures that the environment is set up appropriately
+registerRootComponent(App);
--- a/examples/bot-ready-signalling/client/react-native/metro.config.js
+++ b/examples/bot-ready-signalling/client/react-native/metro.config.js
@@ -0,0 +1,4 @@
+// Learn more https://docs.expo.io/guides/customizing-metro
+const { getDefaultConfig } = require('expo/metro-config');
+
+module.exports = getDefaultConfig(__dirname);
--- a/examples/bot-ready-signalling/client/react-native/package-lock.json
+++ b/examples/bot-ready-signalling/client/react-native/package-lock.json
--- a/examples/bot-ready-signalling/client/react-native/package.json
+++ b/examples/bot-ready-signalling/client/react-native/package.json
@@ -0,0 +1,31 @@
+{
+  "name": "bot-ready-rn",
+  "version": "1.0.0",
+  "scripts": {
+    "start": "expo start --dev-client",
+    "android": "expo run:android --device",
+    "ios": "expo run:ios --device",
+    "web": "expo start --web"
+  },
+  "dependencies": {
+    "@config-plugins/react-native-webrtc": "^10.0.0",
+    "@daily-co/config-plugin-rn-daily-js": "0.0.7",
+    "@daily-co/react-native-daily-js": "^0.70.0",
+    "@daily-co/react-native-webrtc": "^118.0.3-daily.2",
+    "@react-native-async-storage/async-storage": "1.23.1",
+    "expo": "^52.0.0",
+    "expo-build-properties": "~0.13.1",
+    "expo-dev-client": "~5.0.5",
+    "expo-splash-screen": "~0.29.16",
+    "expo-status-bar": "~2.0.0",
+    "react": "18.3.1",
+    "react-native": "0.76.3",
+    "react-native-background-timer": "^2.4.1",
+    "react-native-dotenv": "^3.4.11",
+    "react-native-get-random-values": "^1.11.0"
+  },
+  "devDependencies": {
+    "@babel/core": "^7.12.9"
+  },
+  "private": true
+}
--- a/examples/bot-ready-signalling/client/react-native/src/App.js
+++ b/examples/bot-ready-signalling/client/react-native/src/App.js
@@ -0,0 +1,121 @@
+import React, { useState, useEffect } from 'react';
+import {SafeAreaView, View, Text, Button, StyleSheet, ScrollView} from 'react-native';
+import Daily from "@daily-co/react-native-daily-js";
+import { API_BASE_URL } from "@env";
+
+const CallScreen = () => {
+  const [connectionStatus, setConnectionStatus] = useState('Disconnected');
+  const [isConnected, setIsConnected] = useState(false);
+  const [callObject, setCallObject] = useState(null);
+  const [logs, setLogs] = useState([]);
+
+  useEffect(() => {
+    if (callObject) {
+      setupTrackListeners(callObject);
+    }
+  }, [callObject]);
+
+  const log = (message) => {
+    setLogs((prevLogs) => [...prevLogs, `${new Date().toISOString()} - ${message}`]);
+    console.log(message);
+  };
+
+  const setupTrackListeners = (callObject) => {
+    callObject.on("joined-meeting", () => {
+      setConnectionStatus('Connected');
+      setIsConnected(true);
+      log('Client connected');
+    });
+    callObject.on("left-meeting", () => {
+      setConnectionStatus('Disconnected');
+      setIsConnected(false);
+      log('Client disconnected');
+    });
+    callObject.on("participant-left", () => {
+      // When the bot leaves, we are also disconnecting from the call
+      disconnect().catch((err) => {
+        log(`Failed to disconnect ${err}`);
+      })
+    });
+    // Trigger so the bot can start sending audio
+    callObject.on("track-started", (evt) => {
+      if (evt.track.kind === "audio" && evt.participant.local === false) {
+        handleEventToConsole(evt)
+        log("Sending the message that will trigger the bot to play the audio.")
+        callObject.sendAppMessage("playable")
+      }
+    });
+    callObject.on("error", (evt) => log(`Error: ${evt.error}`));
+    // Other events just for awareness
+    callObject.on("track-stopped", handleEventToConsole);
+    callObject.on("participant-joined", handleEventToConsole);
+    callObject.on("participant-updated", handleEventToConsole);
+  };
+
+  const handleEventToConsole = (evt) => {
+    log(`Received event: ${evt.action}`);
+  };
+
+  const connect = async () => {
+    try {
+      const callObject = Daily.createCallObject({ subscribeToTracksAutomatically: true });
+      setCallObject(callObject);
+      const connectionUrl = `${API_BASE_URL}/connect`
+      const res = await fetch(connectionUrl, { method: "POST", headers: { "Content-Type": "application/json" } });
+      const roomInfo = await res.json();
+      await callObject.join({ url: roomInfo.room_url });
+    } catch (error) {
+      log(`Error connecting: ${error.message}`);
+    }
+  };
+
+  const disconnect = async () => {
+    if (callObject) {
+      try {
+        await callObject.leave();
+        await callObject.destroy();
+        setCallObject(null);
+      } catch (error) {
+        log(`Error disconnecting: ${error.message}`);
+      }
+    }
+  };
+
+  return (
+      <SafeAreaView style={styles.safeArea}>
+        <View style={styles.container}>
+          <View style={styles.statusBar}>
+            <Text>Status: <Text style={styles.status}>{connectionStatus}</Text></Text>
+            <View style={styles.controls}>
+              <Button
+                title={isConnected ? "Disconnect" : "Connect"}
+                onPress={isConnected ? disconnect : connect}
+              />
+            </View>
+          </View>
+
+          <View style={styles.debugPanel}>
+            <Text style={styles.debugTitle}>Debug Info</Text>
+            <ScrollView style={styles.debugLog}>
+              {logs.map((logEntry, index) => (
+                  <Text key={index} style={styles.logText}>{logEntry}</Text>
+              ))}
+            </ScrollView>
+          </View>
+        </View>
+      </SafeAreaView>
+  );
+};
+
+const styles = StyleSheet.create({
+  safeArea: { flex: 1, backgroundColor: '#f0f0f0', padding: 20 },
+  container: { flex: 1, margin: 20 },
+  statusBar: { flexDirection: 'row', justifyContent: 'space-between', alignItems: 'center', padding: 10, backgroundColor: '#fff', borderRadius: 8, marginBottom: 20 },
+  status: { fontWeight: 'bold' },
+  controls: { flexDirection: 'row', gap: 10 },
+  debugPanel: { height: '80%', backgroundColor: '#fff', borderRadius: 8, padding: 20},
+  debugTitle: { fontSize: 16, fontWeight: 'bold' },
+  debugLog: { height: '100%', overflow: 'scroll', backgroundColor: '#f8f8f8', padding: 10, borderRadius: 4, fontFamily: 'monospace', fontSize: 12, lineHeight: 1.4 },
+});
+
+export default CallScreen;
--- a/examples/bot-ready-signalling/server/signalling_bot.py
+++ b/examples/bot-ready-signalling/server/signalling_bot.py
@@ -17,7 +17,7 @@ from runner import configure
 from pipecat.frames.frames import AudioRawFrame, EndFrame, OutputAudioRawFrame, TTSSpeakFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
-from pipecat.pipeline.task import PipelineParams, PipelineTask
+from pipecat.pipeline.task import PipelineTask
 from pipecat.services.cartesia import CartesiaTTSService
 from pipecat.transports.services.daily import DailyParams, DailyTransport

--- a/examples/canonical-metrics/bot.py
+++ b/examples/canonical-metrics/bot.py
@@ -119,7 +119,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/chatbot-audio-recording/bot.py
+++ b/examples/chatbot-audio-recording/bot.py
@@ -124,7 +124,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @audiobuffer.event_handler("on_audio_data")
        async def on_audio_data(buffer, audio, sample_rate, num_channels):
--- a/examples/deployment/flyio-example/bot.py
+++ b/examples/deployment/flyio-example/bot.py
@@ -70,7 +70,7 @@ async def main(room_url: str, token: str):
        ]
    )

-    task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+    task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

    @transport.event_handler("on_first_participant_joined")
    async def on_first_participant_joined(transport, participant):
--- a/examples/deployment/modal-example/bot.py
+++ b/examples/deployment/modal-example/bot.py
@@ -62,7 +62,7 @@ async def main(room_url: str, token: str):

    task = PipelineTask(
        pipeline,
-        PipelineParams(
+        params=PipelineParams(
            allow_interruptions=True,
            enable_metrics=True,
            enable_usage_metrics=True,
--- a/examples/foundational/03a-local-still-frame.py
+++ b/examples/foundational/03a-local-still-frame.py
@@ -18,8 +18,7 @@ from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineTask
 from pipecat.services.fal import FalImageGenService
-from pipecat.transports.base_transport import TransportParams
-from pipecat.transports.local.tk import TkLocalTransport
+from pipecat.transports.local.tk import TkLocalTransport, TkTransportParams

 load_dotenv(override=True)

@@ -34,7 +33,9 @@ async def main():

        transport = TkLocalTransport(
            tk_root,
-            TransportParams(camera_out_enabled=True, camera_out_width=1024, camera_out_height=1024),
+            TkTransportParams(
+                camera_out_enabled=True, camera_out_width=1024, camera_out_height=1024
+            ),
        )

        imagegen = FalImageGenService(
--- a/examples/foundational/03b-still-frame-imagen.py
+++ b/examples/foundational/03b-still-frame-imagen.py
@@ -44,7 +44,8 @@ async def main():
        runner = PipelineRunner()

        task = PipelineTask(
-            Pipeline([imagegen, transport.output()]), PipelineParams(enable_metrics=True)
+            Pipeline([imagegen, transport.output()]),
+            params=PipelineParams(enable_metrics=True),
        )

        @transport.event_handler("on_first_participant_joined")
--- a/examples/foundational/05a-local-sync-speech-and-image.py
+++ b/examples/foundational/05a-local-sync-speech-and-image.py
@@ -30,8 +30,7 @@ from pipecat.processors.frame_processor import FrameDirection, FrameProcessor
 from pipecat.services.cartesia import CartesiaHttpTTSService
 from pipecat.services.fal import FalImageGenService
 from pipecat.services.openai import OpenAILLMService
-from pipecat.transports.base_transport import TransportParams
-from pipecat.transports.local.tk import TkLocalTransport, TkOutputTransport
+from pipecat.transports.local.tk import TkLocalTransport, TkTransportParams

 load_dotenv(override=True)

@@ -152,7 +151,7 @@ async def main():

        transport = TkLocalTransport(
            tk_root,
-            TransportParams(
+            TkTransportParams(
                audio_out_enabled=True,
                camera_out_enabled=True,
                camera_out_width=1024,
--- a/examples/foundational/06-listen-and-respond.py
+++ b/examples/foundational/06-listen-and-respond.py
@@ -105,7 +105,10 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(enable_metrics=True, enable_usage_metrics=True),
+            params=PipelineParams(
+                enable_metrics=True,
+                enable_usage_metrics=True,
+            ),
        )

        @transport.event_handler("on_first_participant_joined")
--- a/examples/foundational/06a-image-sync.py
+++ b/examples/foundational/06a-image-sync.py
@@ -127,7 +127,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07-interruptible-vad.py
+++ b/examples/foundational/07-interruptible-vad.py
@@ -76,7 +76,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07-interruptible.py
+++ b/examples/foundational/07-interruptible.py
@@ -74,7 +74,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07a-interruptible-anthropic.py
+++ b/examples/foundational/07a-interruptible-anthropic.py
@@ -79,7 +79,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07b-interruptible-langchain.py
+++ b/examples/foundational/07b-interruptible-langchain.py
@@ -103,7 +103,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07c-interruptible-deepgram-vad.py
+++ b/examples/foundational/07c-interruptible-deepgram-vad.py
@@ -81,7 +81,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07c-interruptible-deepgram.py
+++ b/examples/foundational/07c-interruptible-deepgram.py
@@ -74,7 +74,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07d-interruptible-elevenlabs.py
+++ b/examples/foundational/07d-interruptible-elevenlabs.py
@@ -74,7 +74,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07e-interruptible-playht-http.py
+++ b/examples/foundational/07e-interruptible-playht-http.py
@@ -75,7 +75,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07e-interruptible-playht.py
+++ b/examples/foundational/07e-interruptible-playht.py
@@ -77,7 +77,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07f-interruptible-azure.py
+++ b/examples/foundational/07f-interruptible-azure.py
@@ -83,7 +83,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07g-interruptible-openai.py
+++ b/examples/foundational/07g-interruptible-openai.py
@@ -81,7 +81,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07h-interruptible-openpipe.py
+++ b/examples/foundational/07h-interruptible-openpipe.py
@@ -81,7 +81,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07i-interruptible-xtts.py
+++ b/examples/foundational/07i-interruptible-xtts.py
@@ -75,7 +75,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07j-interruptible-gladia.py
+++ b/examples/foundational/07j-interruptible-gladia.py
@@ -80,7 +80,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07k-interruptible-lmnt.py
+++ b/examples/foundational/07k-interruptible-lmnt.py
@@ -71,7 +71,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07l-interruptible-together.py
+++ b/examples/foundational/07l-interruptible-together.py
@@ -88,7 +88,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07m-interruptible-polly.py
+++ b/examples/foundational/07m-interruptible-polly.py
@@ -81,7 +81,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07n-interruptible-google.py
+++ b/examples/foundational/07n-interruptible-google.py
@@ -79,7 +79,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07o-interruptible-assemblyai.py
+++ b/examples/foundational/07o-interruptible-assemblyai.py
@@ -80,7 +80,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07p-interruptible-krisp.py
+++ b/examples/foundational/07p-interruptible-krisp.py
@@ -76,7 +76,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07q-interruptible-rime.py
+++ b/examples/foundational/07q-interruptible-rime.py
@@ -74,7 +74,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07r-interruptible-riva-nim.py
+++ b/examples/foundational/07r-interruptible-riva-nim.py
@@ -74,7 +74,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/07s-interruptible-google-audio-in.py
+++ b/examples/foundational/07s-interruptible-google-audio-in.py
@@ -251,7 +251,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/07t-interruptible-fish.py
+++ b/examples/foundational/07t-interruptible-fish.py
@@ -74,7 +74,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/09-mirror.py
+++ b/examples/foundational/09-mirror.py
@@ -78,7 +78,11 @@ async def main():
        runner = PipelineRunner()

        task = PipelineTask(
-            pipeline, PipelineParams(audio_in_sample_rate=24000, audio_out_sample_rate=24000)
+            pipeline,
+            params=PipelineParams(
+                audio_in_sample_rate=24000,
+                audio_out_sample_rate=24000,
+            ),
        )

        await runner.run(task)
--- a/examples/foundational/09a-local-mirror.py
+++ b/examples/foundational/09a-local-mirror.py
@@ -24,8 +24,7 @@ from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineParams, PipelineTask
 from pipecat.processors.frame_processor import FrameDirection, FrameProcessor
-from pipecat.transports.base_transport import TransportParams
-from pipecat.transports.local.tk import TkLocalTransport
+from pipecat.transports.local.tk import TkLocalTransport, TkTransportParams
 from pipecat.transports.services.daily import DailyParams, DailyTransport

 load_dotenv(override=True)
@@ -67,7 +66,7 @@ async def main():

        tk_transport = TkLocalTransport(
            tk_root,
-            TransportParams(
+            TkTransportParams(
                audio_out_enabled=True,
                camera_out_enabled=True,
                camera_out_is_live=True,
@@ -83,7 +82,11 @@ async def main():
        pipeline = Pipeline([daily_transport.input(), MirrorProcessor(), tk_transport.output()])

        task = PipelineTask(
-            pipeline, PipelineParams(audio_in_sample_rate=24000, audio_out_sample_rate=24000)
+            pipeline,
+            params=PipelineParams(
+                audio_in_sample_rate=24000,
+                audio_out_sample_rate=24000,
+            ),
        )

        async def run_tk():
--- a/examples/foundational/10-wake-phrase.py
+++ b/examples/foundational/10-wake-phrase.py
@@ -76,7 +76,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/14-function-calling.py
+++ b/examples/foundational/14-function-calling.py
@@ -112,7 +112,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14a-function-calling-anthropic.py
+++ b/examples/foundational/14a-function-calling-anthropic.py
@@ -99,7 +99,13 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True, enable_metrics=True))
+        task = PipelineTask(
+            pipeline,
+            params=PipelineParams(
+                allow_interruptions=True,
+                enable_metrics=True,
+            ),
+        )

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/14b-function-calling-anthropic-video.py
+++ b/examples/foundational/14b-function-calling-anthropic-video.py
@@ -153,7 +153,13 @@ If you need to use a tool, simply use the tool. Do not tell the user the tool yo
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True, enable_metrics=True))
+        task = PipelineTask(
+            pipeline,
+            params=PipelineParams(
+                allow_interruptions=True,
+                enable_metrics=True,
+            ),
+        )

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/14e-function-calling-gemini.py
+++ b/examples/foundational/14e-function-calling-gemini.py
@@ -14,6 +14,7 @@ from loguru import logger
 from runner import configure

 from pipecat.audio.vad.silero import SileroVADAnalyzer
+from pipecat.frames.frames import TTSSpeakFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineParams, PipelineTask
@@ -30,6 +31,12 @@ logger.add(sys.stderr, level="DEBUG")
 video_participant_id = None


+async def start_fetch_weather(function_name, llm, context):
+    """Push a frame to the LLM; this is handy when the LLM response might take a while."""
+    await llm.push_frame(TTSSpeakFrame("Let me check on that."))
+    logger.debug(f"Starting fetch_weather_from_api with function_name: {function_name}")
+
+
 async def get_weather(function_name, tool_call_id, arguments, llm, context, result_callback):
    location = arguments["location"]
    await result_callback(f"The weather in {location} is currently 72 degrees and sunny.")
@@ -63,7 +70,7 @@ async def main():
        )

        llm = GoogleLLMService(api_key=os.getenv("GOOGLE_API_KEY"), model="gemini-2.0-flash-001")
-        llm.register_function("get_weather", get_weather)
+        llm.register_function("get_weather", get_weather, start_fetch_weather)
        llm.register_function("get_image", get_image)

        tools = [
@@ -145,7 +152,7 @@ indicate you should use the get_image tool are:

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14f-function-calling-groq.py
+++ b/examples/foundational/14f-function-calling-groq.py
@@ -116,7 +116,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14g-function-calling-grok.py
+++ b/examples/foundational/14g-function-calling-grok.py
@@ -113,7 +113,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14h-function-calling-azure.py
+++ b/examples/foundational/14h-function-calling-azure.py
@@ -117,7 +117,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14i-function-calling-fireworks.py
+++ b/examples/foundational/14i-function-calling-fireworks.py
@@ -116,7 +116,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14j-function-calling-nim.py
+++ b/examples/foundational/14j-function-calling-nim.py
@@ -116,7 +116,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14k-function-calling-cerebras.py
+++ b/examples/foundational/14k-function-calling-cerebras.py
@@ -123,7 +123,7 @@ Start by asking me for my location. Then, use 'get_weather_current' to give me a

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14l-function-calling-deepseek.py
+++ b/examples/foundational/14l-function-calling-deepseek.py
@@ -123,7 +123,7 @@ Start by asking me for my location. Then, use 'get_weather_current' to give me a

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14m-function-calling-openrouter.py
+++ b/examples/foundational/14m-function-calling-openrouter.py
@@ -117,7 +117,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/14n-function-calling-perplexity.py
+++ b/examples/foundational/14n-function-calling-perplexity.py
@@ -83,7 +83,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/15-switch-voices.py
+++ b/examples/foundational/15-switch-voices.py
@@ -133,7 +133,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/15a-switch-languages.py
+++ b/examples/foundational/15a-switch-languages.py
@@ -126,7 +126,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/16-gpu-container-local-bot.py
+++ b/examples/foundational/16-gpu-container-local-bot.py
@@ -85,7 +85,13 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True, enable_metrics=True))
+        task = PipelineTask(
+            pipeline,
+            params=PipelineParams(
+                allow_interruptions=True,
+                enable_metrics=True,
+            ),
+        )

        # When a participant joins, start transcription for that participant so the
        # bot can "hear" and respond to them.
--- a/examples/foundational/17-detect-user-idle.py
+++ b/examples/foundational/17-detect-user-idle.py
@@ -108,7 +108,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                report_only_initial_ttfb=True,
--- a/examples/foundational/18-gstreamer-filesrc.py
+++ b/examples/foundational/18-gstreamer-filesrc.py
@@ -38,7 +38,6 @@ async def main():
            "GStreamer",
            DailyParams(
                audio_out_enabled=True,
-                audio_out_is_live=True,
                camera_out_enabled=True,
                camera_out_width=1280,
                camera_out_height=720,
--- a/examples/foundational/19-openai-realtime-beta.py
+++ b/examples/foundational/19-openai-realtime-beta.py
@@ -16,10 +16,13 @@ from runner import configure

 from pipecat.audio.vad.silero import SileroVADAnalyzer
 from pipecat.audio.vad.vad_analyzer import VADParams
+from pipecat.frames.frames import TranscriptionMessage, TranscriptionUpdateFrame
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineParams, PipelineTask
 from pipecat.processors.aggregators.openai_llm_context import OpenAILLMContext
+from pipecat.processors.transcript_processor import TranscriptProcessor
+from pipecat.services.deepgram import DeepgramSTTService
 from pipecat.services.openai_realtime_beta import (
    InputAudioTranscription,
    OpenAIRealtimeBetaLLMService,
@@ -140,21 +143,29 @@ Remember, your responses should be short. Just one or two sentences, usually."""
            tools,
        )

+        stt = DeepgramSTTService(api_key=os.getenv("DEEPGRAM_API_KEY"))
+
+        # Create transcript processor and handler
+        transcript = TranscriptProcessor()
+
        context_aggregator = llm.create_context_aggregator(context)

        pipeline = Pipeline(
            [
                transport.input(),  # Transport user input
+                stt,
+                transcript.user(),  # User transcripts
                context_aggregator.user(),
                llm,  # LLM
                context_aggregator.assistant(),
+                transcript.assistant(),  # Assistant transcripts
                transport.output(),  # Transport bot output
            ]
        )

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
@@ -162,9 +173,16 @@ Remember, your responses should be short. Just one or two sentences, usually."""
            ),
        )

+        # Register event handler for transcript updates
+        @transcript.event_handler("on_transcript_update")
+        async def on_transcript_update(processor, frame):
+            logger.debug(f"Received transcript update with {len(frame.messages)} new messages")
+            for msg in frame.messages:
+                logger.debug(msg)
+
        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
-            await transport.capture_participant_transcription(participant["id"])
+            # await transport.capture_participant_transcription(participant["id"])
            # Kick off the conversation.
            await task.queue_frames([context_aggregator.user().get_context_frame()])

--- a/examples/foundational/20a-persistent-context-openai.py
+++ b/examples/foundational/20a-persistent-context-openai.py
@@ -212,7 +212,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/20b-persistent-context-openai-realtime.py
+++ b/examples/foundational/20b-persistent-context-openai-realtime.py
@@ -237,7 +237,7 @@ Remember, your responses should be short. Just one or two sentences, usually."""

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/20c-persistent-context-anthropic.py
+++ b/examples/foundational/20c-persistent-context-anthropic.py
@@ -209,7 +209,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/20d-persistent-context-gemini.py
+++ b/examples/foundational/20d-persistent-context-gemini.py
@@ -263,7 +263,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/21-tavus-layer.py
+++ b/examples/foundational/21-tavus-layer.py
@@ -87,7 +87,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                # We just use 16000 because that's what Tavus is expecting and
                # we avoid resampling.
                audio_in_sample_rate=16000,
--- a/examples/foundational/22-natural-conversation.py
+++ b/examples/foundational/22-natural-conversation.py
@@ -145,7 +145,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/22b-natural-conversation-proposal.py
+++ b/examples/foundational/22b-natural-conversation-proposal.py
@@ -138,6 +138,7 @@ class OutputGate(FrameProcessor):
        self._gate_open = start_open
        self._frames_buffer = []
        self._notifier = notifier
+        self._gate_task = None

    def close_gate(self):
        self._gate_open = False
@@ -178,10 +179,13 @@ class OutputGate(FrameProcessor):

    async def _start(self):
        self._frames_buffer = []
-        self._gate_task = self.create_task(self._gate_task_handler())
+        if not self._gate_task:
+            self._gate_task = self.create_task(self._gate_task_handler())

    async def _stop(self):
-        await self.cancel_task(self._gate_task)
+        if self._gate_task:
+            await self.cancel_task(self._gate_task)
+            self._gate_task = None

    async def _gate_task_handler(self):
        while True:
@@ -351,7 +355,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/22c-natural-conversation-mixed-llms.py
+++ b/examples/foundational/22c-natural-conversation-mixed-llms.py
@@ -342,6 +342,7 @@ class OutputGate(FrameProcessor):
        self._gate_open = start_open
        self._frames_buffer = []
        self._notifier = notifier
+        self._gate_task = None

    def close_gate(self):
        self._gate_open = False
@@ -382,10 +383,13 @@ class OutputGate(FrameProcessor):

    async def _start(self):
        self._frames_buffer = []
-        self._gate_task = self.create_task(self._gate_task_handler())
+        if not self._gate_task:
+            self._gate_task = self.create_task(self._gate_task_handler())

    async def _stop(self):
-        await self.cancel_task(self._gate_task)
+        if self._gate_task:
+            await self.cancel_task(self._gate_task)
+            self._gate_task = None

    async def _gate_task_handler(self):
        while True:
@@ -560,7 +564,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/22d-natural-conversation-gemini-audio.py
+++ b/examples/foundational/22d-natural-conversation-gemini-audio.py
@@ -25,10 +25,8 @@ from pipecat.frames.frames import (
    InputAudioRawFrame,
    LLMFullResponseEndFrame,
    LLMFullResponseStartFrame,
-    LLMMessagesFrame,
    StartFrame,
    StartInterruptionFrame,
-    StopInterruptionFrame,
    SystemFrame,
    TextFrame,
    TranscriptionFrame,
@@ -555,6 +553,7 @@ class OutputGate(FrameProcessor):
        self._notifier = notifier
        self._context = context
        self._transcription_buffer = user_transcription_buffer
+        self._gate_task = None

    def close_gate(self):
        self._gate_open = False
@@ -602,10 +601,13 @@ class OutputGate(FrameProcessor):

    async def _start(self):
        self._frames_buffer = []
-        self._gate_task = self.create_task(self._gate_task_handler())
+        if not self._gate_task:
+            self._gate_task = self.create_task(self._gate_task_handler())

    async def _stop(self):
-        await self.cancel_task(self._gate_task)
+        if self._gate_task:
+            await self.cancel_task(self._gate_task)
+            self._gate_task = None

    async def _gate_task_handler(self):
        while True:
@@ -740,7 +742,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/23-bot-background-sound.py
+++ b/examples/foundational/23-bot-background-sound.py
@@ -87,7 +87,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/24-stt-mute-filter.py
+++ b/examples/foundational/24-stt-mute-filter.py
@@ -122,7 +122,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/25-google-audio-in.py
+++ b/examples/foundational/25-google-audio-in.py
@@ -354,7 +354,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26-gemini-multimodal-live.py
+++ b/examples/foundational/26-gemini-multimodal-live.py
@@ -63,7 +63,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26a-gemini-multimodal-live-transcription.py
+++ b/examples/foundational/26a-gemini-multimodal-live-transcription.py
@@ -89,7 +89,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26b-gemini-multimodal-live-function-calling.py
+++ b/examples/foundational/26b-gemini-multimodal-live-function-calling.py
@@ -120,7 +120,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26c-gemini-multimodal-live-video.py
+++ b/examples/foundational/26c-gemini-multimodal-live-video.py
@@ -79,7 +79,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26d-gemini-multimodal-live-text.py
+++ b/examples/foundational/26d-gemini-multimodal-live-text.py
@@ -106,7 +106,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/26e-gemini-multimodal-google-search.py
+++ b/examples/foundational/26e-gemini-multimodal-google-search.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024, Daily
+# Copyright (c) 2024-2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -34,7 +34,7 @@ search_tool = {"google_search": {}}
 tools = [search_tool]

 system_instruction = """
-You are an expert at providing the most recent news from any place. Your responses will be converted to audio, so avoid using special characters or overly complex formatting. 
+You are an expert at providing the most recent news from any place. Your responses will be converted to audio, so avoid using special characters or overly complex formatting.

 Always use the google search API to retrieve the latest news. You must also use it to check which day is today.

@@ -93,7 +93,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/27-simli-layer.py
+++ b/examples/foundational/27-simli-layer.py
@@ -83,7 +83,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
            ),
--- a/examples/foundational/28a-transcription-processor-openai.py
+++ b/examples/foundational/28a-transcription-processor-openai.py
@@ -150,7 +150,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/28b-transcript-processor-anthropic.py
+++ b/examples/foundational/28b-transcript-processor-anthropic.py
@@ -150,7 +150,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/28c-transcription-processor-gemini.py
+++ b/examples/foundational/28c-transcription-processor-gemini.py
@@ -178,7 +178,7 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/examples/foundational/30-observer.py
+++ b/examples/foundational/30-observer.py
@@ -18,12 +18,10 @@ from pipecat.frames.frames import (
    BotStartedSpeakingFrame,
    BotStoppedSpeakingFrame,
    Frame,
-    LLMFullResponseEndFrame,
-    LLMFullResponseStartFrame,
-    LLMTextFrame,
    StartInterruptionFrame,
 )
 from pipecat.observers.base_observer import BaseObserver
+from pipecat.observers.loggers.llm_log_observer import LLMLogObserver
 from pipecat.pipeline.pipeline import Pipeline
 from pipecat.pipeline.runner import PipelineRunner
 from pipecat.pipeline.task import PipelineParams, PipelineTask
@@ -73,38 +71,6 @@ class DebugObserver(BaseObserver):
            logger.info(f"🤖 BOT STOP SPEAKING: {src} {arrow} {dst} at {time_sec:.2f}s")


-class LLMLogObserver(BaseObserver):
-    """Observer to log LLM activity to the console.
-
-    Logs all frame instances of:
-    - LLMFullResponseStartFrame (only from LLM service)
-    - LLMTextFrame
-    - LLMFullResponseEndFrame (only from LLM service)
-
-    This allows you to track when the LLM starts responding, what it generates, and when it finishes.
-    Log format: [LLM EVENT]: [details] at [timestamp]s
-    """
-
-    async def on_push_frame(
-        self,
-        src: FrameProcessor,
-        dst: FrameProcessor,
-        frame: Frame,
-        direction: FrameDirection,
-        timestamp: int,
-    ):
-        time_sec = timestamp / 1_000_000_000
-
-        # Only log start/end frames from OpenAILLMService
-        if isinstance(frame, (LLMFullResponseStartFrame, LLMFullResponseEndFrame)):
-            if isinstance(src, OpenAILLMService):
-                event = "START" if isinstance(frame, LLMFullResponseStartFrame) else "END"
-                logger.info(f"🧠 LLM {event} RESPONSE at {time_sec:.2f}s")
-        # Log all LLMTextFrames
-        elif isinstance(frame, LLMTextFrame):
-            logger.info(f"🧠 LLM GENERATING: {frame.text!r} at {time_sec:.2f}s")
-
-
 async def main():
    async with aiohttp.ClientSession() as session:
        (room_url, token) = await configure(session)
@@ -151,13 +117,13 @@ async def main():

        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
                report_only_initial_ttfb=True,
-                observers=[DebugObserver(), LLMLogObserver()],
            ),
+            observers=[DebugObserver(), LLMLogObserver()],
        )

        @transport.event_handler("on_first_participant_joined")
--- a/examples/foundational/31-heartbeats.py
+++ b/examples/foundational/31-heartbeats.py
@@ -32,7 +32,7 @@ async def main():

    pipeline = Pipeline([NullProcessor()])

-    task = PipelineTask(pipeline, PipelineParams(enable_heartbeats=True))
+    task = PipelineTask(pipeline, params=PipelineParams(enable_heartbeats=True))

    runner = PipelineRunner()

--- a/examples/foundational/32-gemini-grounding-metadata.py
+++ b/examples/foundational/32-gemini-grounding-metadata.py
@@ -1,5 +1,5 @@
 #
-# Copyright (c) 2024, Daily
+# Copyright (c) 2024-2025, Daily
 #
 # SPDX-License-Identifier: BSD 2-Clause License
 #
@@ -38,7 +38,7 @@ search_tool = {"google_search_retrieval": {}}
 tools = [search_tool]

 system_instruction = """
-You are an expert at providing the most recent news from any place. Your responses will be converted to audio, so avoid using special characters or overly complex formatting. 
+You are an expert at providing the most recent news from any place. Your responses will be converted to audio, so avoid using special characters or overly complex formatting.

 Always use the google search API to retrieve the latest news. You must also use it to check which day is today.

@@ -117,7 +117,7 @@ async def main():
            ]
        )

-        task = PipelineTask(pipeline, PipelineParams(allow_interruptions=True))
+        task = PipelineTask(pipeline, params=PipelineParams(allow_interruptions=True))

        @transport.event_handler("on_first_participant_joined")
        async def on_first_participant_joined(transport, participant):
--- a/examples/foundational/33-gemini-rag.py
+++ b/examples/foundational/33-gemini-rag.py
@@ -230,7 +230,7 @@ Your response will be turned into speech so use only simple words and punctuatio
        )
        task = PipelineTask(
            pipeline,
-            PipelineParams(
+            params=PipelineParams(
                allow_interruptions=True,
                enable_metrics=True,
                enable_usage_metrics=True,
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
James Hush	230d92850a	example: realtime with transcripts	2025-02-26 16:29:07 +08:00
Aleix Conchillo Flaqué	96c6aeaada	Merge pull request #1295 from pipecat-ai/aleix/pipelinetask-keyword-arguments PipelineTask: force constructor keyword arguments	2025-02-25 19:00:58 -08:00
Aleix Conchillo Flaqué	6722aae598	PipelineTask: force constructor keyword arguments	2025-02-25 18:58:47 -08:00
Aleix Conchillo Flaqué	66564392a6	Merge pull request #1293 from pipecat-ai/aleix/log-pipecat-version log pipecat version on application startup	2025-02-25 18:57:52 -08:00
Aleix Conchillo Flaqué	f258f5ab66	Merge pull request #1292 from pipecat-ai/aleix/audiocontext-terminate-nicely AudioContextWordTTSService: wait for all requested audio	2025-02-25 18:56:41 -08:00
Aleix Conchillo Flaqué	f8f0578c3d	log pipecat version on application startup	2025-02-25 18:55:45 -08:00
Aleix Conchillo Flaqué	aa60a413f3	Merge pull request #1294 from pipecat-ai/aleix/improve-test-requirements improve test-requirements.txt	2025-02-25 18:55:18 -08:00
Aleix Conchillo Flaqué	3e66f2378d	improve test-requirements.txt	2025-02-25 17:34:33 -08:00
Aleix Conchillo Flaqué	9a50f33e36	AudioContextWordTTSService: wait for all requested audio	2025-02-25 15:35:47 -08:00
Aleix Conchillo Flaqué	4bd5e9c0a7	Merge pull request #1285 from pipecat-ai/aleix/handle-stop-task-gracefully handle stop task gracefully	2025-02-25 11:25:38 -08:00
Mark Backman	12092c8715	Merge pull request #1288 from pipecat-ai/mb/clean-up-tts-text-input TTSService: Remove newlines before sending text to TTS service to gen…	2025-02-25 14:00:43 -05:00
Mark Backman	92cc6d39f2	TTSService: Remove newlines before sending text to TTS service to generate	2025-02-25 13:37:25 -05:00
Aleix Conchillo Flaqué	34a50033cb	tk: use TkTransportParams in examples	2025-02-25 10:24:24 -08:00
Aleix Conchillo Flaqué	e60b65228b	allow multiple StartFrames	2025-02-25 10:24:04 -08:00
Mark Backman	e74864335b	Merge pull request #1287 from pipecat-ai/mb/30-observer-pipeline-task Example 30: Move observers to PipelineTask	2025-02-25 12:11:23 -05:00
Mark Backman	27a088a457	Merge pull request #1286 from pipecat-ai/mb/update-grok-2 Set grok-2 as default model for GrokLLMSService	2025-02-25 12:11:09 -05:00
Mark Backman	cfe72143b8	Example 30: Move observers to PipelineTask	2025-02-25 10:54:25 -05:00
Mark Backman	36a729cbfe	Set grok-2 as default model for GrokLLMSService	2025-02-25 10:00:45 -05:00
Aleix Conchillo Flaqué	d2f006682c	introduce new BaseTaskManager	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	fb7fe540f5	tts: don't connect to websocket if already connected	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	1ec68bd071	make sure we don't create tasks if already created	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	4536d03e82	FrameProcessor: cancel input/push tasks on CancelFrame	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	699704732c	asyncio: re-raise CancelledError in wait_for_task()	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	376d969a77	task: handle StopFrame and StopTaskFrame gracefully	2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué	68789dfcf0	frames: add new StopFrame	2025-02-24 21:34:23 -08:00
Aleix Conchillo Flaqué	fe9fc61c4e	Merge pull request #1282 from pipecat-ai/aleix/pipelinetask-observers-constructor PipelineTask: pass observers in contructor parameter	2025-02-24 21:29:46 -08:00
Aleix Conchillo Flaqué	6028f0f23a	PipelineTask: pass observers in contructor parameter	2025-02-24 21:29:17 -08:00
Aleix Conchillo Flaqué	e9a0959e28	Merge pull request #1283 from pipecat-ai/aleix/check-dangling-tasks PipelineTask: add check_dangling_tasks parameter	2025-02-24 21:26:32 -08:00
Dominic Stewart	f66be2cfa7	Dom/gemini system prompt switching (#1260 ) * Updated example to use Gemini * Fixed typo * Based on feedback, made the gemini file something that can be called separately * Updated the readme * Updated the readme * Changed example to use gemini 2.0 flash lite * This works * Improvement * I think this works * Updated the code to use the correct prompt broken down into smaller pieces * Added a few more things to detect in the prompt * Fixed import ordering * Updated prompt for non gemini bot to look for more voicemail examples, plus added logic to detect if we're doing dialin or not to avoid a non-fatal dialin related error * moved terminate call to handlers class * Simplified logic for dialin * Forgot to use the same logic for the openai bot * Starting to add logic for native audio input for flash lite * Fixed logic * Fixed some code based on suggestions	2025-02-24 22:29:55 -06:00
Aleix Conchillo Flaqué	f818bed58f	Merge pull request #1281 from pipecat-ai/aleix/google-context-aggregator-upgrade-context google: updgrade OpenAILLMContext to GoogleLLMContext	2025-02-24 17:37:26 -08:00
Aleix Conchillo Flaqué	07b9be5308	PipelineTask: add check_dangling_tasks parameter	2025-02-24 17:33:10 -08:00
Aleix Conchillo Flaqué	40c2452d6e	google: updgrade OpenAILLMContext to GoogleLLMContext	2025-02-24 15:35:18 -08:00
Aleix Conchillo Flaqué	30cdd1b71a	Merge pull request #1280 from pipecat-ai/aleix/add-completion-timeout services(llm): add on_completion_timeout event	2025-02-24 15:07:20 -08:00
Aleix Conchillo Flaqué	2110b79507	services(llm): add on_completion_timeout event	2025-02-24 14:55:36 -08:00
Aleix Conchillo Flaqué	fc544fa61c	Merge pull request #1272 from pipecat-ai/aleix/tts-websocket-interruptions services: fix some TTS websocket service interruption handling	2025-02-24 14:54:41 -08:00
Mark Backman	976fe95304	Merge pull request #1279 from pipecat-ai/mb/remove-open-optional-dep Remove `openai` optional dependency from services as it's now required	2025-02-24 17:42:53 -05:00
Aleix Conchillo Flaqué	408270b647	lmnt: don't send "eof" before closing the socket	2025-02-24 14:37:37 -08:00
Mark Backman	1dfb75bc9d	Merge pull request #1278 from pipecat-ai/mb/claude-3-7 Update AnthropicLLMService to use claude-3-7-sonnet-20250219 by default	2025-02-24 15:41:28 -05:00
Mark Backman	cefc2a1088	Fix test-requirements.text ordering	2025-02-24 15:06:13 -05:00
Mark Backman	3b9b9200ea	Remove openai optional dependency from services as it's now required	2025-02-24 15:05:42 -05:00
Mark Backman	d6f29a0f4b	Update AnthropicLLMService to use claude-3-7-sonnet-20250219 by default	2025-02-24 14:32:00 -05:00
Aleix Conchillo Flaqué	5b762d11ef	Merge pull request #1228 from CarlKho-Minerva/main Missing Cartesia~=1.3.1 → `test-requirements`	2025-02-24 08:47:41 -08:00
Aleix Conchillo Flaqué	2f3e2da6b9	Merge pull request #1259 from pipecat-ai/openai-not-optional Since the `openai` package is used by pretty much everything in pipec…	2025-02-24 08:45:45 -08:00
allenmylath	45058d4a94	Update audio_buffer_processor.py (#1266 )	2025-02-24 08:41:19 -08:00
Aleix Conchillo Flaqué	5b637bd826	services: fix some TTS websocket service interruption handling	2025-02-24 08:37:22 -08:00
Mark Backman	2d4fd7e903	Merge pull request #1274 from pipecat-ai/mb/add-ellipsis-test Add one additional ellipsis test to test_utils_string	2025-02-23 11:26:20 -05:00
Mark Backman	b5662520aa	Add one additional ellipsis test to test_utils_string	2025-02-23 11:04:24 -05:00
Aleix Conchillo Flaqué	af45c170b5	Merge pull request #1264 from pipecat-ai/aleix/add-log-observers add initial log observers	2025-02-21 15:20:45 -08:00
Aleix Conchillo Flaqué	65f548b2ec	examples(30-observer): update to use LLMLogObserver	2025-02-21 15:15:16 -08:00
Aleix Conchillo Flaqué	b29ab8c608	observers: add LLMLogObserver and TranscriptionLogObserver	2025-02-21 15:15:16 -08:00
Aleix Conchillo Flaqué	d6dc37f0b6	Merge pull request #1269 from pipecat-ai/aleix/endofsentence-support-ellipses utils: add support for ellipses in match_endofsentence()	2025-02-21 15:08:22 -08:00
Aleix Conchillo Flaqué	12bce2e8c0	utils: add support for ellipses in match_endofsentence()	2025-02-21 15:05:50 -08:00
Aleix Conchillo Flaqué	4acf7296e0	Merge pull request #1261 from pipecat-ai/aleix/emualted-frames-being-triggered-prematurely LLMUserContextAggregator: don't reset timer with interim transcription	2025-02-21 10:15:28 -08:00
Aleix Conchillo Flaqué	98706d429c	LLMUserContextAggregator: make sure incoming transcription has text	2025-02-21 10:12:54 -08:00
Aleix Conchillo Flaqué	41720b1a13	LLMUserContextAggregator: don't reset timer with interim transcription It turns out that in some cases we only get interim transcriptions (e.g. someone is speaking very very softly or someone is talking in the background). In those cases we don't want to interrupt the bot because there's really nothing to interrupt the bot for. We originally thought we should interrupt the bot right at the time we got an interim frame, but this is causing too many false positives. It's actually better to simply wait for a real transcription before interrupting (in case VAD didn't interrupt).	2025-02-21 09:05:56 -08:00
Aleix Conchillo Flaqué	3ef4245166	Merge pull request #1265 from pipecat-ai/aleix/transport-remove-audio-out-is-live	2025-02-21 06:51:09 -08:00
Filipi da Silva Fuchter	3bb0797922	Merge pull request #1257 from pipecat-ai/fastapi_disconnect_issue Fixed an issue where FastAPI was not triggering on_client_disconnected.	2025-02-21 09:15:15 -03:00
Filipi Fuchter	7c7b4c52af	Fixed an issue where EndTaskFrame was not triggering on_client_disconnected or closing the WebSocket in FastAPI.	2025-02-21 09:11:58 -03:00
Aleix Conchillo Flaqué	01f083b7fc	transports: remove TransportParams.audio_out_is_live	2025-02-20 23:33:06 -08:00
Aleix Conchillo Flaqué	91fcaebe25	Merge pull request #1263 from Vaibhav159/vl_fix_deepgram_sample_rate_mismatch fixing deepgram mismatch	2025-02-20 22:39:06 -08:00
Vaibhav159	9c5fe5c85e	fixing deepgram mismatch	2025-02-21 09:32:40 +05:30
Aleix Conchillo Flaqué	7e5e167a4b	Merge pull request #1250 from pipecat-ai/aleix/context-aggregation-simulatenous-text-tools AssistantContextAggregator: append aggregation and tools in the same turn	2025-02-20 17:32:57 -08:00
Aleix Conchillo Flaqué	d04c4b36f3	AssistantContextAggregator: append aggregation and tools in the same turn	2025-02-20 17:29:43 -08:00
Aleix Conchillo Flaqué	a811e53626	Merge pull request #1253 from pipecat-ai/aleix/http-tts-services-stopped-frame HTTP TTS services stopped frame	2025-02-20 17:28:05 -08:00
Paul Kompfner	df57202a05	Since the `openai` package is used by pretty much everything in pipecat (due to `OpenAILLMContext` being the standard context representation), let's make it a non-optional dependency. This change solves an issue faced by users who aren't intending to use OpenAI getting scary error messages saying that they need the `openai` optional dependency "in order to use OpenAI", along with an instruction to set the OPENAI_API_KEY environment variable. Note that with this change we could theoretically remove from pyproject.toml a number of defined optional dependencies that list only the `openai` package as a dependency (like `deepseek`, for example), but I didn't want to "break the API" in terms of how users install/consume pipecat and its set of built-in services. Finally, I removed the `python-deepcompare` dependency from the `openai` optional dependency, since it appears to me like it was added by mistake (my guess is it was used for debugging during development and then never removed).	2025-02-20 15:21:35 -05:00
Aleix Conchillo Flaqué	69e6f3fdb7	rime: pass aiohttp session to constructor	2025-02-20 07:36:24 -08:00
Aleix Conchillo Flaqué	6809254963	tts: fix metrics and TTSStoppedFrame frame in HTTP services Fixes #1247	2025-02-20 07:36:21 -08:00
Aleix Conchillo Flaqué	81093d3bed	Merge pull request #1252 from pipecat-ai/aleix/remove-vad-extra-logging BaseInputTransport: remove VAD logging	2025-02-20 07:32:20 -08:00
Aleix Conchillo Flaqué	d9a67164f6	Merge pull request #1251 from pipecat-ai/aleix/fish-tts-service-push-stop-frame FishAudioTTSService should push TTSStoppedFrame	2025-02-20 07:32:05 -08:00
Aleix Conchillo Flaqué	98259af54e	update CHANGELOG	2025-02-19 22:05:48 -08:00
Dominic Stewart	039d144c79	examples(phone-bot): updated example to use Gemini (#1233 )	2025-02-19 22:03:37 -08:00
Aleix Conchillo Flaqué	d0f67fc189	BaseInputTransport: remove VAD logging These logs are very verbose. They were added to try to find an issue that resulted in being because of low CPU/memory resources, but these logs were not helpful to determine that.	2025-02-19 21:55:11 -08:00
Aleix Conchillo Flaqué	6e3f96aa83	fish: automatically send TTSStoppedFrame after timeout	2025-02-19 21:41:18 -08:00
Aleix Conchillo Flaqué	293677588d	tts: make push_stop_frames default to 2.0s	2025-02-19 21:39:00 -08:00
Filipi da Silva Fuchter	77e777b1ce	Merge pull request #1249 from pipecat-ai/invoking_call_start_function Fixed an issue that `start_callback` was not invoked for some LLM services	2025-02-19 18:09:00 -03:00
Filipi Fuchter	7e7926059c	Fixed an issue that `start_callback` was not invoked for some LLM services.	2025-02-19 18:04:20 -03:00
Aleix Conchillo Flaqué	c948754eff	Merge pull request #1248 from pipecat-ai/aleix/daily-transport-room-url daily: add room_url property	2025-02-19 09:46:46 -08:00
Aleix Conchillo Flaqué	83f1a8830d	daily: add room_url property	2025-02-19 09:29:53 -08:00
James Hush	80f8e05fcf	docs: fix transcripts in translation chatbot example (#1199 )	2025-02-19 16:07:22 +08:00
Aleix Conchillo Flaqué	afd1a1e80b	Merge pull request #1245 from pipecat-ai/aleix/stt-mute-filter-trace-logging	2025-02-18 21:21:55 -08:00
Aleix Conchillo Flaqué	84ac88cad7	STTMuteFilter: change suppressed logging to trace	2025-02-18 18:03:37 -08:00
Aleix Conchillo Flaqué	211163e5c7	Merge pull request #1241 from pipecat-ai/aleix/deepgram-nova-3 deepgram: use the new nova-3 model as default	2025-02-18 17:53:04 -08:00
Aleix Conchillo Flaqué	1b0bcebef6	deepgram: use the new nova-3 model as default	2025-02-18 17:51:54 -08:00
Aleix Conchillo Flaqué	89736b03c4	Merge pull request #1243 from pipecat-ai/aleix/add-deepgram-addons deepgram: add ability to provide custom addons	2025-02-18 17:47:48 -08:00
Aleix Conchillo Flaqué	4edda718ed	deepgram: add ability to provide custom addons	2025-02-18 17:45:41 -08:00
Aleix Conchillo Flaqué	22a62edc9e	Merge pull request #1242 from pipecat-ai/aleix/utils-network-exponential network: added exponential_backoff_time() function	2025-02-18 17:44:21 -08:00
Aleix Conchillo Flaqué	50b6cc8135	network: added exponential_backoff_time() function	2025-02-18 17:42:43 -08:00
Aleix Conchillo Flaqué	45cf36925a	Merge pull request #1240 from pipecat-ai/aleix/handle-deepgram-on-error deepgram: handle error event and reconnect	2025-02-18 17:41:29 -08:00
Filipi da Silva Fuchter	83a71e1fec	Merge pull request #1112 from pipecat-ai/bot-ready-signalling-rn React Native client for the bot ready example.	2025-02-18 15:17:38 -03:00
Filipi Fuchter	e809c8680e	Upgrading to use the latest node stable version	2025-02-18 15:12:44 -03:00
Aleix Conchillo Flaqué	c926063d74	deepgram: handle error event and reconnect	2025-02-18 09:52:18 -08:00
Aleix Conchillo Flaqué	0334550356	Merge pull request #1238 from pipecat-ai/aleix/stt-mute-filter-ignore-input-audio-frames STTMuteFilter: ignore audio frames so no transcriptions are generated	2025-02-18 09:48:13 -08:00
Aleix Conchillo Flaqué	90b9dce710	STTMuteFilter: ignore audio frames so no transcriptions are generated	2025-02-17 19:59:05 -08:00
Carl Kho	a5cdd5f1b8	Add Cartesia API key to dot-env.template	2025-02-14 21:29:37 -08:00
Carl Kho	5f937b8479	Update test requirements to include Cartesia version 1.3.1	2025-02-14 21:14:32 -08:00
Filipi Fuchter	7e3e126730	Migrating the base API URL for the react native example to an .env file.	2025-01-30 10:42:16 -03:00
Filipi Fuchter	75ca0571bb	Improving the layout from the bot ready react native demo.	2025-01-30 10:31:04 -03:00
Filipi Fuchter	a48e5d0714	Only sending the message when it is a remote audio track.	2025-01-30 10:14:37 -03:00
Filipi Fuchter	2b6a992207	Sending the app-message to start playing audio once the track has started.	2025-01-30 09:37:33 -03:00
Filipi Fuchter	24cf106ed2	Refactoring the code to ask for the room that it should connect.	2025-01-30 09:14:18 -03:00
Filipi Fuchter	95c8346cb5	Starting to create a react native client for the bot ready example.	2025-01-29 19:00:42 -03:00