Go to file

Paul Kompfner b14a03d01f fix: extend cancel_on_interruption=False regression fix to remaining realtime services

Applies the same async-tool message routing introduced for AWSNovaSonicLLMService
and OpenAIRealtimeLLMService to additional realtime LLM services where the
flag's intent ("keep talking while the tool runs") is achievable:

- GrokRealtimeLLMService (xAI Realtime — also benefits the deprecated Grok
  alias since it re-exports the xAI module)
- AzureRealtimeLLMService picks up the fix transitively by inheriting from
  OpenAIRealtimeLLMService — no code change needed.

GrokRealtimeLLMService's _process_completed_function_calls now matches
the canonical pattern: skip LLMSpecificMessage, detect async-tool messages
via parse_message and route them — started skipped silently, intermediate
logged as an error and surfaced via push_error, final delivered through
the same channel as a synchronous result.

UltravoxRealtimeLLMService instead gets a one-time warning when async-tool
messages appear in the context. The Ultravox API freezes the conversation
during tool execution
(https://docs.ultravox.ai/tools/async-tools#custom-tool-timeouts), so the
flag's "keep talking while the tool runs" intent isn't achievable there —
applying the same code pattern would mislead users into expecting a UX
Ultravox can't deliver. Surfacing a clear warning is the right behavior
until Ultravox grows true async tool support.

Adds async-tool example files for Grok and Azure modeled on the existing
Nova Sonic / OpenAI Realtime ones (10s simulated network delay, weather
tool registered with cancel_on_interruption=False).

Two services remain excluded:

- GeminiLiveLLMService — the async-tool path needs deeper investigation.
- InworldRealtimeLLMService — appears to have a pre-existing problem
  with even simple synchronous tool calling on its Realtime API (the
  request reaches the server fine, but response generation fails with a
  generic server_error).

2026-05-08 15:43:53 -04:00

.claude

Move foundational examples to examples/

2026-03-31 13:12:24 -04:00

.claude-plugin

Add /update-docs skill to claude-plugin

2026-02-25 15:45:16 -05:00

.github

ci: install runner extra for the coverage job

2026-05-04 16:44:47 -04:00

changelog

fix: extend cancel_on_interruption=False regression fix to remaining realtime services

2026-05-08 15:43:53 -04:00

docs/api

Fix Pydantic v2 + Sphinx autodoc incompatibility for Daily utils

2026-04-03 12:00:11 -04:00

examples

fix: extend cancel_on_interruption=False regression fix to remaining realtime services

2026-05-08 15:43:53 -04:00

scripts

chore(scripts): add release-changelog.py

2026-04-27 15:07:53 -07:00

src/pipecat

fix: extend cancel_on_interruption=False regression fix to remaining realtime services

2026-05-08 15:43:53 -04:00

tests

fix: restore cancel_on_interruption=False support in AWS Nova Sonic and OpenAI Realtime

2026-05-08 09:33:06 -04:00

.gitignore

Save Smart Turn input data if SMART_TURN_LOG_DATA is set

2026-01-22 14:17:59 +00:00

.pre-commit-config.yaml

Remove deprecated OpenAILLMContext as well as everything (code paths or whole types) dependent on it (all of which were also deprecated)

2026-03-31 18:15:25 -04:00

.readthedocs.yaml

Clean up docs config after riva removal and add missing modules

2026-04-03 09:52:31 -04:00

CHANGELOG.md

Update changelog for version 1.1.0

2026-04-27 13:59:17 -07:00

CLAUDE.md

Document deprecation docstring convention in CLAUDE.md.

2026-05-07 10:03:43 -04:00

codecov.yml

include codecov.yml

2025-02-11 23:46:19 -08:00

COMMUNITY_INTEGRATIONS.md

Move foundational examples to examples/

2026-03-31 13:12:24 -04:00

CONTRIBUTING.md

Add Performance as a changelog fragment option

2026-02-25 09:47:42 -05:00

env.example

rename environment variables and references from AICOUSTICS to AIC.

2026-04-25 09:51:23 +02:00

LICENSE

Update copyright date range to 2024-2026

2026-01-07 16:58:13 -05:00

MANIFEST.in

add MANIFEST.in to reduce sdist tarball size

2025-05-28 10:09:39 -07:00

pipecat.png

renamed image.png to pipecat.png

2024-05-12 17:44:10 -07:00

pyproject.toml

chore(daily): bump daily-python to ~=0.28.0

2026-04-27 13:35:14 -07:00

pyrightconfig.json

fix: clear 8 more services from pyright ignore list

2026-05-01 09:36:14 -04:00

README.md

Update README to remove NVIDIA references to RIVA

2026-04-28 12:42:58 -04:00

SECURITY.md

Add SECURITY.md

2025-10-05 13:24:47 -05:00

uv.lock

chore(daily): bump daily-python to ~=0.28.0

2026-04-27 13:35:14 -07:00

README.md

🎙️ Pipecat: Real-Time Voice & Multimodal AI Agents

Pipecat is an open-source Python framework for building real-time voice and multimodal conversational agents. Orchestrate audio and video, AI services, different transports, and conversation pipelines effortlessly—so you can focus on what makes your agent unique.

Want to dive right in? Run pipecat init quickstart or follow the quickstart guide.

🚀 What You Can Build

Voice Assistants – natural, streaming conversations with AI
AI Companions – coaches, meeting assistants, characters
Multimodal Interfaces – voice, video, images, and more
Interactive Storytelling – creative tools with generative media
Business Agents – customer intake, support bots, guided flows
Complex Dialog Systems – design logic with structured conversations

🧠 Why Pipecat?

Voice-first: Integrates speech recognition, text-to-speech, and conversation handling
Pluggable: Supports many AI services and tools
Composable Pipelines: Build complex behavior from modular components
Real-Time: Ultra-low latency interaction with different transports (e.g. WebSockets or WebRTC)

🌐 Pipecat Ecosystem

🧩 Multi-agent systems

Need multiple AI agents working together? Pipecat Subagents lets you build distributed multi-agent systems where each agent runs its own pipeline and communicates through a shared message bus. Hand off conversations between specialists, dispatch background tasks, and scale agents across processes or machines.

📱 Client SDKs

Building client applications? You can connect to Pipecat from any platform using our official SDKs:

🧭 Structured conversations

Looking to build structured conversations? Check out Pipecat Flows for managing complex conversational states and transitions.

🪄 Beautiful UIs

Want to build beautiful and engaging experiences? Checkout the Voice UI Kit, a collection of components, hooks and templates for building voice AI applications quickly.

🛠️ Create and deploy projects

Create a new project in under a minute with the Pipecat CLI. Then use the CLI to monitor and deploy your agent to production.

🔍 Debugging

Looking for help debugging your pipeline and processors? Check out Whisker, a real-time Pipecat debugger.

🖥️ Terminal

Love terminal applications? Check out Tail, a terminal dashboard for Pipecat.

🤖 Claude Code Skills

Use Pipecat Skills with Claude Code to scaffold projects, deploy to Pipecat Cloud, and more. Install the marketplace with:

claude plugin marketplace add pipecat-ai/skills

and install any of the available plugins.

🧩 Community Integrations

Build and share your own Pipecat service integrations! Browse existing community integrations or check out our guide to create your own.

📺️ Pipecat TV Channel

Catch new features, interviews, and how-tos on our Pipecat TV channel.

🎬 See it in action

🧩 Available services

Category	Services
Speech-to-Text	AssemblyAI, AWS, Azure, Cartesia, Deepgram, ElevenLabs, Fal Wizper, Gladia, Google, Gradium, Groq (Whisper), Mistral, NVIDIA, OpenAI (Whisper), Sarvam, Soniox, Speechmatics, Whisper, xAI
LLMs	Anthropic, AWS, Azure, Cerebras, DeepSeek, Fireworks AI, Gemini, Grok, Groq, Mistral, Nebius, Novita, NVIDIA NIM, Ollama, OpenAI, OpenAI Responses, OpenRouter, Perplexity, Qwen, SambaNova, Sarvam, Together AI
Text-to-Speech	Async, AWS, Azure, Camb AI, Cartesia, Deepgram, ElevenLabs, Fish, Google, Gradium, Groq, Hume, Inworld, Kokoro, LMNT, MiniMax, Mistral, Neuphonic, NVIDIA, OpenAI, Piper, Resemble, Rime, Sarvam, Smallest, Soniox, Speechmatics, xAI, XTTS
Speech-to-Speech	AWS Nova Sonic, Gemini Multimodal Live, Grok Voice Agent, OpenAI Realtime, Ultravox,
Transport	Daily (WebRTC), FastAPI Websocket, LiveKit (WebRTC), SmallWebRTCTransport, WebSocket Server, WhatsApp, Local
Serializers	Exotel, Genesys, Plivo, Twilio, Telnyx, Vonage
Video	HeyGen, LemonSlice, Tavus, Simli
Memory	mem0
Vision & Image	fal, Google Imagen, Moondream
Audio Processing	Silero VAD, Krisp Viva, Koala, ai-coustics, RNNoise
Analytics & Metrics	OpenTelemetry, Sentry
Community	Browse community integrations →

📚 View full services documentation →

⚡ Getting started

You can get started with Pipecat running on your local machine, then move your agent processes to the cloud when you're ready.

Install uv
```
curl -LsSf https://astral.sh/uv/install.sh | sh
```
Need help? Refer to the uv install documentation.

Install the module

# For new projects
uv init my-pipecat-app
cd my-pipecat-app
uv add pipecat-ai

# Or for existing projects
uv add pipecat-ai

Set up your environment
```
cp env.example .env
```
To keep things lightweight, only the core framework is included by default. If you need support for third-party AI services, you can add the necessary dependencies with:
```
uv add "pipecat-ai[option,...]"
```

Using pip? You can still use pip install pipecat-ai and pip install "pipecat-ai[option,...]" to get set up.

🧪 Code examples

Foundational — small snippets that build on each other, introducing one or two concepts at a time
Example apps — complete applications that you can use as starting points for development

🛠️ Contributing to the framework

Prerequisites

Minimum Python Version: 3.11 Recommended Python Version: >= 3.12

Setup Steps

Clone the repository and navigate to it:

git clone https://github.com/pipecat-ai/pipecat.git
cd pipecat

Install development and testing dependencies:

uv sync --group dev --all-extras \
  --no-extra gstreamer \
  --no-extra local \

Install the git pre-commit hooks:
```
uv run pre-commit install
```

Note

: Some extras (local, gstreamer) require system dependencies. See documentation if you encounter build errors.

Claude Code Skills

Install development workflow skills for contributing to Pipecat with Claude Code:

claude plugin marketplace add pipecat-ai/pipecat
claude plugin install pipecat-dev@pipecat-dev-skills

Running tests

To run all tests, from the root directory:

uv run pytest

Run a specific test suite:

uv run pytest tests/test_name.py

🤝 Contributing

We welcome contributions from the community! Whether you're fixing bugs, improving documentation, or adding new features, here's how you can help:

Found a bug? Open an issue
Have a feature idea? Start a discussion
Want to contribute code? Check our CONTRIBUTING.md guide
Documentation improvements? Docs PRs are always welcome

Before submitting a pull request, please check existing issues and PRs to avoid duplicates.

We aim to review all contributions promptly and provide constructive feedback to help get your changes merged.

🛟 Getting help

➡️ Join our Discord

➡️ Read the docs

➡️ Reach us on X

README.md Unescape Escape

🎙️ Pipecat: Real-Time Voice & Multimodal AI Agents

🚀 What You Can Build

🧠 Why Pipecat?

🌐 Pipecat Ecosystem

🧩 Multi-agent systems

📱 Client SDKs

🧭 Structured conversations

🪄 Beautiful UIs

🛠️ Create and deploy projects

🔍 Debugging

🖥️ Terminal

🤖 Claude Code Skills

🧩 Community Integrations

📺️ Pipecat TV Channel

🎬 See it in action

🧩 Available services

⚡ Getting started

🧪 Code examples

🛠️ Contributing to the framework

Prerequisites

Setup Steps

Claude Code Skills

Running tests

🤝 Contributing

🛟 Getting help

README.md