Go to file

Paul Kompfner 8a4ab611be Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with *UpdateSettingsFrames.

Does not (yet) touch `InputParams`, to avoid scope creep and touching something currently part of the public API. But there is a lot of overlap between `*Settings` object fields and `InputParams` fields.

Other than discoverability/typing, these are some other improvements brought by this refactor:
- There is now a single code path (see `_update_settings_from_typed`) where services can respond to settings changes (by, say, reconnecting if needed), improving maintainability and guaranteeing one and only one reconnection no matter which settings changed
- `set_language`/`set_model`/`set_voice`—which we're assuming are usable as public methods, though *not* recommended over `*UpdateSettingsFrame`—all use the same code path as settings updates. They're also now all consistent in that, if a service needs to respond to a change (by, say, reconnecting if needed), any of these methods will kick off that process. Note that this is technically a behavior change.
- Several services now properly react to changed settings by reconnecting:
  - `AWSTranscribeSTTService`
  - `AzureSTTService`
  - `SonioxSTTService`
  - `GladiaSTTService`
  - `SpeechmaticsSTTService`
  - `AssemblyAISTTService`
  - `CartesiaSTTService`
  - `FishAudioTTSService` (would previously only reconnect when `model` changed)
  - `GoogleSTTService`
  - `SpeechmaticsSTTService` (which previously only handled *some* settings updates through a nonstandard public `update_params` method)
  - `GradiumSTTService`
  - `NvidiaSegmentedSTTService` (which previously only handled changes to language)
- Bookkeeping across various services has been reduced, mostly by deduping ivars; the `self._settings` ivar is treated as the source of truth

NOTE: I pretty much guarantee that there are services missed in this PR in terms of bringing to consistency with how updates are handled (like whether changes in certain fields trigger reconnects when they need to). We can squash remaining inconsistencies as we stumble onto them, service by service. The goal here is to get things *mostly* in order, and establish the infrastructure and patterns we'll need going forward.

2026-02-13 15:12:26 -05:00

.claude

Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with *UpdateSettingsFrames.

2026-02-13 15:12:26 -05:00

.github

pyproject: add local smartturn as a default dependency

2026-02-10 14:32:32 -08:00

changelog

Update changelog for version 0.0.102

2026-02-10 18:28:21 -08:00

docs/api

Mock FastAPI

2026-01-15 17:29:47 -05:00

examples

Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with *UpdateSettingsFrames.

2026-02-13 15:12:26 -05:00

scripts

Add OpenAIRealtimeSTTService

2026-02-05 15:48:00 -05:00

src/pipecat

Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with *UpdateSettingsFrames.

2026-02-13 15:12:26 -05:00

tests

Broad service settings refactor, with the primary aim of making service settings discoverable and strongly-typed. Service settings can be updated at runtime with *UpdateSettingsFrames.

2026-02-13 15:12:26 -05:00

.dockerignore

Modularize tricky dependencies (#95 )

2024-04-03 10:48:11 -05:00

.gitignore

Save Smart Turn input data if SMART_TURN_LOG_DATA is set

2026-01-22 14:17:59 +00:00

.pre-commit-config.yaml

Update pre-commit-config ruff version

2025-08-02 18:06:06 -04:00

.readthedocs.yaml

Add Ultravox service (#1 )

2025-12-12 10:16:15 -08:00

CHANGELOG.md

Update changelog for version 0.0.102

2026-02-10 18:28:21 -08:00

CHANGELOG.md.template

add CHANGELOG.md

2024-05-14 13:45:01 -07:00

CLAUDE.md

CLAUDE.md: add pipeline task and pipeline runner

2026-02-09 16:19:11 -08:00

codecov.yml

include codecov.yml

2025-02-11 23:46:19 -08:00

COMMUNITY_INTEGRATIONS.md

Update NVIDIA NIM and Riva services to Nvidia

2025-12-01 22:41:17 -06:00

CONTRIBUTING.md

Add 'other' changelog category

2025-12-29 20:43:24 -05:00

env.example

Add ResembleAITTSService

2026-02-02 08:55:27 -05:00

LICENSE

Update copyright date range to 2024-2026

2026-01-07 16:58:13 -05:00

MANIFEST.in

add MANIFEST.in to reduce sdist tarball size

2025-05-28 10:09:39 -07:00

pipecat.png

renamed image.png to pipecat.png

2024-05-12 17:44:10 -07:00

pyproject.toml

pyproject: add local smartturn as a default dependency

2026-02-10 14:32:32 -08:00

README.md

Add Resemble TTS to README

2026-02-02 09:05:03 -05:00

SECURITY.md

Add SECURITY.md

2025-10-05 13:24:47 -05:00

uv.lock

pyproject: add local smartturn as a default dependency

2026-02-10 14:32:32 -08:00

README.md

🎙️ Pipecat: Real-Time Voice & Multimodal AI Agents

Pipecat is an open-source Python framework for building real-time voice and multimodal conversational agents. Orchestrate audio and video, AI services, different transports, and conversation pipelines effortlessly—so you can focus on what makes your agent unique.

Want to dive right in? Try the quickstart.

🚀 What You Can Build

Voice Assistants – natural, streaming conversations with AI
AI Companions – coaches, meeting assistants, characters
Multimodal Interfaces – voice, video, images, and more
Interactive Storytelling – creative tools with generative media
Business Agents – customer intake, support bots, guided flows
Complex Dialog Systems – design logic with structured conversations

🧠 Why Pipecat?

Voice-first: Integrates speech recognition, text-to-speech, and conversation handling
Pluggable: Supports many AI services and tools
Composable Pipelines: Build complex behavior from modular components
Real-Time: Ultra-low latency interaction with different transports (e.g. WebSockets or WebRTC)

🌐 Pipecat Ecosystem

📱 Client SDKs

Building client applications? You can connect to Pipecat from any platform using our official SDKs:

🧭 Structured conversations

Looking to build structured conversations? Check out Pipecat Flows for managing complex conversational states and transitions.

🪄 Beautiful UIs

Want to build beautiful and engaging experiences? Checkout the Voice UI Kit, a collection of components, hooks and templates for building voice AI applications quickly.

🛠️ Create and deploy projects

Create a new project in under a minute with the Pipecat CLI. Then use the CLI to monitor and deploy your agent to production.

🔍 Debugging

Looking for help debugging your pipeline and processors? Check out Whisker, a real-time Pipecat debugger.

🖥️ Terminal

Love terminal applications? Check out Tail, a terminal dashboard for Pipecat.

📺️ Pipecat TV Channel

Catch new features, interviews, and how-tos on our Pipecat TV channel.

🎬 See it in action

🧩 Available services

Category	Services
Speech-to-Text	AssemblyAI, AWS, Azure, Cartesia, Deepgram, ElevenLabs, Fal Wizper, Gladia, Google, Gradium, Groq (Whisper), Hathora, NVIDIA Riva, OpenAI (Whisper), SambaNova (Whisper), Sarvam, Soniox, Speechmatics, Whisper
LLMs	Anthropic, AWS, Azure, Cerebras, DeepSeek, Fireworks AI, Gemini, Grok, Groq, Mistral, NVIDIA NIM, Ollama, OpenAI, OpenRouter, Perplexity, Qwen, SambaNova Together AI
Text-to-Speech	Async, AWS, Azure, Camb AI, Cartesia, Deepgram, ElevenLabs, Fish, Google, Gradium, Groq, Hathora, Hume, Inworld, LMNT, MiniMax, Neuphonic, NVIDIA Riva, OpenAI, Piper, PlayHT, Resemble, Rime, Sarvam, Speechmatics, XTTS
Speech-to-Speech	AWS Nova Sonic, Gemini Multimodal Live, Grok Voice Agent, OpenAI Realtime, Ultravox,
Transport	Daily (WebRTC), FastAPI Websocket, SmallWebRTCTransport, WebSocket Server, Local
Serializers	Exotel, Plivo, Twilio, Telnyx, Vonage
Video	HeyGen, Tavus, Simli
Memory	mem0
Vision & Image	fal, Google Imagen, Moondream
Audio Processing	Silero VAD, Krisp, Koala, ai-coustics
Analytics & Metrics	OpenTelemetry, Sentry

📚 View full services documentation →

⚡ Getting started

You can get started with Pipecat running on your local machine, then move your agent processes to the cloud when you're ready.

Install uv
```
curl -LsSf https://astral.sh/uv/install.sh | sh
```
Need help? Refer to the uv install documentation.

Install the module

# For new projects
uv init my-pipecat-app
cd my-pipecat-app
uv add pipecat-ai

# Or for existing projects
uv add pipecat-ai

Set up your environment
```
cp env.example .env
```
To keep things lightweight, only the core framework is included by default. If you need support for third-party AI services, you can add the necessary dependencies with:
```
uv add "pipecat-ai[option,...]"
```

Using pip? You can still use pip install pipecat-ai and pip install "pipecat-ai[option,...]" to get set up.

🧪 Code examples

Foundational — small snippets that build on each other, introducing one or two concepts at a time
Example apps — complete applications that you can use as starting points for development

🛠️ Contributing to the framework

Prerequisites

Minimum Python Version: 3.10 Recommended Python Version: 3.12

Setup Steps

Clone the repository and navigate to it:

git clone https://github.com/pipecat-ai/pipecat.git
cd pipecat

Install development and testing dependencies:

uv sync --group dev --all-extras \
  --no-extra gstreamer \
  --no-extra krisp \
  --no-extra local \

Install the git pre-commit hooks:
```
uv run pre-commit install
```

Note

: Some extras (local, gstreamer) require system dependencies. See documentation if you encounter build errors.

Running tests

To run all tests, from the root directory:

uv run pytest

Run a specific test suite:

uv run pytest tests/test_name.py

🤝 Contributing

We welcome contributions from the community! Whether you're fixing bugs, improving documentation, or adding new features, here's how you can help:

Found a bug? Open an issue
Have a feature idea? Start a discussion
Want to contribute code? Check our CONTRIBUTING.md guide
Documentation improvements? Docs PRs are always welcome

Before submitting a pull request, please check existing issues and PRs to avoid duplicates.

We aim to review all contributions promptly and provide constructive feedback to help get your changes merged.

🛟 Getting help

➡️ Join our Discord

➡️ Read the docs

➡️ Reach us on X

README.md Unescape Escape

🎙️ Pipecat: Real-Time Voice & Multimodal AI Agents

🚀 What You Can Build

🧠 Why Pipecat?

🌐 Pipecat Ecosystem

📱 Client SDKs

🧭 Structured conversations

🪄 Beautiful UIs

🛠️ Create and deploy projects

🔍 Debugging

🖥️ Terminal

📺️ Pipecat TV Channel

🎬 See it in action

🧩 Available services

⚡ Getting started

🧪 Code examples

🛠️ Contributing to the framework

Prerequisites

Setup Steps

Running tests

🤝 Contributing

🛟 Getting help

README.md