pipecat

Author	SHA1	Message	Date
Mark Backman	28f9203401	Code review fixes	2026-05-21 11:45:17 -04:00
Mark Backman	c1bf7dbb4a	chore: bump pipecat-ai-prebuilt to 1.0.1	2026-05-20 12:15:09 -04:00
asilvestre	e2d249e5d9	adding uv.lock	2026-05-19 16:33:38 +02:00
Filipi da Silva Fuchter	c51a817efa	Merge pull request #4442 from pipecat-ai/filipi/runner_all_transports Unified start route to make all transports available	2026-05-18 09:27:44 -03:00
filipi87	c3338667b1	Mounting the prebuilt frontend UI and root redirect for all transports.	2026-05-15 10:06:47 -03:00
Aleix Conchillo Flaqué	7b6d878f07	update uv.lock	2026-05-14 14:41:38 -07:00
Mark Backman	5b33964a1b	Update uv.lock for urllib3 and langchain-core	2026-05-11 15:51:01 -04:00
Aleix Conchillo Flaqué	644e106c03	chore(daily): bump daily-python to ~=0.28.0	2026-04-27 13:35:14 -07:00
Mark Backman	815cd44c2a	Merge pull request #4372 from pipecat-ai/mb/relax-frames-proto-5x Relax protobuf pin to support both 5.x and 6.x runtimes	2026-04-27 08:58:23 -04:00
Gökmen Görgen	f75f361629	bump `aic-sdk` to 2.2.0 and update `AICFilter` with `model_id` and `enhancement_level` changes.	2026-04-25 09:51:23 +02:00
Mark Backman	4088992d97	Relax protobuf pin to support both 5.x and 6.x runtimes Pipecat 1.0.8 hard-required protobuf 6.x via the base `protobuf>=6.31.1,<7` pin, blocking users whose dependency graph already constrains protobuf to the 5.x line. The original bump (PR #4136) was only needed because `nvidia-riva-client>=2.25.1` ships gencode compiled with protoc 6.31.1. Changes: - Widen base pin to `protobuf>=5.29.6,<7`. - Regenerate `frames_pb2.py` with `grpcio-tools~=1.67.1` (protoc 5.x). Per Google's cross-version runtime guarantee, 5.x gencode runs on both 5.x and 6.x runtimes, so this single artifact serves all users. - Loosen the dev pin `grpcio-tools` to `>=1.67.1,<2` so contributors can install `pipecat[dev,nvidia]` without resolver conflict. Comment in `frames.proto` documents the 1.67.x requirement for regeneration. - Add an explicit `protobuf>=6.31.1,<7` to the `nvidia` extra. This compensates for nvidia-riva-client's missing `protobuf` install requirement (upstream packaging gap, see https://github.com/nvidia-riva/python-clients/issues/172). When that issue is resolved, the explicit protobuf entry in the `nvidia` extra can be removed. Verified: pipecat imports cleanly on both protobuf 5.29.6 and 6.33.6; `tests/test_protobuf_serializer.py` passes; `import riva.client` succeeds when `pipecat[nvidia]` is installed.	2026-04-24 21:15:32 -04:00
filipi87	6b1d8d9fa5	Fixing merge conflicts.	2026-04-22 15:22:32 -03:00
Mark Backman	c091232f2f	Add xAI streaming STT service New `XAISTTService` wraps xAI's real-time speech-to-text WebSocket (`wss://api.x.ai/v1/stt`). It extends `WebsocketSTTService`, authenticates with the `XAI_API_KEY` as a Bearer token on the WS handshake, and streams raw audio (PCM/mu-law/A-law) with configurable interim results, endpointing, language, multichannel, and diarization settings. - `src/pipecat/services/xai/stt.py`: new service, settings dataclass, and `language_to_xai_stt_language` helper. - `src/pipecat/services/stt_latency.py`: `XAI_TTFS_P99` default. - `pyproject.toml` / `uv.lock`: `xai` extra now pulls in `websockets-base`. - `README.md`: link to xAI STT in the services table. - `examples/voice/voice-xai.py`: swap DeepgramSTTService for XAISTTService so the xAI voice example is fully xAI. - `examples/transcription/transcription-xai.py`: new transcription-only example using the new service.	2026-04-21 13:45:34 -04:00
Paul Kompfner	81571beb1b	Use ExternalUserTurnStrategies, as expected, in a Deepgram Flux example	2026-04-21 10:51:59 -04:00
dyi1	b8a1f45d4c	Improve HeyGen LiveAvatar plugin reliability and performance (#4312) * Improve HeyGen LiveAvatar plugin reliability and performance - Add WebSocket ready gate: wait for session.state_updated connected event before sending commands (prevents silently dropped messages) - Add keep-alive mechanism: send session.keep_alive every 2.5 min to prevent 5-minute inactivity timeout - Optimize audio chunking: 600ms first chunk for faster initial response, 1s subsequent chunks for efficient streaming - Fix audio buffer flush: send remaining buffered audio on utterance end instead of discarding it - Fix WS state cleanup: properly reset connected/ready state when WebSocket drops unexpectedly - Add livekit_config passthrough in LiveAvatar session token creation - Replace stray print() with logger.debug() * Fix HeyGenOutputTransport.start() signature and use 400ms first chunk - Update transport.py to match new client.start() signature (no audio_chunk_size param) - Change first chunk size from 600ms to 400ms per feedback * Fix transport audio resampling and client.start() error propagation - Add audio resampling in HeyGenOutputTransport.write_audio_frame() to ensure audio is always 24kHz before sending to HeyGen (was sending at pipeline sample rate, causing garbled audio) - Raise exception on WS ready timeout instead of silently returning, preventing transport from appearing ready when WS connection failed * Fix session readiness gate to work with LITE mode LITE mode does not send session.state_updated WS events. Instead, use a dual-signal _session_ready event that fires on either: - WS session.state_updated connected (FULL mode) - LiveKit participant connected (LITE mode) Also reorder start() to connect both WS and LiveKit before waiting, since the WS events may depend on LiveKit being connected. Verified with live sandbox session - all tests pass. * Simplify session readiness to use only WS ready gate Remove _session_ready dual-signal and use only _ws_ready, which fires on the session.state_updated connected WS event. Increase timeout to 30s. LiveKit is connected before waiting so the WS event can arrive. * Reduce WS ready gate timeout back to 10s * Remove WS ready gate (session.state_updated not reliably received) The session.state_updated connected event is not reliably received via the websockets library. Remove the gate for now and assume the session is ready after WS + LiveKit connect. Keep-alive, chunking, buffer flush, state cleanup, and other improvements remain.	2026-04-16 12:58:14 -04:00
Mark Backman	7291026695	Update Tavus transport example Show how to use on_connected event handler to obtain Daily room URL	2026-04-15 23:04:31 -04:00
Aleix Conchillo Flaqué	a14d257cf2	update pytest to >=9	2026-04-13 15:08:47 -07:00
Aleix Conchillo Flaqué	a8660aabfe	update uv.lock	2026-04-13 15:06:25 -07:00
Mark Backman	d942a713af	Update uv.lock resolving langchain-core and cryptography vulnerabilities	2026-04-13 11:09:31 -04:00
Cale Shapera	ec574edd53	Add Inworld Realtime Service (#4140) * Add Inworld Realtime LLM service Adds a WebSocket-based realtime service for Inworld's cascade STT/LLM/TTS API with semantic VAD, function calling, and streaming transcription support. New files: - src/pipecat/services/inworld/realtime/ (service, events) - src/pipecat/adapters/services/inworld_realtime_adapter.py - examples/foundational/19zb-inworld-realtime.py Also includes: - websockets dependency for inworld extra in pyproject.toml - Adapter and settings tests matching OpenAI/Grok realtime patterns - Fix for double-response when server-side VAD is enabled * Prefer init-provided system instruction in Inworld Realtime Adopt _resolve_system_instruction() from BaseLLMAdapter, matching the pattern applied to OpenAI Realtime, Grok Realtime, Gemini Live, and Nova Sonic in the pk/realtime-services-init-v-context-system-instructions-cleanup branch. * Update changelog entry with PR number * Fix changelog format to use bullet point * Polish PR: default model, example cleanup, changelog update - Change default model from gpt-4.1-nano to gpt-4.1-mini - Add function calling demo to example - Remove demo-testing artifact from system instruction - Mention Router support in changelog * Address PR review feedback for Inworld Realtime - Move example to examples/realtime/realtime-inworld.py - Change initial context role from "user" to "developer" - Remove explicit sample rates from example; sync them in _ensure_audio_config so Inworld gets the transport's actual rates - Add audio race condition guard in _handle_evt_audio_delta (matches OpenAI realtime pattern) - Convert remaining "system"/"developer" messages to "user" in adapter - Add clarifying comment for local-VAD vs server-VAD metrics paths * Simplify example, add provider tracking, remove local VAD path - Remove function calling from example, switch model to xai/grok-4-1-fast-non-reasoning - Add pipecat-realtime session key prefix and provider_data metadata for Inworld traffic attribution - Remove local VAD code path (Inworld only supports server-side VAD) - Use typed InputAudioBufferAppendEvent for audio sends * Default TTS model to inworld-tts-1.5-max * Remove dead shimmed tools code, set STT/VAD defaults - Remove non-functional AdapterType.SHIM custom tools code from adapter - Default STT model to assemblyai/u3-rt-pro - Default VAD eagerness to low	2026-04-09 13:04:17 -04:00
Mark Backman	7f3f23dcb9	Add Mistral Voxtral streaming TTS service Integrate with Mistral's Voxtral TTS API (voxtral-mini-tts-2603) using HTTP streaming with Server-Sent Events. Converts base64-encoded float32 PCM chunks from the API to int16 for the Pipecat pipeline.	2026-04-07 09:39:36 -04:00
Mark Backman	e1638a9342	Clean up docs config after riva removal and add missing modules Remove stale riva mock imports from autodoc_mock_imports since the riva service was removed and nvidia-riva-client is installed during doc builds. Add pipecat.turns and pipecat.extensions to import_core_modules() and add Turns to the index.rst toctree. Regenerate uv.lock to reflect the riva extra removal from pyproject.toml.	2026-04-03 09:52:31 -04:00
Aleix Conchillo Flaqué	a6013ba437	update uv.lock	2026-04-01 19:12:39 -07:00
Mark Backman	8a794424dd	Update uv.lock	2026-04-01 19:05:17 -04:00
Aleix Conchillo Flaqué	58b1b7249e	Update onnxruntime to 1.24.3 This version adds support for Python 3.14.	2026-04-01 19:02:32 -04:00
Aleix Conchillo Flaqué	ece4d0661e	update uv.lock	2026-03-30 15:06:05 -07:00
Mark Backman	b6579dc763	Update uv lock with latest versions of Pygments and cryptography	2026-03-29 10:20:45 -04:00
Mark Backman	ccb9dc20f8	Update langchain dependencies to latest major versions Update langchain 0.3→1.2, langchain-community 0.3→0.4, and langchain-openai 0.3→1.1. This also unblocks openai>=2.26 which was previously constrained by the now-removed openpipe package.	2026-03-29 10:17:28 -04:00
Mark Backman	2177e28ee1	Remove OpenPipe integration OpenPipe was acquired by CoreWeave in September 2025. The Python package hasn't been updated since June 2025 and the repo since 2024. The openpipe package caps openai<=1.97.1, creating dependency conflicts with other extras. Remove the dead integration to clean up the codebase.	2026-03-29 10:12:35 -04:00
Arindam200	39919f7889	Add NebiusLLMService for Nebius Token Factory Adds an OpenAI-compatible LLM service for Nebius Token Factory, supporting open-source models (Meta Llama, Qwen, DeepSeek) via their OpenAI-compatible REST API at https://api.tokenfactory.nebius.com/v1/.	2026-03-29 14:35:46 +05:30
Aleix Conchillo Flaqué	fc76b3f2fb	update pyproject.toml and uv.lock	2026-03-27 21:36:03 -07:00
Mark Backman	0798803c70	Bump deepgram-sdk minimum version to 6.1.0	2026-03-27 14:46:17 -04:00
Mark Backman	4e4a8c45d5	build(mem0): bump mem0ai dependency to >=1.0.8,<2	2026-03-26 13:28:41 -04:00
Mark Backman	e58740e948	Bump nltk minimum version to 3.9.4 to resolve CVE-2026-33230	2026-03-25 23:16:46 -04:00
Mark Backman	1f0d9ad01a	Upgrade protobuf to 6.x for nvidia-riva-client 2.25.1 compatibility nvidia-riva-client 2.25.1 ships with gencode compiled against protobuf 6.31.1, which requires a runtime >= 6.31.1. Update protobuf from 5.29.6 to >=6.31.1,<7 and grpcio-tools from 1.67.1 to 1.78.0 to match. Regenerate frames_pb2.py with the new compiler.	2026-03-25 15:23:53 -04:00
Mark Backman	adc003d6c7	Code review cleanup	2026-03-25 10:53:07 -04:00
Mark Backman	6eb988b729	Merge pull request #4092 from harshitajain165/harshita/smallest-tts-only Add Smallest AI TTS service integration	2026-03-24 11:54:34 -04:00
Mark Backman	51d28b4a9f	Code review fixes	2026-03-24 11:21:04 -04:00
kompfner	cf083b8411	Merge pull request #4078 from pipecat-ai/cb/gemini-updates Updates for Gemini Live	2026-03-24 11:18:00 -04:00
Harshita Jain	099814d74a	Add Smallest AI TTS service integration Adds SmallestTTSService, a WebSocket-based TTS service using Smallest AI's Lightning v3.1 model. Follows current Pipecat service conventions: - SmallestTTSSettings dataclass with runtime-updatable settings (voice, language, speed, etc.) - Reconnects on model change; keepalive every 30s to prevent idle timeout - TTS settings default to None so the API applies its own defaults - Model enum: SmallestTTSModel.LIGHTNING_V3_1 Includes a foundational example (07zl-interruptible-smallest.py) using Deepgram STT + Smallest TTS + OpenAI LLM. STT integration will follow in a separate PR once the hallucination/finalize behaviour is resolved. Made-with: Cursor	2026-03-24 11:11:10 -04:00
Mark Backman	aa0b49d69f	Code review fixes	2026-03-24 09:22:08 -04:00
Aleix Conchillo Flaque	9211379720	update uv.lock	2026-03-23 20:06:28 -07:00
filipi87	8612c9f50a	Updating to use daily-python 0.27.0	2026-03-23 17:52:41 -03:00
Chad Bailey	38d7882f0f	updated context seeding to allow gemini 3.1 to greet the user	2026-03-18 21:28:17 +00:00
Mark Backman	05abc95b5f	Update uv.lock with pyasn1 v0.6.3	2026-03-17 16:10:35 -04:00
Mark Backman	dc1632bbac	Merge pull request #4023 from pipecat-ai/mb/update-small-webrtc-prebuilt-2.4.0	2026-03-16 21:09:08 -04:00
Mark Backman	154a8d1987	Merge pull request #4035 from pipecat-ai/mb/bump-pyjwt-version	2026-03-16 21:06:31 -04:00
Aleix Conchillo Flaqué	5c685c35d7	pyproject: update daily-python to 0.25.0	2026-03-16 17:41:44 -07:00
Mark Backman	24c3d23229	Bump PyJWT minimum version to 2.12.0 for CVE-2026-32597 Addresses Dependabot alert #165 (GHSA-752w-5fwx-jx9f) where PyJWT <= 2.11.0 accepts unknown `crit` header extensions.	2026-03-15 08:53:06 -04:00
Mark Backman	1064482ade	Update pipecat-ai-small-webrtc-prebuilt to 2.4.0	2026-03-13 10:20:51 -04:00


168 Commits