pipecat

Author	SHA1	Message	Date
joycech333	77cc314a08	feat: add Inception LLM service with Mercury-2 support Adds InceptionLLMService, an OpenAI-compatible service for Inception's Mercury-2 diffusion-based reasoning model. Supports reasoning_effort (instant/low/medium/high) and realtime mode for reduced TTFT.	2026-05-21 11:23:23 -04:00
Mark Backman	c1bf7dbb4a	chore: bump pipecat-ai-prebuilt to 1.0.1	2026-05-20 12:15:09 -04:00
asilvestre	c61672194d	Vonage Video Connector Transport	2026-05-18 14:40:49 +02:00
filipi87	c3338667b1	Mounting the prebuilt frontend UI and root redirect for all transports.	2026-05-15 10:06:47 -03:00
Aleix Conchillo Flaqué	644e106c03	chore(daily): bump daily-python to ~=0.28.0	2026-04-27 13:35:14 -07:00
Mark Backman	815cd44c2a	Merge pull request #4372 from pipecat-ai/mb/relax-frames-proto-5x Relax protobuf pin to support both 5.x and 6.x runtimes	2026-04-27 08:58:23 -04:00
Gökmen Görgen	f75f361629	bump `aic-sdk` to 2.2.0 and update `AICFilter` with `model_id` and `enhancement_level` changes.	2026-04-25 09:51:23 +02:00
Mark Backman	4088992d97	Relax protobuf pin to support both 5.x and 6.x runtimes Pipecat 1.0.8 hard-required protobuf 6.x via the base `protobuf>=6.31.1,<7` pin, blocking users whose dependency graph already constrains protobuf to the 5.x line. The original bump (PR #4136) was only needed because `nvidia-riva-client>=2.25.1` ships gencode compiled with protoc 6.31.1. Changes: - Widen base pin to `protobuf>=5.29.6,<7`. - Regenerate `frames_pb2.py` with `grpcio-tools~=1.67.1` (protoc 5.x). Per Google's cross-version runtime guarantee, 5.x gencode runs on both 5.x and 6.x runtimes, so this single artifact serves all users. - Loosen the dev pin `grpcio-tools` to `>=1.67.1,<2` so contributors can install `pipecat[dev,nvidia]` without resolver conflict. Comment in `frames.proto` documents the 1.67.x requirement for regeneration. - Add an explicit `protobuf>=6.31.1,<7` to the `nvidia` extra. This compensates for nvidia-riva-client's missing `protobuf` install requirement (upstream packaging gap, see https://github.com/nvidia-riva/python-clients/issues/172). When that issue is resolved, the explicit protobuf entry in the `nvidia` extra can be removed. Verified: pipecat imports cleanly on both protobuf 5.29.6 and 6.33.6; `tests/test_protobuf_serializer.py` passes; `import riva.client` succeeds when `pipecat[nvidia]` is installed.	2026-04-24 21:15:32 -04:00
filipi87	ac810e57ed	Merge branch 'main' into filipi/includes_inter_frame_spaces # Conflicts: # uv.lock	2026-04-22 15:22:06 -03:00
filipi87	bba7ca80e3	Bumping to small-webrtc-prebuilt 2.5.0 to fix karaoke highlighting.	2026-04-22 15:20:37 -03:00
Mark Backman	c091232f2f	Add xAI streaming STT service New `XAISTTService` wraps xAI's real-time speech-to-text WebSocket (`wss://api.x.ai/v1/stt`). It extends `WebsocketSTTService`, authenticates with the `XAI_API_KEY` as a Bearer token on the WS handshake, and streams raw audio (PCM/mu-law/A-law) with configurable interim results, endpointing, language, multichannel, and diarization settings. - `src/pipecat/services/xai/stt.py`: new service, settings dataclass, and `language_to_xai_stt_language` helper. - `src/pipecat/services/stt_latency.py`: `XAI_TTFS_P99` default. - `pyproject.toml` / `uv.lock`: `xai` extra now pulls in `websockets-base`. - `README.md`: link to xAI STT in the services table. - `examples/voice/voice-xai.py`: swap DeepgramSTTService for XAISTTService so the xAI voice example is fully xAI. - `examples/transcription/transcription-xai.py`: new transcription-only example using the new service.	2026-04-21 13:45:34 -04:00
dhruvladia-sarvam	f2a19cb1a3	Initial commit for vad parameters on saaras:v3	2026-04-20 13:52:48 +05:30
Aleix Conchillo Flaqué	12b8af3d89	pyproject: use UP ruff linting option	2026-04-16 09:26:12 -07:00
Mark Backman	ba023248d9	Add missing daily-python dependency for tavus extra	2026-04-14 17:48:37 -04:00
Aleix Conchillo Flaqué	a14d257cf2	update pytest to >=9	2026-04-13 15:08:47 -07:00
Cale Shapera	ec574edd53	Add Inworld Realtime Service (#4140 ) * Add Inworld Realtime LLM service Adds a WebSocket-based realtime service for Inworld's cascade STT/LLM/TTS API with semantic VAD, function calling, and streaming transcription support. New files: - src/pipecat/services/inworld/realtime/ (service, events) - src/pipecat/adapters/services/inworld_realtime_adapter.py - examples/foundational/19zb-inworld-realtime.py Also includes: - websockets dependency for inworld extra in pyproject.toml - Adapter and settings tests matching OpenAI/Grok realtime patterns - Fix for double-response when server-side VAD is enabled * Prefer init-provided system instruction in Inworld Realtime Adopt _resolve_system_instruction() from BaseLLMAdapter, matching the pattern applied to OpenAI Realtime, Grok Realtime, Gemini Live, and Nova Sonic in the pk/realtime-services-init-v-context-system-instructions-cleanup branch. * Update changelog entry with PR number * Fix changelog format to use bullet point * Polish PR: default model, example cleanup, changelog update - Change default model from gpt-4.1-nano to gpt-4.1-mini - Add function calling demo to example - Remove demo-testing artifact from system instruction - Mention Router support in changelog * Address PR review feedback for Inworld Realtime - Move example to examples/realtime/realtime-inworld.py - Change initial context role from "user" to "developer" - Remove explicit sample rates from example; sync them in _ensure_audio_config so Inworld gets the transport's actual rates - Add audio race condition guard in _handle_evt_audio_delta (matches OpenAI realtime pattern) - Convert remaining "system"/"developer" messages to "user" in adapter - Add clarifying comment for local-VAD vs server-VAD metrics paths * Simplify example, add provider tracking, remove local VAD path - Remove function calling from example, switch model to xai/grok-4-1-fast-non-reasoning - Add pipecat-realtime session key prefix and provider_data metadata for Inworld traffic attribution - Remove local VAD code path (Inworld only supports server-side VAD) - Use typed InputAudioBufferAppendEvent for audio sends * Default TTS model to inworld-tts-1.5-max * Remove dead shimmed tools code, set STT/VAD defaults - Remove non-functional AdapterType.SHIM custom tools code from adapter - Default STT model to assemblyai/u3-rt-pro - Default VAD eagerness to low	2026-04-09 13:04:17 -04:00
Mark Backman	7f3f23dcb9	Add Mistral Voxtral streaming TTS service Integrate with Mistral's Voxtral TTS API (voxtral-mini-tts-2603) using HTTP streaming with Server-Sent Events. Converts base64-encoded float32 PCM chunks from the API to int16 for the Pipecat pipeline.	2026-04-07 09:39:36 -04:00
Mark Backman	916af84974	Remove DeprecatedModuleProxy and service re-export shims Remove the deprecation proxy infrastructure that allowed old-style flat imports (e.g. `from pipecat.services.openai import OpenAILLMService`). Users must now import from specific submodules (`from pipecat.services.openai.llm import OpenAILLMService`), which is already the established pattern across all internal code and 179+ examples. - Strip 32 proxy `__init__.py` files to empty - Strip 3 non-proxy files with bare star imports (minimax, sambanova, sarvam) - Strip google/gemini_live `__init__.py` re-exports - Remove DeprecatedModuleProxy class and helpers from services/__init__.py - Remove ruff per-file ignore for services/__init__.py - Fix 2 examples using old-style imports	2026-04-03 13:43:02 -04:00
Mark Backman	bfffefa95c	Remove leftover riva and remote-smart-turn references Clean up deprecated extras from pyproject.toml and the docs build script.	2026-04-03 09:29:29 -04:00
Aleix Conchillo Flaqué	f4743a6c91	require python >= 3.11	2026-04-01 19:02:34 -04:00
Aleix Conchillo Flaqué	58b1b7249e	Update onnxruntime to 1.24.3 This version adds support for Python 3.14.	2026-04-01 19:02:32 -04:00
Aleix Conchillo Flaqué	f0d04dde1c	audio(filters): remove KrispFilter	2026-03-30 14:01:06 -07:00
Aleix Conchillo Flaqué	742a278c05	audio(filters): remove NoisereduceFilter	2026-03-30 13:58:35 -07:00
Mark Backman	ccb9dc20f8	Update langchain dependencies to latest major versions Update langchain 0.3→1.2, langchain-community 0.3→0.4, and langchain-openai 0.3→1.1. This also unblocks openai>=2.26 which was previously constrained by the now-removed openpipe package.	2026-03-29 10:17:28 -04:00
Mark Backman	2177e28ee1	Remove OpenPipe integration OpenPipe was acquired by CoreWeave in September 2025. The Python package hasn't been updated since June 2025 and the repo since 2024. The openpipe package caps openai<=1.97.1, creating dependency conflicts with other extras. Remove the dead integration to clean up the codebase.	2026-03-29 10:12:35 -04:00
Mark Backman	a3aeafcb2d	Alphabetize nebius entry in pyproject.toml extras	2026-03-29 08:58:01 -04:00
Arindam200	39919f7889	Add NebiusLLMService for Nebius Token Factory Adds an OpenAI-compatible LLM service for Nebius Token Factory, supporting open-source models (Meta Llama, Qwen, DeepSeek) via their OpenAI-compatible REST API at https://api.tokenfactory.nebius.com/v1/.	2026-03-29 14:35:46 +05:30
Aleix Conchillo Flaqué	fc76b3f2fb	update pyproject.toml and uv.lock	2026-03-27 21:36:03 -07:00
Mark Backman	0798803c70	Bump deepgram-sdk minimum version to 6.1.0	2026-03-27 14:46:17 -04:00
Mark Backman	4e4a8c45d5	build(mem0): bump mem0ai dependency to >=1.0.8,<2	2026-03-26 13:28:41 -04:00
Mark Backman	e58740e948	Bump nltk minimum version to 3.9.4 to resolve CVE-2026-33230	2026-03-25 23:16:46 -04:00
Mark Backman	1f0d9ad01a	Upgrade protobuf to 6.x for nvidia-riva-client 2.25.1 compatibility nvidia-riva-client 2.25.1 ships with gencode compiled against protobuf 6.31.1, which requires a runtime >= 6.31.1. Update protobuf from 5.29.6 to >=6.31.1,<7 and grpcio-tools from 1.67.1 to 1.78.0 to match. Regenerate frames_pb2.py with the new compiler.	2026-03-25 15:23:53 -04:00
Nicholas Zhao	02b97035f8	Add xAI TTS service	2026-03-25 10:45:15 -04:00
Mark Backman	6eb988b729	Merge pull request #4092 from harshitajain165/harshita/smallest-tts-only Add Smallest AI TTS service integration	2026-03-24 11:54:34 -04:00
Mark Backman	51d28b4a9f	Code review fixes	2026-03-24 11:21:04 -04:00
kompfner	cf083b8411	Merge pull request #4078 from pipecat-ai/cb/gemini-updates Updates for Gemini Live	2026-03-24 11:18:00 -04:00
Harshita Jain	099814d74a	Add Smallest AI TTS service integration Adds SmallestTTSService, a WebSocket-based TTS service using Smallest AI's Lightning v3.1 model. Follows current Pipecat service conventions: - SmallestTTSSettings dataclass with runtime-updatable settings (voice, language, speed, etc.) - Reconnects on model change; keepalive every 30s to prevent idle timeout - TTS settings default to None so the API applies its own defaults - Model enum: SmallestTTSModel.LIGHTNING_V3_1 Includes a foundational example (07zl-interruptible-smallest.py) using Deepgram STT + Smallest TTS + OpenAI LLM. STT integration will follow in a separate PR once the hallucination/finalize behaviour is resolved. Made-with: Cursor	2026-03-24 11:11:10 -04:00
Mark Backman	aa0b49d69f	Code review fixes	2026-03-24 09:22:08 -04:00
Alex-wuhu	8c6f4a8d7b	Add Novita AI LLM service provider	2026-03-24 09:20:50 -04:00
filipi87	8612c9f50a	Updating to use daily-python 0.27.0	2026-03-23 17:52:41 -03:00
Chad Bailey	38d7882f0f	updated context seeding to allow gemini 3.1 to greet the user	2026-03-18 21:28:17 +00:00
Mark Backman	dc1632bbac	Merge pull request #4023 from pipecat-ai/mb/update-small-webrtc-prebuilt-2.4.0	2026-03-16 21:09:08 -04:00
Mark Backman	154a8d1987	Merge pull request #4035 from pipecat-ai/mb/bump-pyjwt-version	2026-03-16 21:06:31 -04:00
Aleix Conchillo Flaqué	5c685c35d7	pyproject: update daily-python to 0.25.0	2026-03-16 17:41:44 -07:00
Mark Backman	24c3d23229	Bump PyJWT minimum version to 2.12.0 for CVE-2026-32597 Addresses Dependabot alert #165 (GHSA-752w-5fwx-jx9f) where PyJWT <= 2.11.0 accepts unknown `crit` header extensions.	2026-03-15 08:53:06 -04:00
Mark Backman	1064482ade	Update pipecat-ai-small-webrtc-prebuilt to 2.4.0	2026-03-13 10:20:51 -04:00
Mark Backman	a9e124b84f	Update sarvamai dependency from 0.1.26a2 to 0.1.26 Bump the Sarvam AI SDK to the stable release version.	2026-03-11 14:17:40 -04:00
Aleix Conchillo Flaqué	00eb190424	Update daily-python to 0.24.0	2026-03-09 20:47:13 -07:00
Mark Backman	06e49d597b	Update dev dependencies	2026-03-05 15:23:07 -05:00
Mark Backman	60e9e26164	revert onnxruntime to onnxruntime~=1.23.2 to maintain Python 3.10 support	2026-03-05 15:13:28 -05:00

1 2 3 4 5 ...

452 Commits