pipecat

Author	SHA1	Message	Date
Paul Kompfner	4d548117fa	feat: add OpenAI Responses API LLM service Add OpenAIResponsesLLMService using the Responses API, with a dedicated adapter that converts LLMContext messages to Responses API input items (system→developer, tool_calls→function_call, tool→function_call_output, multimodal content conversion, and tools schema flattening). - New adapter: open_ai_responses_adapter.py - New service: openai/responses/llm.py - Examples: 07-interruptible and 14-function-calling variants - 19 unit tests for adapter conversion logic - Eval entries for both examples	2026-03-20 21:34:22 -04:00
Paul Kompfner	0a4acfa294	Add frame_order parameter to SyncParallelPipeline Adds a FrameOrder enum with ARRIVAL (default, existing behavior) and PIPELINE (pushes frames in pipeline definition order). This lets callers guarantee output ordering between parallel pipelines — e.g. ensuring image frames precede audio frames — without needing a separate reordering processor downstream. Updates the 05-sync-speech-and-image example to use FrameOrder.PIPELINE, removing the ImageBeforeAudioReorderer class entirely.	2026-03-20 21:34:22 -04:00
Paul Kompfner	d2341e0199	Add ImageBeforeAudioReorderer to sync-speech-and-image example Add a processor after SyncParallelPipeline that ensures each image frame precedes its corresponding TTS audio frames. SyncParallelPipeline batches them together but doesn't guarantee branch ordering. The reorderer detects when TTS frames arrive before their image (via context_id tracking) and holds them until the image arrives. Also rename ImageAudioSync to MarkImageForPlaybackSync for clarity.	2026-03-20 21:34:22 -04:00
Paul Kompfner	7b859423ab	Use TextAggregationMode.TOKEN in the 05-sync-speech-and-image example since the SentenceAggregator already provides complete sentences.	2026-03-20 21:34:22 -04:00
Paul Kompfner	b68495ce0a	Add sync_with_audio support for OutputImageRawFrame Add a `sync_with_audio` field to `OutputImageRawFrame` that routes image frames through the audio queue in the output transport, ensuring images are only displayed after all preceding audio has been sent. This enables proper audio/image synchronization in pipelines like the calendar month narration example. Update the 05-sync-speech-and-image example to use an `ImageAudioSync` processor that sets this flag on image frames.	2026-03-20 21:34:22 -04:00
Mark Backman	98d3f697f1	Add WakePhraseUserTurnStartStrategy (#4064 ) - Add WakePhraseUserTurnStartStrategy for gating interaction behind wake phrase detection, with timeout and single_activation modes - Add default_user_turn_start_strategies() and default_user_turn_stop_strategies() helper functions - Deprecate WakeCheckFilter in favor of the new strategy - Extend ProcessFrameResult to stop strategies for short-circuit evaluation - Fix MinWordsUserTurnStartStrategy including filtered text in output	2026-03-20 21:34:21 -04:00
Blaine Kasten	591c02fb0e	a few updates	2026-03-19 13:37:21 -05:00
Blaine Kasten	077610184d	Add together STT and TTS services	2026-03-17 07:24:02 -05:00
Mark Backman	978a1a2083	Update the system_instruction wording in the foundational examples to not mention WebRTC call	2026-03-13 12:22:10 -04:00
Mark Backman	38a4d4ff23	Update quickstart to use cloud builds	2026-03-12 14:46:49 -04:00
kompfner	36f9a6d809	Merge pull request #4003 from pipecat-ai/pk/fix-deprecated-vad-analyzer-usage Fix deprecated vad_analyzer usage in examples	2026-03-11 20:55:39 -04:00
Paul Kompfner	e456a6bb23	Move away from remaining deprecated `TransportParams.vad_analyzer` usage in example files. Skip updates to deprecated services.	2026-03-11 17:17:40 -04:00
Mark Backman	2d9dc2fa1c	Update quickstart example for 0.0.105	2026-03-11 17:12:59 -04:00
Mark Backman	3ceff3d5fd	Merge pull request #4000 from pipecat-ai/mb/fix-openai-default-model Fix: Restore default model to gpt-4.1 for OpenAI, Azure	2026-03-11 16:29:51 -04:00
Mark Backman	4a45145cba	Restored the default model to gpt-4.1 for OpenAI and Azure LLM services The default model for OpenAILLMService and AzureLLMService was still set to gpt-4o. Restored it to gpt-4.1. Also, removed hardcoded gpt-4o/gpt-4o-mini model references from examples so they pick up the new default.	2026-03-11 16:18:47 -04:00
Paul Kompfner	080ed22ff5	Override CambTTSSettings.voice type from str to int to match Camb.ai's integer voice IDs	2026-03-11 15:44:05 -04:00
Paul Kompfner	51a8a28a99	Prefer Service.ThinkingConfig over raw ThinkingConfig class names in Anthropic and Google services and examples	2026-03-11 12:34:10 -04:00
Aleix Conchillo Flaqué	4c19337d89	Fix examples: Groq model, Google settings class, Nvidia system instruction	2026-03-10 15:29:52 -07:00
Aleix Conchillo Flaqué	a4310d4335	Merge pull request #3980 from pipecat-ai/aleix/move-google-vertex-openai Move Google Vertex and OpenAI LLM modules to subpackages	2026-03-10 13:37:02 -07:00
Aleix Conchillo Flaqué	7be2c43e1d	Update imports to use new google.gemini_live.vertex path	2026-03-10 13:00:31 -07:00
Aleix Conchillo Flaqué	b23652caa6	Update imports to use new google.vertex and google.openai paths	2026-03-10 12:58:04 -07:00
kollaikal-rupesh	80bd935c19	Add ServiceSwitcherStrategyFailover for automatic failover on service errors (#3870 ) * Add ServiceSwitcherStrategyFailover for automatic error-based service switching Introduce a strategy hierarchy: ServiceSwitcherStrategy (base) → ServiceSwitcherStrategyManual (handles ManuallySwitchServiceFrame) → ServiceSwitcherStrategyFailover (adds error-based failover). ServiceSwitcher now defaults to ServiceSwitcherStrategyManual with strategy_type optional. Non-fatal ErrorFrames are forwarded to the strategy via handle_error(). * Move metadata request into _set_active_if_available Requesting metadata is part of making a service active, so it belongs alongside setting _active_service and firing on_service_switched. This removes the duplicate queue_frame calls from ServiceSwitcher push_frame and process_frame.	2026-03-10 15:37:30 -04:00
Aleix Conchillo Flaqué	14dd028b8f	Add custom video track example with per-track params	2026-03-10 11:32:16 -07:00
Paul Kompfner	20c3f553b2	Add missing 55-* update-settings examples for OpenPipe LLM and XTTS TTS	2026-03-09 14:36:15 -04:00
kompfner	c0c49d0ddc	Merge pull request #3964 from pipecat-ai/pk/add-some-missing-55-examples Add missing 55-* update-settings examples for Piper TTS, Kokoro TTS, …	2026-03-09 12:59:36 -04:00
Mark Backman	786279f143	Remove unused imports, 2026-03-07	2026-03-09 12:44:47 -04:00
Paul Kompfner	f1bb065823	Add missing 55-* update-settings examples for Piper TTS, Kokoro TTS, Whisper STT, and Whisper MLX STT Also fix 13e-whisper-mlx.py to pass MLXModel.LARGE_V3_TURBO.value instead of the enum directly.	2026-03-09 11:54:25 -04:00
Mark Backman	c16e534f73	Merge pull request #3952 from pipecat-ai/mb/settings-alias Add Settings class attribute alias to all service classes	2026-03-09 10:45:10 -04:00
Mark Backman	d85ba75dda	Merge pull request #3953 from pipecat-ai/mb/deepgram-flux-on-the-fly Add on-the-fly Configure support for Deepgram Flux STT	2026-03-09 08:36:00 -04:00
Aleix Conchillo Flaqué	1f8cc3d216	Expose on_summary_applied event on LLMAssistantAggregator Forward the on_summary_applied event from the internal summarizer to the aggregator so users can listen for it without accessing private members. Update summarization examples to use the new public event.	2026-03-08 19:02:51 -07:00
Mark Backman	764c3c4f32	Merge pull request #3938 from koriyoshi2041/fix/replace-bare-except-handlers fix: replace bare except handlers with specific exception types	2026-03-08 09:04:49 -04:00
Mark Backman	807759b874	Revert changes to quickstart	2026-03-07 15:44:26 -05:00
Mark Backman	cd28c82de3	Update examples to use the class Settings alias	2026-03-07 09:15:24 -05:00
Mark Backman	c5da3cf2bd	Add on-the-fly Configure support for Deepgram Flux STT Wire up the existing settings update infrastructure to send a Configure WebSocket message when keyterm, eot_threshold, eager_eot_threshold, or eot_timeout_ms change mid-stream, avoiding a full reconnect.	2026-03-07 08:37:27 -05:00
Mark Backman	fdf9fb6f02	Merge pull request #3946 from pipecat-ai/mb/tts-settings-review Review TTS settings	2026-03-07 07:48:26 -05:00
Mark Backman	ec93cd1d51	Fix settings update handling in additional STT services	2026-03-06 21:52:45 -05:00
Mark Backman	750b87dc24	Fix AWS examples, update to sonnet 4.6	2026-03-06 20:53:22 -05:00
Mark Backman	671e9a6846	TTS service and example updates	2026-03-06 20:53:22 -05:00
Mark Backman	2c85d2056c	Examples fixes for Gemini Live	2026-03-06 18:42:22 -05:00
Mark Backman	6431ad8e2a	Fix service settings init ordering and example bugs - Speechmatics: move config build after super().__init__ and settings delta so turn_detection_mode (e.g. ADAPTIVE) takes effect - Google STT: fix example passing bare Language enum instead of list - Google TTS: add missing explicit defaults for all custom settings fields - Soniox: fix accidental tuple wrapping of STT service in example - Speechmatics examples: fix system->user role in kick-off messages - Deepgram Flux: move tag from settings to __init__ (billing metadata) - ElevenLabs STT: default tag_audio_events to None (use API default) - Fal STT: simplify language default handling - Google TTS: rename GoogleStreamTTSSettings to GoogleTTSSettings	2026-03-06 15:17:01 -05:00
Mark Backman	c3794956ef	Add deprecation version, fix foundational example double system message	2026-03-06 15:16:58 -05:00
kompfner	1a1c5668de	Merge pull request #3942 from pipecat-ai/pk/aws-nova-sonic-audio-config Add AudioConfig class to AWSNovaSonicLLMService for non-deprecated au…	2026-03-06 14:58:22 -05:00
Paul Kompfner	9b7a86bb12	Add AudioConfig class to AWSNovaSonicLLMService for non-deprecated audio configuration The audio fields (sample rates, sample sizes, channel counts) on the deprecated `Params` class had no non-deprecated equivalent. This adds an `AudioConfig` class and `audio_config` init arg so users can specify audio configuration without relying on the deprecated `params` parameter.	2026-03-06 14:39:53 -05:00
filipi87	c243850cf1	Removing observer from the inworld example.	2026-03-06 16:14:23 -03:00
Aleix Conchillo Flaqué	e65ceb4edc	Merge pull request #3931 from pipecat-ai/aleix/examples-always-use-user-role Update foundational examples to use system_instruction	2026-03-06 10:41:33 -08:00
Aleix Conchillo Flaqué	593b75bc8b	Update foundational examples to use "user" role Use system_instruction on LLM service constructors instead of adding system messages to LLMContext. Messages added to context now use "user" role.	2026-03-06 09:53:33 -08:00
Paul Kompfner	2b8a6d9ca4	In OpenAI/Azure Realtime examples, migrate to `settings=OpenAIRealtimeLLMSettings(...)` pattern Move `session_properties` and `system_instruction` into the `settings` arg, matching the canonical pattern used across the codebase.	2026-03-06 12:00:41 -05:00
kigland	848f35f5df	fix: replace bare except handlers with specific exception types	2026-03-06 23:05:02 +08:00
Paul Kompfner	5b270fec8e	In AWS Nova Sonic examples, migrate to newer pattern of passing in `settings` with `voice` and `system_instruction`, in favor of passing in `voice_id` as a direct init arg and the system instruction as the first message in the context	2026-03-06 09:57:57 -05:00
Paul Kompfner	78deaa735d	Move `system_instruction` into `LLMSettings` Add `system_instruction` field to `LLMSettings` so it is runtime-updatable via settings. For Google (GoogleLLMService, GoogleVertexLLMService), deprecate the init-time arg since it was already shipped. For Anthropic, AWS Bedrock, and OpenAI, remove the init-time arg entirely since it was never shipped. Still need to handle realtime services (OpenAI Realtime, Grok Realtime, Gemini Live).	2026-03-06 09:57:08 -05:00

1 2 3 4 5 ...

1762 Commits