pipecat

Author	SHA1	Message	Date
Cale Shapera	ec574edd53	Add Inworld Realtime Service (#4140 ) * Add Inworld Realtime LLM service Adds a WebSocket-based realtime service for Inworld's cascade STT/LLM/TTS API with semantic VAD, function calling, and streaming transcription support. New files: - src/pipecat/services/inworld/realtime/ (service, events) - src/pipecat/adapters/services/inworld_realtime_adapter.py - examples/foundational/19zb-inworld-realtime.py Also includes: - websockets dependency for inworld extra in pyproject.toml - Adapter and settings tests matching OpenAI/Grok realtime patterns - Fix for double-response when server-side VAD is enabled * Prefer init-provided system instruction in Inworld Realtime Adopt _resolve_system_instruction() from BaseLLMAdapter, matching the pattern applied to OpenAI Realtime, Grok Realtime, Gemini Live, and Nova Sonic in the pk/realtime-services-init-v-context-system-instructions-cleanup branch. * Update changelog entry with PR number * Fix changelog format to use bullet point * Polish PR: default model, example cleanup, changelog update - Change default model from gpt-4.1-nano to gpt-4.1-mini - Add function calling demo to example - Remove demo-testing artifact from system instruction - Mention Router support in changelog * Address PR review feedback for Inworld Realtime - Move example to examples/realtime/realtime-inworld.py - Change initial context role from "user" to "developer" - Remove explicit sample rates from example; sync them in _ensure_audio_config so Inworld gets the transport's actual rates - Add audio race condition guard in _handle_evt_audio_delta (matches OpenAI realtime pattern) - Convert remaining "system"/"developer" messages to "user" in adapter - Add clarifying comment for local-VAD vs server-VAD metrics paths * Simplify example, add provider tracking, remove local VAD path - Remove function calling from example, switch model to xai/grok-4-1-fast-non-reasoning - Add pipecat-realtime session key prefix and provider_data metadata for Inworld traffic attribution - Remove local VAD code path (Inworld only supports server-side VAD) - Use typed InputAudioBufferAppendEvent for audio sends * Default TTS model to inworld-tts-1.5-max * Remove dead shimmed tools code, set STT/VAD defaults - Remove non-functional AdapterType.SHIM custom tools code from adapter - Default STT model to assemblyai/u3-rt-pro - Default VAD eagerness to low	2026-04-09 13:04:17 -04:00
filipi87	edc197d050	Creating a new example for async stream using Google.	2026-04-09 09:50:00 -03:00
filipi87	7ece8e3c4a	Creating a new example for async stream using Anthropic.	2026-04-09 09:41:07 -03:00
filipi87	a544f885a3	Added new examples: function-calling-openai-async-stream.py and function-calling-openai-responses-async-stream.py	2026-04-09 09:04:06 -03:00
Mark Backman	0acfb4dd49	Merge pull request #4251 from pipecat-ai/mb/mistral-tts Add Mistral Voxtral streaming TTS service	2026-04-07 12:50:48 -04:00
Mark Backman	aa7a014518	Add mistral voice example	2026-04-07 12:32:06 -04:00
Filipi da Silva Fuchter	6eccd16543	Merge pull request #4217 from pipecat-ai/filipi/async_tools Supporting async function calls.	2026-04-07 09:35:03 -03:00
filipi87	d8dc6bc7d0	New example for async function calls using Google.	2026-04-07 09:31:22 -03:00
filipi87	d12a8529e2	New example for async function calls using OpenAI responses.	2026-04-07 09:28:01 -03:00
filipi87	aa061f7e2c	Renaming the openai and anthropic examples to async instead of delayed.	2026-04-07 09:23:45 -03:00
Filipi da Silva Fuchter	e863293198	Improving docstring description. Co-authored-by: kompfner <paul@daily.co>	2026-04-07 08:14:39 -04:00
Filipi da Silva Fuchter	a451c42dc7	Merge pull request #4247 from pipecat-ai/filipi/background_sound_example Fixing the background sound example.	2026-04-07 09:06:14 -03:00
filipi87	ceaa27ee6e	Fixing the background sound example.	2026-04-06 18:25:30 -03:00
Mark Backman	916af84974	Remove DeprecatedModuleProxy and service re-export shims Remove the deprecation proxy infrastructure that allowed old-style flat imports (e.g. `from pipecat.services.openai import OpenAILLMService`). Users must now import from specific submodules (`from pipecat.services.openai.llm import OpenAILLMService`), which is already the established pattern across all internal code and 179+ examples. - Strip 32 proxy `__init__.py` files to empty - Strip 3 non-proxy files with bare star imports (minimax, sambanova, sarvam) - Strip google/gemini_live `__init__.py` re-exports - Remove DeprecatedModuleProxy class and helpers from services/__init__.py - Remove ruff per-file ignore for services/__init__.py - Fix 2 examples using old-style imports	2026-04-03 13:43:02 -04:00
Mark Backman	c2358b273b	Use Parameters instead of Attributes in docstrings to fix duplicate object warnings Napoleon's Attributes section creates class-level attribute docs that duplicate the __init__ parameter docs when napoleon_include_init_with_doc is enabled. Using Parameters avoids the duplication.	2026-04-03 10:36:36 -04:00
Mark Backman	8adb38f87c	Remove unused imports across codebase	2026-04-02 22:21:16 -04:00
vipyne	1d7404ef21	Update MCP examples	2026-04-02 18:15:56 -05:00
Om Chauhan	e22f9f84bb	fixed MCPClient to reuse session across tool calls	2026-04-02 18:06:28 -05:00
filipi87	7af72eee3e	Creating new delayed examples for openai and anthropic.	2026-04-02 18:40:41 -03:00
filipi87	3724ecd378	Supporting async function calls.	2026-04-02 16:58:19 -03:00
Mark Backman	0c59819682	Remove allow_interruptions from voice-sarvam example This was missed from the allow_interruptions removal commit.	2026-04-02 11:32:44 -04:00
Mark Backman	e74930b954	Remove deprecated text_aggregator and text_filter params from TTS Remove the deprecated text_aggregator parameter from TTSService, CartesiaTTSService, and RimeTTSService, and the deprecated text_filter parameter from TTSService. Users should use LLMTextProcessor before the TTS service instead. Update the voice-switching example to use LLMTextProcessor with PatternPairAggregator.	2026-04-01 17:03:05 -04:00
Harshita Jain	bd6cbd7fe7	feat: add Smallest AI STT service integration (#4162 ) Add SmallestSTTService using the Pulse WebSocket API for real-time transcription. Includes SmallestSTTSettings dataclass, 32-language support with resolve_language fallback, VAD-driven finalize signal, and SMALLEST_TTFS_P99 latency constant. Also adds X-Source and X-Pipecat-Version headers to Smallest STT and TTS WebSocket connections.	2026-04-01 13:44:04 -04:00
Mark Backman	3ca656cae5	Update simli name to match others	2026-03-31 22:54:21 -04:00
Mark Backman	d3021b4590	Rename example files to prepend parent folder name, preventing package shadowing Example files like openai.py shadow installed packages when Python adds the script directory to sys.path. Prepend the parent folder name to each example file (e.g. openai.py -> function-calling-openai.py). Also split thinking-and-mcp/ into separate mcp/ and thinking/ directories.	2026-03-31 22:06:01 -04:00
Mark Backman	7501effad5	Remove deprecated service module shims and old implementations Delete deprecated import shims that only re-export from new locations: - services/ai_services.py - services/gemini_multimodal_live/ - services/aws_nova_sonic/ - services/openai_realtime/ - services/deepgram/{stt,tts}_sagemaker.py - services/google/{llm_openai,llm_vertex,google}.py - services/google/gemini_live/llm_vertex.py - services/riva/ - services/nim/ Remove deprecated implementations replaced by newer services: - services/openai_realtime_beta/ (use openai.realtime) - services/google/openai/ (use google.llm) Also removes associated examples and tests for deleted services.	2026-03-31 15:34:14 -04:00
Mark Backman	27cb078716	Add missing google-vertex.py file	2026-03-31 15:25:52 -04:00
Mark Backman	47b41a0ff7	Rename services/ to voice/ and function-calling/, flatten to top level Replace the nested services/speech/ and services/function-calling/ with top-level voice/ and function-calling/ directories. Update eval script paths and README to match.	2026-03-31 15:20:03 -04:00
Mark Backman	f14638a1fd	Revert "Flatten services/ nesting: promote speech and function-calling to top level" This reverts commit `e1939ecd44`.	2026-03-31 14:59:23 -04:00
Mark Backman	e1939ecd44	Flatten services/ nesting: promote speech and function-calling to top level Move services/speech/* directly into services/ and services/function-calling/* into top-level function-calling/. Update eval script paths and README.	2026-03-31 14:55:22 -04:00
Mark Backman	1d85aedcae	Split features/ into audio/, observability/, and rag/ subfolders Extract focused example groups from the catch-all features/ folder: - audio/: audio recording, background sound, sound effects - observability/: observer, heartbeats, sentry metrics - rag/: mem0, gemini-rag, gemini grounding metadata Update README to document the new folders.	2026-03-31 13:15:06 -04:00
Mark Backman	e719cbbe6d	Reorganize examples into topic-based subfolders Move 304 examples from a flat numbered directory into 14 descriptive subfolders: getting-started, services (speech + function-calling), transcription, vision, realtime, persistent-context, context-summarization, update-settings (stt/tts/llm), turn-management, thinking-and-mcp, transports, video-avatar, video-processing, and features. Strip numbered prefixes from filenames (e.g. 07c-interruptible-deepgram.py becomes services/speech/deepgram.py) since the folder context makes them redundant. Keep numbered prefixes only in getting-started/ where ordering matters. Update eval script paths and README to match the new structure.	2026-03-31 13:12:24 -04:00
Mark Backman	f2ce7ececc	Move foundational examples to examples/	2026-03-31 13:12:24 -04:00
kompfner	bd7496fa27	Merge pull request #4211 from pipecat-ai/pk/openai-responses-websocket-service-refactor Introduce WebsocketLLMService and refactor OpenAIResponsesLLMService …	2026-03-31 13:02:45 -04:00
Paul Kompfner	0a8bcf58c4	Register on_connection_error event handler in WebsocketLLMService	2026-03-31 10:52:33 -04:00
Paul Kompfner	30903042e5	Work around OpenAI Python SDK temperature bug in example	2026-03-31 10:16:30 -04:00
Mark Backman	32022a952e	Merge pull request #4205 from pipecat-ai/mb/remove-quickstart Remove quickstart example from repo	2026-03-30 18:58:49 -04:00
Mark Backman	b78ae40d3c	Remove quickstart example from repo	2026-03-30 18:20:41 -04:00
Aleix Conchillo Flaqué	dd1bea2a5f	audio(turn): remove FalSmartTurnAnalyzer and LocalSmartTurnAnalyzer	2026-03-30 14:04:29 -07:00
Aleix Conchillo Flaqué	f0d04dde1c	audio(filters): remove KrispFilter	2026-03-30 14:01:06 -07:00
Paul Kompfner	1c8d31de70	Add trace logging for previous_response_id decisions and fix example Add detailed trace-level logging to _apply_previous_response_optimization showing why the optimization was applied or fell back to full context, including the relevant data for debugging. Use append_to_context=False for the filler TTSSpeakFrame in the function-calling example to avoid altering the conversation history and breaking the previous_response_id prefix match.	2026-03-30 09:59:03 -04:00
Paul Kompfner	f2a8a9e753	Add WebSocket-based OpenAI Responses LLM service with previous_response_id optimization Introduce a WebSocket variant of the OpenAI Responses API service that maintains a persistent connection to wss://api.openai.com/v1/responses for lower-latency inference. The WebSocket variant automatically uses previous_response_id to send only incremental context when possible, falling back to full context on reconnection or cache miss. The WebSocket variant becomes the new default OpenAIResponsesLLMService, and the HTTP variant is renamed to OpenAIResponsesHttpLLMService. Both share a private base class with common settings, parameter building, and run_inference (always HTTP) logic.	2026-03-30 09:58:56 -04:00
Mark Backman	8c9e189394	Fix langchain imports for langchain 1.x compatibility ChatPromptTemplate moved from langchain.prompts to langchain_core.prompts in langchain 1.x.	2026-03-29 10:27:48 -04:00
Mark Backman	2177e28ee1	Remove OpenPipe integration OpenPipe was acquired by CoreWeave in September 2025. The Python package hasn't been updated since June 2025 and the repo since 2024. The openpipe package caps openai<=1.97.1, creating dependency conflicts with other extras. Remove the dead integration to clean up the codebase.	2026-03-29 10:12:35 -04:00
Mark Backman	63254fe337	Add NebiusLLMService with developer role and tool support fixes - Add Nebius LLM service wrapping OpenAI-compatible Token Factory API - Set supports_developer_role = False (Nebius rejects developer role) - Default to openai/gpt-oss-120b model (supports function calling) - Add Nebius function-calling example and env.example entry - Fix Sarvam developer role support - Update examples to use developer role for intro messages	2026-03-29 08:50:11 -04:00
Aleix Conchillo Flaqué	8b64166bb7	Fix Sarvam examples to use 'user' role instead of 'developer' Sarvam uses the OpenAI-compatible API but does not support the 'developer' role, causing errors. Use 'user' role instead.	2026-03-27 20:33:25 -07:00
Paul Kompfner	5caf53f086	Tweak 26i example system instruction for Gemini 3.1 Flash Live compatibility Gemini 3.1 Flash Live won't reliably report ending its turn until after it says something following a tool call. Restructure the system instruction so the model says goodbye after calling end_conversation, and add a comment explaining the deferred EndFrame behavior that makes this work.	2026-03-27 17:13:17 -04:00
Paul Kompfner	04adb697be	Warn when TEXT modality is set for Gemini Live, and remove 26d text example All recent Gemini Live models (including the default gemini-2.5-flash-native-audio-preview-12-2025, and going at least as far back as gemini-2.5-flash-native-audio-preview-09-2025) only support AUDIO as a response modality. We considered using `modalities=TEXT` as a Pipecat-level signal to suppress audio output frames (so developers could pair Gemini Live with an external TTS), but the output transcription from the API arrives too late relative to the audio to be useful for driving an external TTS service. For now, just log a warning when a TEXT modality is configured (at init or via set_model_modalities) and proceed as normal. The 26d text-modality example is removed since it no longer represents a viable configuration.	2026-03-27 16:21:15 -04:00
filipi87	f9670b9601	Removing the models from the Inworld example so we can use the default model.	2026-03-27 14:23:20 -03:00
Mark Backman	cbb3d99493	Merge pull request #4166 from pipecat-ai/mb/fix-example-ordering-56 Fix example numbering, add LemonSlice to evals	2026-03-27 10:29:07 -04:00

1 2 3 4 5 ...

1857 Commits