pipecat

Author	SHA1	Message	Date
Mark Backman	63254fe337	Add NebiusLLMService with developer role and tool support fixes - Add Nebius LLM service wrapping OpenAI-compatible Token Factory API - Set supports_developer_role = False (Nebius rejects developer role) - Default to openai/gpt-oss-120b model (supports function calling) - Add Nebius function-calling example and env.example entry - Fix Sarvam developer role support - Update examples to use developer role for intro messages	2026-03-29 08:50:11 -04:00
Arindam200	39919f7889	Add NebiusLLMService for Nebius Token Factory Adds an OpenAI-compatible LLM service for Nebius Token Factory, supporting open-source models (Meta Llama, Qwen, DeepSeek) via their OpenAI-compatible REST API at https://api.tokenfactory.nebius.com/v1/.	2026-03-29 14:35:46 +05:30
Aleix Conchillo Flaqué	a84c69858e	Merge pull request #4185 from pipecat-ai/changelog-0.0.108 Release 0.0.108 - Changelog Update v0.0.108	2026-03-27 21:47:53 -07:00
aconchillo	ca224219dc	Update changelog for version 0.0.108	2026-03-27 21:43:37 -07:00
Aleix Conchillo Flaqué	83dc979d19	Merge pull request #4186 from pipecat-ai/mb/fix-websocket-disconnect-race-condition Fix FastAPI WebSocket disconnect race condition	2026-03-27 21:40:21 -07:00
Aleix Conchillo Flaqué	fc76b3f2fb	update pyproject.toml and uv.lock	2026-03-27 21:36:03 -07:00
Mark Backman	4670370dbb	Add changelog for #4186	2026-03-28 00:02:44 -04:00
Mark Backman	47e53890e3	Fix FastAPI WebSocket disconnect race condition causing pipeline hang When the remote side disconnects while send() is in flight, send() was setting _closing=True. This prevented the receive loop from firing on_client_disconnected, causing the pipeline to hang waiting for a disconnect signal that never came. The fix removes _closing from send() (that flag means we initiated the close) and instead checks Starlette application_state in _can_send() to suppress subsequent sends after a failure. Fixes #3912	2026-03-28 00:01:25 -04:00
Aleix Conchillo Flaqué	195180b6f4	Merge pull request #4184 from pipecat-ai/aleix/fix-sarvam-examples-role Fix Sarvam examples to use 'user' role instead of 'developer'	2026-03-27 20:34:59 -07:00
Aleix Conchillo Flaqué	8b64166bb7	Fix Sarvam examples to use 'user' role instead of 'developer' Sarvam uses the OpenAI-compatible API but does not support the 'developer' role, causing errors. Use 'user' role instead.	2026-03-27 20:33:25 -07:00
Aleix Conchillo Flaqué	1d18995435	Merge pull request #4183 from pipecat-ai/aleix/fix-task-scheduling Yield after create_task to ensure timer tasks are scheduled	2026-03-27 20:32:32 -07:00
Aleix Conchillo Flaqué	ea7324b2ba	Add changelog for #4183	2026-03-27 19:03:55 -07:00
Aleix Conchillo Flaqué	52ed7137af	Yield after create_task to ensure timer tasks are scheduled Add `await asyncio.sleep(0)` after `create_task()` calls in UserIdleController, SpeechTimeoutUserTurnStopStrategy, TurnAnalyzerUserTurnStopStrategy, and UserTurnCompletionLLMServiceMixin so the event loop schedules the newly created timer tasks before the caller continues.	2026-03-27 19:03:23 -07:00
kompfner	b33df03724	Merge pull request #4179 from pipecat-ai/pk/fix-gemini-live-vertex Don't send history_config for Gemini Live Vertex (unsupported)	2026-03-27 17:34:29 -04:00
Paul Kompfner	28fbe1db08	Don't send history_config for Gemini Live Vertex (unsupported)	2026-03-27 17:30:47 -04:00
kompfner	9240e92d9f	Merge pull request #4177 from pipecat-ai/pk/tweak-26i-for-gemini-3.1-flash-live-support Tweak 26i example system instruction for Gemini 3.1 Flash Live compat…	2026-03-27 17:20:06 -04:00
Paul Kompfner	5caf53f086	Tweak 26i example system instruction for Gemini 3.1 Flash Live compatibility Gemini 3.1 Flash Live won't reliably report ending its turn until after it says something following a tool call. Restructure the system instruction so the model says goodbye after calling end_conversation, and add a comment explaining the deferred EndFrame behavior that makes this work.	2026-03-27 17:13:17 -04:00
Mark Backman	ac2716811c	Merge pull request #4176 from pipecat-ai/mb/fix-websocket-rtvi-messages Fix RTVI events not delivered over WebSocket transports	2026-03-27 16:50:37 -04:00
Mark Backman	d313d56776	Fix RTVI events not delivered over WebSocket transports The base serializer filters out RTVI protocol messages by default (ignore_rtvi_messages=True) to prevent them from being sent over telephony media streams. ProtobufFrameSerializer is used by WebSocket transports, which are the delivery channel for these messages, so disable the filter there.	2026-03-27 16:47:11 -04:00
kompfner	159776f106	Merge pull request #4175 from pipecat-ai/pk/gemini-live-dropped-support-for-text-modality Warn when TEXT modality is set for Gemini Live, and remove 26d text example	2026-03-27 16:26:36 -04:00
kompfner	a23803478f	Merge pull request #4171 from pipecat-ai/pk/fix-gemini-3.1-flash-live-video Gate Gemini Live sending real-time input messages to the API until it…	2026-03-27 16:26:03 -04:00
Mark Backman	bae193ab4d	Merge pull request #4172 from pipecat-ai/mb/rime-tts-fixes Fix Rime TTS stop-frame handling and handle done message	2026-03-27 16:22:25 -04:00
Paul Kompfner	04adb697be	Warn when TEXT modality is set for Gemini Live, and remove 26d text example All recent Gemini Live models (including the default gemini-2.5-flash-native-audio-preview-12-2025, and going at least as far back as gemini-2.5-flash-native-audio-preview-09-2025) only support AUDIO as a response modality. We considered using `modalities=TEXT` as a Pipecat-level signal to suppress audio output frames (so developers could pair Gemini Live with an external TTS), but the output transcription from the API arrives too late relative to the audio to be useful for driving an external TTS service. For now, just log a warning when a TEXT modality is configured (at init or via set_model_modalities) and proceed as normal. The 26d text-modality example is removed since it no longer represents a viable configuration.	2026-03-27 16:21:15 -04:00
Mark Backman	4f9c8a6860	Merge pull request #4174 from pipecat-ai/fix/deepgram-sdk-6.1.0-compat Fix Deepgram STT compatibility with deepgram-sdk 6.1.0	2026-03-27 15:11:43 -04:00
Mark Backman	a1a29b3933	Add changelog for #4174	2026-03-27 14:50:12 -04:00
Mark Backman	0798803c70	Bump deepgram-sdk minimum version to 6.1.0	2026-03-27 14:46:17 -04:00
Mark Backman	6422661d08	Fix Deepgram STT compatibility with deepgram-sdk 6.1.0 The SDK now requires explicit message objects for send_keep_alive, send_close_stream, and send_finalize instead of no-arg calls.	2026-03-27 14:40:48 -04:00
Mark Backman	ed94b65d83	Merge pull request #4173 from pipecat-ai/filipi/updating_inworld_examples Removing the models from the Inworld example so we can use the default model.	2026-03-27 14:02:55 -04:00
filipi87	f9670b9601	Removing the models from the Inworld example so we can use the default model.	2026-03-27 14:23:20 -03:00
Paul Kompfner	5b2991f47f	Gate Gemini Live sending real-time input messages to the API until it's ready, i.e. after we've sent the initial conversation history (or determined that we don't need to). This fixes the 26c example when using Gemini 3.1 Flash Live, which seems to be more strict about not receiving real-time input (at least, video messages) before conversation history.	2026-03-27 12:41:05 -04:00
Mark Backman	fc3186dc0d	Add changelog entries for PR #4172	2026-03-27 12:38:53 -04:00
Mark Backman	1808b447c9	Handle done message from Rime TTS to avoid stop-frame timeout Rime's WebSocket API sends a done message when synthesis completes. Handle it to stop TTFB metrics, push TTSStoppedFrame, and remove the audio context immediately instead of relying on the 3-second stop_frame_timeout_s fallback.	2026-03-27 12:37:03 -04:00
Mark Backman	70df9d3fe4	Fix duplicate TTSStoppedFrame in TTS service timeout path	2026-03-27 12:07:37 -04:00
Filipi da Silva Fuchter	a8bfc23d3a	Merge pull request #4167 from pipecat-ai/filipi/inworld_improvements InworldTTSService improvements.	2026-03-27 11:15:14 -04:00
filipi87	e2870fc2ac	Changing to debug the log when we are not able to append audio to the context.	2026-03-27 12:12:16 -03:00
filipi87	e851f8c1d5	Adding changelog entry for the fix.	2026-03-27 12:11:35 -03:00
filipi87	b31bece617	Not trying to recreate the context.	2026-03-27 12:06:21 -03:00
kompfner	9e350bcc2f	Merge pull request #4147 from pipecat-ai/cb/gemini-transcript-fixes Fix Gemini Live to handle bundled server_content fields	2026-03-27 11:00:10 -04:00
Paul Kompfner	9c2594c484	Remove brittle test	2026-03-27 10:56:39 -04:00
Mark Backman	900fc88430	Merge pull request #4128 from pipecat-ai/mb/end-of-turn-assembly	2026-03-27 10:47:09 -04:00
filipi87	4ef5ac6f0c	InworldTTSService improvements.	2026-03-27 11:33:32 -03:00
Mark Backman	cbb3d99493	Merge pull request #4166 from pipecat-ai/mb/fix-example-ordering-56 Fix example numbering, add LemonSlice to evals	2026-03-27 10:29:07 -04:00
Filipi da Silva Fuchter	fb1996cedc	Merge pull request #4143 from pipecat-ai/cb/sagemaker-flux Add Deepgram Flux STT service for AWS SageMaker	2026-03-27 10:27:49 -04:00
Filipi da Silva Fuchter	95c55ec6c3	Merge pull request #4145 from pipecat-ai/filipi/tts_improvements_remove_reset TTS improvements.	2026-03-27 10:24:59 -04:00
Mark Backman	a45de9af7f	Merge pull request #4161 from tanmayc25/fix/lemonslice-missing-dtmf-callback fix(lemonslice): add missing on_dtmf_event callback in DailyCallbacks construction	2026-03-27 10:19:54 -04:00
Mark Backman	5e61a57582	Fix changelog entry for #4161	2026-03-27 10:16:25 -04:00
Mark Backman	d8b0ed18fd	Fix example numbering, add LemonSlice to evals	2026-03-27 10:11:37 -04:00
Mark Backman	789275a57b	Merge pull request #4164 from pipecat-ai/mb/update-community-integrations-guide docs: update COMMUNITY_INTEGRATIONS.md for accuracy	2026-03-27 09:38:31 -04:00
Filipi da Silva Fuchter	38c961a363	Merge pull request #4113 from inworld-ai/ian/lang-timestamps fix(inworld): fallback to full text when TTS timestamps are not received	2026-03-27 09:34:05 -04:00
Mark Backman	41a86a51bf	docs: update COMMUNITY_INTEGRATIONS.md for accuracy - Replace deprecated TTS classes (AudioContextWordTTSService, WordTTSService) with current hierarchy (WebsocketTTSService, InterruptibleTTSService, TTSService) - Add WebsocketSTTService and SDK-based STTService categories - Fix LLM section: document _process_context, adapter_class, remove deprecated create_context_aggregator guidance, add thought frames for reasoning models - Fix Vision section: run_vision takes UserImageRawFrame not LLMContext, yields Vision*Frame types not TextFrame - Fix push_error API: takes (error_msg, exception) not ErrorFrame - Fix frame name: TTSRawAudioFrame → TTSAudioRawFrame - Remove stale v13+ version reference - Clarify @traced_stt method convention	2026-03-27 09:22:32 -04:00

1 2 3 4 5 ...

8695 Commits