pipecat

Author	SHA1	Message	Date
filipi87	fe8cb2f4e0	Always appending TTSTextFrame to the audio context.	2026-03-20 21:34:21 -04:00
filipi87	cdf44f7a3f	Fixing the frame ordering of the AggregatedTextFrame.	2026-03-20 21:34:21 -04:00
filipi87	d32a8a9ee2	Fixing TTS frame order.	2026-03-20 21:34:21 -04:00
joachimchauvet	ed160fd2e0	fix(livekit): suppress InvalidState log spam from audio mixer during interruptions	2026-03-20 21:34:21 -04:00
aconchillo	84eddb64d5	Update changelog for version 0.0.106	2026-03-20 21:34:21 -04:00
Aleix Conchillo Flaqué	189249caec	Add missing on_dtmf_event callback to Tavus transport The on_dtmf_event callback was added to DailyCallbacks in #4047 but the Tavus transport was not updated, causing a missing argument error.	2026-03-20 21:34:21 -04:00
Filipi da Silva Fuchter	3c90468e03	Fixed the ordering of `_maybe_pause_frame_processing` call in `TTSService` (#4071) * Fixing the invocation of pause_frame_processing at the correct time when receiving LLMFullResponseEndFrame and EndFrame.	2026-03-20 21:34:21 -04:00
Mark Backman	98d3f697f1	Add WakePhraseUserTurnStartStrategy (#4064) - Add WakePhraseUserTurnStartStrategy for gating interaction behind wake phrase detection, with timeout and single_activation modes - Add default_user_turn_start_strategies() and default_user_turn_stop_strategies() helper functions - Deprecate WakeCheckFilter in favor of the new strategy - Extend ProcessFrameResult to stop strategies for short-circuit evaluation - Fix MinWordsUserTurnStartStrategy including filtered text in output	2026-03-20 21:34:21 -04:00
Mark Backman	b9d996ff41	Improvements for Nova Sonic LLM and TTS output frames (#4042) * Fix empty user transcription causing spurious interruption in Nova Sonic Skip _report_user_transcription_ended() when _user_text_buffer is empty, which happens when the initial prompt is text-only. Previously, an empty TranscriptionFrame was pushed upstream, triggering a chain reaction: on_user_turn_stopped → UserStartedSpeakingFrame → interruption → premature BotStoppedSpeaking → multiple response start/stop cycles. * Improve TextFrame and assistant end of turn logic Now, SPECULATIVE text results are used to push the LLMTextFrame, AggregatedTextFrame, and TTSTextFrame. Additionally, the TTSTextFrames are pushed at the end of the corresponding audio segment. * Remove BotStoppedSpeakingFrame fallback from Nova Sonic Now that assistant response end is detected directly from Nova Sonic contentEnd events (END_TURN and INTERRUPTED), the BotStoppedSpeakingFrame handler is no longer needed. Inline the cleanup logic in reset_conversation.	2026-03-20 21:34:21 -04:00
Mark Backman	5de4256ab1	GradiumSTTService improvements (#4066) * Remove duplicate reconnection logic from Gradium STT The _receive_messages method had its own while-True reconnect loop, duplicating the reconnection handling already provided by WebsocketService._receive_task_handler (exponential backoff, max retries, error reporting). Flatten to just the inner message loop and let the base class handle reconnection. * Align Gradium STT VAD handling with base class patterns Replace the process_frame override with a _handle_vad_user_stopped_speaking override, which is the proper hook provided by STTService. Move start_processing_metrics() into run_stt (matching Gladia's pattern). Remove unused FrameDirection and VADUserStartedSpeakingFrame imports. * Add transcript aggregation delay after flushed to capture trailing tokens Gradium flushed response can arrive before all text tokens have been delivered. Instead of finalizing immediately on flushed, start a short timer (100ms) that allows trailing tokens to accumulate before pushing the final TranscriptionFrame. * Add changelog for PR #4066 * Change default encoding to pcm_16000 * Decouple encoding from sample_rate in Gradium STT The encoding parameter now takes just the base type (pcm, wav, opus) and the sample rate is derived from the pipeline audio_in_sample_rate, assembled dynamically via input_format_from_encoding(). This fixes the mismatch where SAMPLE_RATE=24000 was passed to the base class while encoding defaulted to pcm_16000.	2026-03-20 21:34:21 -04:00
Mark Backman	e2e0d9f8c4	fix: pass list-type Deepgram settings as lists instead of stringifying List-valued settings like keyterm, keywords, search, redact, and replace were being converted to strings before being passed to the SDK connect() method. The SDK expects lists so its encode_query can produce repeated query params (keyterm=a&keyterm=b).	2026-03-20 21:34:21 -04:00
Mark Backman	4c10fab0c9	Add changelog for #4046	2026-03-20 21:34:21 -04:00
Mark Backman	b610ba0aa5	Fix OpenAI STT crash when language is a plain string instead of Language enum	2026-03-20 21:34:21 -04:00
Mark Backman	d7d6ad6e96	Fix SonioxSTTService crash when language_hints contains plain strings (#4045) Refactor language_to_soniox_language to use resolve_language + LANGUAGE_MAP pattern consistent with other services. Fix resolve_language fallback to use str(language) instead of language.value so plain strings don't crash.	2026-03-20 21:34:21 -04:00
Mark Backman	7eedd5929d	Add changelog for #4026	2026-03-20 21:34:21 -04:00
Mark Backman	490e460c4b	Fix DeepgramSTTService base_url forcing HTTPS/WSS schemes The base_url parameter previously forced wss:// and https:// schemes, breaking air-gapped or private deployments that need ws:// or http://. Extract URL derivation into _derive_deepgram_urls() helper that respects the developer's scheme choice while deriving the paired WebSocket and HTTP URLs the Deepgram SDK requires. Closes #4019	2026-03-20 21:34:21 -04:00
Mark Backman	e1ce74c7a5	Fix deprecation warning when using filter_incomplete_user_turns	2026-03-20 21:34:21 -04:00
Mark Backman	5faac08d36	docs: add changelog for #4058	2026-03-20 21:34:21 -04:00
Mark Backman	4171a75f79	fix: resolve raw language strings through Language enum for proper service conversion Raw strings like "de-DE" passed as the language parameter to TTS/STT services were bypassing the Language enum resolution logic, causing silent failures (e.g. ElevenLabs expects "de" not "de-DE"). Now raw strings are first converted to Language enums so they go through the same resolve_language() path, with a warning logged for unrecognized strings.	2026-03-20 21:34:21 -04:00
Mark Backman	fa345a510f	Add changelog for #4057	2026-03-20 21:34:21 -04:00
Mark Backman	55fb274d5a	Fix stale state in user turn stop strategies between turns Reset stop strategies at turn start (not just turn stop) so that late transcriptions arriving between turns do not leave stale _text that causes premature stops on the next turn. Also cancel pending timeout tasks in reset() for both SpeechTimeout and TurnAnalyzer strategies.	2026-03-20 21:34:21 -04:00
Mark Backman	fffb16ad39	Update uv.lock with pyasn1 v0.6.3	2026-03-20 21:34:20 -04:00
Mark Backman	9a32364b34	feat: add enable_dialout parameter to configure() for dial-out rooms Expose enable_dialout as a configure() parameter (default False) so dial-out examples can opt in without needing to build DailyRoomProperties manually.	2026-03-20 21:34:20 -04:00
Mark Backman	732afde3ea	fix: clean up configure() type hints, deduplicate token expiry, and improve comment Narrow misleading Optional type hints on parameters that never accept None, extract the duplicated token_exp_duration * 60 * 60 calculation, remove unnecessary forward-reference quotes on DailyMeetingTokenProperties, and clarify why enable_dialout is explicitly set to False.	2026-03-20 21:34:20 -04:00
copilot-swe-agent[bot]	e5215a636f	fix: set enable_dialout to False in PSTN runner to prevent room creation failures Co-authored-by: jamsea <614910+jamsea@users.noreply.github.com>	2026-03-20 21:34:20 -04:00
copilot-swe-agent[bot]	c0bc94a9ce	Initial plan	2026-03-20 21:34:20 -04:00
Julien Vantyghem	d26f512ba3	update docstring following https://github.com/pipecat-ai/pipecat/pull/3916	2026-03-20 21:34:20 -04:00
Blaine Kasten	fe84a881dd	turn off server vad	2026-03-20 11:17:38 -05:00
Blaine Kasten	591c02fb0e	a few updates	2026-03-19 13:37:21 -05:00
Blaine Kasten	077610184d	Add together STT and TTS services	2026-03-17 07:24:02 -05:00
Mark Backman	a0595adbdc	Merge pull request #4012 from pipecat-ai/mb/deprecate-old-local-smart-turn	2026-03-16 21:09:26 -04:00
Mark Backman	dc1632bbac	Merge pull request #4023 from pipecat-ai/mb/update-small-webrtc-prebuilt-2.4.0	2026-03-16 21:09:08 -04:00
Mark Backman	53f49ac094	Merge pull request #4024 from pipecat-ai/mb/fix-lang-enum-stt-tts	2026-03-16 21:08:48 -04:00
Mark Backman	bf02d61418	Merge pull request #4025 from pipecat-ai/mb/fix-example-system-instruction	2026-03-16 21:07:01 -04:00
Mark Backman	154a8d1987	Merge pull request #4035 from pipecat-ai/mb/bump-pyjwt-version	2026-03-16 21:06:31 -04:00
Mark Backman	fa5b757408	Merge pull request #4044 from pipecat-ai/mb/pyopenssl-upgrade	2026-03-16 21:06:09 -04:00
Aleix Conchillo Flaqué	c765bc98d3	Merge pull request #4047 from pipecat-ai/aleix/daily-python-0.25.0-dtmf-events Update daily-python to 0.25.0 and add DTMF input events	2026-03-16 18:05:10 -07:00
Aleix Conchillo Flaqué	59486d5abf	Add changelog entries for PR #4047	2026-03-16 17:58:12 -07:00
Aleix Conchillo Flaqué	5cb6aecc9f	Add DTMF input event support to Daily transport Handle Daily's on_dtmf_event callback, convert it to an InputDTMFFrame pushed into the input transport. Also add __str__ methods to InputDTMFFrame and OutputDTMFFrame for better logging.	2026-03-16 17:57:39 -07:00
Aleix Conchillo Flaqué	5c685c35d7	pyproject: update daily-python to 0.25.0	2026-03-16 17:41:44 -07:00
Aleix Conchillo Flaqué	1a1d5e6a84	Merge pull request #4006 from pipecat-ai/aleix/task-frame-flush-ordering handle EndTaskFrame, StopTaskFrame and CancelTaskFrame downstream	2026-03-16 17:35:11 -07:00
Mark Backman	538b9fa2d9	Bump pyopenssl in uv.lock to 26.0.0	2026-03-16 17:58:44 -04:00
Mark Backman	b437cbe126	Merge pull request #4037 from omChauhanDev/fix/llm-switcher-timeout-secs forward timeout_secs in LLMSwitcher register methods	2026-03-15 10:08:11 -04:00
Om Chauhan	ed0f5ab09b	added changelog	2026-03-15 19:15:18 +05:30
Om Chauhan	a6ad8a355b	forward timeout_secs in LLMSwitcher register methods	2026-03-15 19:10:32 +05:30
Mark Backman	e8415b7451	Add changelog for #4035	2026-03-15 08:56:54 -04:00
Mark Backman	24c3d23229	Bump PyJWT minimum version to 2.12.0 for CVE-2026-32597 Addresses Dependabot alert #165 (GHSA-752w-5fwx-jx9f) where PyJWT <= 2.11.0 accepts unknown `crit` header extensions.	2026-03-15 08:53:06 -04:00
Mark Backman	978a1a2083	Update the system_instruction wording in the foundational examples to not mention WebRTC call	2026-03-13 12:22:10 -04:00
Mark Backman	0ec5f5e5ac	Add missing language deprecations for XTTSService, LmntTTSService	2026-03-13 11:33:59 -04:00
Mark Backman	1ea23ad362	Add changelog for #4024	2026-03-13 10:58:51 -04:00
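
Commit e2e0d9f8c4 above notes that list-valued Deepgram settings must stay lists so that repeated query parameters (keyterm=a&keyterm=b) can be produced. The sketch below illustrates that difference using only Python's standard urllib.parse.urlencode. It does not call the Deepgram SDK or its encode_query, and the settings dict is a made-up example.

```python
from urllib.parse import urlencode

# Hypothetical settings; only the shape (a list value) matters here.
settings = {"keyterm": ["a", "b"]}

# Passing the list through lets the encoder emit one parameter per item.
as_list = urlencode(settings, doseq=True)

# Stringifying the list first collapses it into one percent-encoded value.
as_string = urlencode({key: str(value) for key, value in settings.items()})

print(as_list)    # keyterm=a&keyterm=b
print(as_string)  # a single keyterm=... parameter holding the list's repr
```

With the stringified form, the receiving side sees one key term that is the textual representation of the whole list, which matches the failure mode the commit message describes.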

8383 Commits
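
Commit 490e460c4b above describes deriving paired WebSocket and HTTP URLs from a base_url while respecting the caller's scheme. A minimal sketch of that idea follows. The function name derive_paired_urls, the scheme tables, and the choice to treat a scheme-less host as secure are all assumptions made for illustration; this is not the actual _derive_deepgram_urls() helper.

```python
from urllib.parse import urlsplit, urlunsplit

# Pair each accepted scheme with its WebSocket and HTTP counterpart,
# keeping the secure/insecure choice intact.
_WS_FOR = {"ws": "ws", "http": "ws", "wss": "wss", "https": "wss"}
_HTTP_FOR = {"ws": "http", "http": "http", "wss": "https", "https": "https"}


def derive_paired_urls(base_url: str) -> tuple[str, str]:
    """Return (websocket_url, http_url) for a base URL.

    Hypothetical helper: a scheme-less input is assumed to mean secure.
    """
    if "://" not in base_url:
        base_url = f"wss://{base_url}"
    parts = urlsplit(base_url)
    scheme = parts.scheme.lower()
    if scheme not in _WS_FOR:
        raise ValueError(f"unsupported scheme: {parts.scheme!r}")
    ws_url = urlunsplit(parts._replace(scheme=_WS_FOR[scheme]))
    http_url = urlunsplit(parts._replace(scheme=_HTTP_FOR[scheme]))
    return ws_url, http_url


# An insecure private deployment keeps ws:// and http://.
print(derive_paired_urls("ws://deepgram.internal:8080"))
# A secure URL keeps wss:// and https://.
print(derive_paired_urls("https://api.example.com/v1"))
```

The host names used here are placeholders. The point of the lookup tables is that an insecure input never gets silently upgraded, which is the behavior the commit says air-gapped deployments need.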