pipecat

Author	SHA1	Message	Date
filipi87	deefc32faf	fix: hold skipped TTS frames in position until preceding spoken frames complete Skipped frames (e.g. code blocks filtered via skip_aggregator_types) were emitted to the assistant context immediately instead of waiting for preceding spoken frames to finish. Introduces AggregatedFrameSequencer to hold each frame's slot and flush only after all earlier spoken sentences are complete, keeping context ordering correct.	2026-05-20 10:03:03 -03:00
Mark Backman	82cd931efa	Merge pull request #4306 from YFortin/fix/azure-tts-last-word-race fix(azure-tts): Route completion through word boundary queue to prevent last word from being missed	2026-05-19 22:27:50 -04:00
Mark Backman	c09f6d5adb	Merge pull request #4052 from Vonage/vonage_video_connector_transport Vonage WebRTC Transport Integration	2026-05-19 10:56:20 -04:00
asilvestre	e2d249e5d9	adding uv.lock	2026-05-19 16:33:38 +02:00
asilvestre	956b39b0dc	remove extraenous await in cleanup	2026-05-19 16:33:04 +02:00
asilvestre	bc769eaa82	Changing the example to use OpenAI	2026-05-18 14:40:56 +02:00
asilvestre	ee5aa4dc71	SubscribeSettings to be pydantic and comment fixes	2026-05-18 14:40:56 +02:00
asilvestre	dd38fbc735	add documentation entry	2026-05-18 14:40:56 +02:00
asilvestre	a1c40df471	add documentation entry	2026-05-18 14:40:56 +02:00
asilvestre	c4ff9300c9	fix linting and typechecking	2026-05-18 14:40:56 +02:00
asilvestre	cab4585cbb	added changelog	2026-05-18 14:40:56 +02:00
Antoni Silvestre	18368d047e	Linting and changes to adapt to v1.0	2026-05-18 14:40:56 +02:00
asilvestre	e3abb4b6d7	apply suggestions in PR	2026-05-18 14:40:56 +02:00
Antoni Silvestre	0fd971d59d	Update src/pipecat/runner/types.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-05-18 14:40:56 +02:00
asilvestre	c61672194d	Vonage Video Connector Transport	2026-05-18 14:40:49 +02:00
Filipi da Silva Fuchter	c51a817efa	Merge pull request #4442 from pipecat-ai/filipi/runner_all_transports Unified start route to make all transports available	2026-05-18 09:27:44 -03:00
Bismeet singh	d85eda6da8	Merge pull request #4507 from BismeetSingh/fix/elevenlabs-stt-service-crash-language Fix/elevenlabs stt service crash language	2026-05-17 10:17:07 -04:00
Aleix Conchillo Flaqué	71feb42711	Merge pull request #4503 from pipecat-ai/changelog-1.2.1 Release 1.2.1 - Changelog Update v1.2.1	2026-05-15 15:19:55 -07:00
aconchillo	6b93ca0cb6	Update changelog for version 1.2.1	2026-05-15 22:18:46 +00:00
Aleix Conchillo Flaqué	b6ecce754b	Merge pull request #4501 from pipecat-ai/aleix/fix-filter-incomplete-tool-calls Fix filter-incomplete + function-calling deadlock	2026-05-15 15:11:45 -07:00
Aleix Conchillo Flaqué	d39e6bf921	Add changelog for #4501	2026-05-15 14:54:51 -07:00
Aleix Conchillo Flaqué	63064860ef	Move OpenAITTSService instructions into Settings in the example Mirrors the deprecation in ``OpenAITTSService.__init__``: ``instructions`` is now a Settings field. The constructor still accepts it for backward compatibility but the canonical path is through ``Settings``.	2026-05-15 14:54:51 -07:00
Aleix Conchillo Flaqué	f5158d51e7	Add filter-incomplete + function-calling turn-management example A copy of ``turn-management-filter-incomplete-turns.py`` extended with a ``get_weather(location)`` direct function. Exercises the path where the LLM responds to a complete user turn by calling a tool — used to reproduce (and now verify the fix for) the ``_user_speaking`` gating bug between filter-incomplete and function calls.	2026-05-15 14:54:51 -07:00
Aleix Conchillo Flaqué	94dbd2fa68	Broadcast UserTurnInferenceCompletedFrame on tool calls in filter-incomplete With ``filter_incomplete_user_turns`` enabled, an LLM that responded to a user turn by calling a tool (without first emitting a ✓ marker) never finalized the user turn. ``UserStoppedSpeakingFrame`` stayed deferred, the assistant aggregator kept ``_user_speaking=True``, and when ``FunctionCallResultFrame`` arrived its ``not self._user_speaking`` gate dropped the context push — the LLM continuation never ran and the call hung silently. Broadcast ``UserTurnInferenceCompletedFrame`` on ``FunctionCallsStartedFrame`` (i.e. the moment the LLM commits to a tool call, before the function dispatches), gated by a new ``_turn_completion_broadcasted`` flag so the ✓ path and the tool-call path don't both fire. The flag resets in ``_turn_reset`` alongside the other per-turn state. Emitting on the start frame rather than ``LLMFullResponseEndFrame`` also shrinks the race window — ``UserStoppedSpeakingFrame`` (a ``SystemFrame``) has the maximum possible head start over the ``FunctionCallResultFrame`` (``DataFrame``) that follows.	2026-05-15 14:50:35 -07:00
Mark Backman	c6ea6c6522	Merge pull request #4500 from pipecat-ai/mb/update-gradium-endpoints Update Gradium STT/TTS endpoints to region-neutral URLs	2026-05-15 15:59:14 -04:00
Mark Backman	58a22aeeb1	Add changelog for #4500	2026-05-15 15:19:39 -04:00
Mark Backman	5403aa56e4	Remove Gradium endpoint overrides from voice example Drop the explicit US-region URLs so the example picks up the new region-neutral defaults in GradiumSTTService and GradiumTTSService.	2026-05-15 15:17:12 -04:00
Mark Backman	0e0d76d020	Update Gradium endpoints to region-neutral URLs Drop the EU-region default from the STT/TTS WebSocket URLs in favor of the generic api.gradium.ai endpoint, and remove the explicit overrides from the examples so they pick up the new defaults.	2026-05-15 15:02:05 -04:00
filipi87	b493ed8d3a	Removing the websocket transport from elevenlabs example.	2026-05-15 10:11:38 -03:00
filipi87	c3338667b1	Mounting the prebuilt frontend UI and root redirect for all transports.	2026-05-15 10:06:47 -03:00
Aleix Conchillo Flaqué	ea296babe9	Merge pull request #4498 from pipecat-ai/changelog-1.2.0 Release 1.2.0 - Changelog Update v1.2.0	2026-05-14 14:47:47 -07:00
aconchillo	b13af2b053	Update changelog for version 1.2.0	2026-05-14 21:45:36 +00:00
Aleix Conchillo Flaqué	7b6d878f07	update uv.lock	2026-05-14 14:41:38 -07:00
Aleix Conchillo Flaqué	8e405f15aa	changelog: fix 4446.change.md file name	2026-05-14 14:38:54 -07:00
Aleix Conchillo Flaqué	44a40e8eb2	Merge pull request #4497 from pipecat-ai/aleix/fix-tts-context-id-fallback Fall back to _turn_context_id in get_active_audio_context_id	2026-05-14 13:34:34 -07:00
Aleix Conchillo Flaqué	ea97cb1a78	Add changelog for #4497	2026-05-14 13:22:50 -07:00
Aleix Conchillo Flaqué	22650b1b56	Move QwenLLMService model into Settings in the qwen example Mirrors the deprecation in ``QwenLLMService.__init__``: ``model`` should be passed via ``settings=QwenLLMService.Settings(model=...)`` instead of as a direct constructor arg.	2026-05-14 13:22:07 -07:00
Aleix Conchillo Flaqué	b76831e677	Fall back to _turn_context_id in get_active_audio_context_id TTS services whose wire protocol does not echo the context_id back on incoming audio (Sarvam, Smallest, Soniox, Inworld, ...) call ``get_active_audio_context_id()`` to tag each chunk. That accessor returned only ``_playing_context_id`` — the playback-side cursor set asynchronously by ``_audio_context_task_handler`` when it pops a context off the serialization queue. Result: incoming audio that arrived in the gap between contexts or at the very start of a turn (before the playback loop popped) had ``context_id=None`` and was dropped with ``unable to append audio to context: no context ID provided``. Fall back to ``_turn_context_id`` (the synthesis-side cursor, set as soon as the turn's context is created) so the gap is covered without prematurely nulling the playback cursor.	2026-05-14 13:22:00 -07:00
Mark Backman	b57111743f	Merge pull request #4495 from pipecat-ai/mb/soniox-stt-lang-counter	2026-05-14 15:57:31 -04:00
Mark Backman	dcbb0070c9	Add changelog for Soniox language selection	2026-05-14 15:42:43 -04:00
Mark Backman	73278d3309	Use majority language for Soniox transcripts	2026-05-14 15:18:43 -04:00
filipi87	c8efe319b3	Adding the changelog for the changes.	2026-05-14 11:10:33 -03:00
Mark Backman	49bda11ae8	Merge pull request #4482 from pipecat-ai/mb/soniox-stt-token-language Propagate Soniox token language	2026-05-13 16:28:56 -04:00
Aleix Conchillo Flaqué	07640582ce	Merge pull request #4467 from pipecat-ai/aleix/fix-tts-ttfb-tracing Fix metrics.ttfb and partial output on TTS/STT/LLM OpenTelemetry spans	2026-05-13 13:10:52 -07:00
Mark Backman	078af6969a	Merge pull request #4473 from timofey-TK/inworld-tts-v2 Add support for Inworld TTS v2 fields	2026-05-13 15:32:16 -04:00
Mark Backman	9f40ba21c2	Add changelog for Soniox language fix	2026-05-13 15:26:10 -04:00
Mark Backman	82f0896d6a	Propagate Soniox token language	2026-05-13 15:23:22 -04:00
kompfner	7e4cd23de4	Merge pull request #4474 from pipecat-ai/pk/inworld-realtime-tools Extend cancel_on_interruption=False to Inworld Realtime (best-effort + warning)	2026-05-13 15:12:34 -04:00
TimTk	97f50c8aa2	Address review: use resolve_language, narrow delivery_mode type, update changelog - Replace custom LANGUAGE_MAP fallback in language_to_inworld_language with resolve_language(language, LANGUAGE_MAP, use_base_code=False) to match the pattern used by other services and restore the unverified-language warning - Tighten delivery_mode type from str to Literal["STABLE", "BALANCED", "CREATIVE"] - Update changelog entry to mention delivery_mode and language normalization	2026-05-13 21:43:02 +03:00
Mark Backman	08680732f6	Merge pull request #4475 from pipecat-ai/mb/cartesia-korean-fix Fix Cartesia CJK timestamp spacing	2026-05-13 13:20:42 -04:00

1 2 3 4 5 ...

9501 Commits