Mark Backman
b012220ab9
Update GradiumTTSService to flush instead of ending stream
2026-01-29 18:49:14 -05:00
Mark Backman
7be2b8cc34
Merge pull request #3587 from pipecat-ai/mb/gradium-improvements
...
GradiumSTTService now flushes pending transcripts on VAD stopped dete…
2026-01-29 18:11:25 -05:00
Aleix Conchillo Flaqué
671cc8eb74
Merge pull request #3590 from pipecat-ai/aleix/custom-cli-runner-args
...
runner: allow custom CLI arguments
2026-01-29 13:53:27 -08:00
Aleix Conchillo Flaqué
b4dce656f0
Merge pull request #3594 from pipecat-ai/aleix/user-turn-controller-reset-timeout-on-interims
...
UserTurnController: reset user turn timeout with interim transcriptions
2026-01-29 13:12:44 -08:00
Aleix Conchillo Flaqué
253a1d1114
UserTurnController: reset user turn timeout with interim transcriptions
2026-01-29 13:10:10 -08:00
Aleix Conchillo Flaqué
ca613bcb79
Merge pull request #3592 from pipecat-ai/aleix/broadcast-frame-no-deepcopy
...
don't deep copy fields when broadcasting frames
2026-01-29 11:50:20 -08:00
Aleix Conchillo Flaqué
0423acd8a0
STTService: just clear buffer before running run_stt()
2026-01-29 11:47:57 -08:00
Aleix Conchillo Flaqué
7eabaaa0ef
FrameProcessors: do not deepcopy fields when broadcasting frames
2026-01-29 11:47:57 -08:00
Aleix Conchillo Flaqué
bbb8b53d03
runner: allow custom CLI arguments
2026-01-29 10:15:53 -08:00
Aleix Conchillo Flaqué
f3b72e9263
Merge pull request #3585 from pipecat-ai/aleix/improve-piper-tts-support
...
improve Piper TTS support
2026-01-29 08:36:13 -08:00
Mark Backman
31c7fbc5ba
Add delay_in_frames and language support
2026-01-29 10:59:04 -05:00
Mark Backman
6ab12626d6
GradiumSTTService now flushes pending transcripts on VAD stopped detection
2026-01-29 10:26:17 -05:00
Mark Backman
b77a50de73
Merge pull request #3529 from lukepayyapilli/fix/llm-timeout-without-retry
...
feat: handle exceptions for BaseOpenAILLMService
2026-01-29 09:12:54 -05:00
Luke Payyapilli
433c1b9b92
add catch-all exception handler per review feedback
2026-01-29 09:07:06 -05:00
Aleix Conchillo Flaqué
bd00587092
changelog: add files for 3585
2026-01-29 00:16:39 -08:00
Aleix Conchillo Flaqué
5a85e27cc5
PiperHttpTTSService: allow passing a voice id
2026-01-29 00:16:39 -08:00
Aleix Conchillo Flaqué
11daa43b1b
TTSService: resample _stream_audio_frames_from_iterator() input audio if needed
2026-01-29 00:16:39 -08:00
Aleix Conchillo Flaqué
875614ff7a
tts: add support for local PiperTTSService
2026-01-29 00:16:39 -08:00
Aleix Conchillo Flaqué
eb1bf1e446
tts: rename PiperTTSService to PiperHttpTTSService
2026-01-28 23:27:32 -08:00
mattie ruth backman
7456a0a55f
Fix the /start and /offer/api proxy endpoints for smallWebRTC to match pipecat cloud behavior WRT requestData
2026-01-28 15:25:13 -05:00
Filipi da Silva Fuchter
27277ed3d9
Merge pull request #3571 from pipecat-ai/filipi/funcion_call_improvements
...
Function call improvements
2026-01-28 14:03:40 -05:00
filipi87
5543bc56f3
Add changelog files for PR #3571
...
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com >
2026-01-28 15:43:59 -03:00
filipi87
c8496dfb8e
Updated the examples which use UserImageRequestFrame to defer the function call result.
2026-01-28 15:39:21 -03:00
filipi87
d3f4cbb620
Providing a way to defer the function call results.
2026-01-28 15:39:06 -03:00
filipi87
c9f922c479
Removed an overridden method that was identical to the parent implementation.
2026-01-28 15:38:40 -03:00
Aleix Conchillo Flaqué
49bd3da26b
Merge pull request #3582 from pipecat-ai/aleix/daily-sample-room-url
...
rename DAILY_SAMPLE_ROOM_URL to DAILY_ROOM_URL
2026-01-28 10:38:14 -08:00
Aleix Conchillo Flaqué
f3ef488925
rename DAILY_SAMPLE_ROOM_URL to DAILY_ROOM_URL
2026-01-28 10:05:27 -08:00
Aleix Conchillo Flaqué
4f08098917
Merge pull request #3580 from Pulkit0729/fix/livekit
...
fix: adding missing livekit transport configs
2026-01-28 10:04:34 -08:00
Pulkit
a7cd5b0322
fix: adding missing livekit transport configs
2026-01-28 23:15:03 +05:30
Aleix Conchillo Flaqué
55dadc9118
tests(genesys): fix formatting
2026-01-28 09:15:42 -08:00
Aleix Conchillo Flaqué
01bbf61e0d
Merge pull request #3500 from ssillerom/feature/genesys_serializer
...
Feature/genesys serializer
2026-01-28 09:09:11 -08:00
ssillerom
10fb77c0e2
added changelog file
2026-01-28 18:07:33 +01:00
ssillerom
2612fae527
ruff linting
2026-01-28 18:02:51 +01:00
ssillerom
c5be67f293
fix: create disconnect message passing output vars
2026-01-28 17:56:21 +01:00
kompfner
312caaba86
Merge pull request #3429 from lukepayyapilli/fix/gemini-live-interrupted-signal
...
feat: handle server_content.interrupted for faster interruptions
2026-01-28 10:25:36 -05:00
Luke Payyapilli
ff0eb6d286
fix: emit ErrorFrame on LLM completion timeout
2026-01-28 09:44:32 -05:00
ssillerom
ef6bbace98
fixes: super init inhereted class to set event hanlders in the construct
2026-01-28 15:40:24 +01:00
Filipi da Silva Fuchter
06ec21387f
Merge pull request #3581 from pipecat-ai/filipi/open_ai_audio_duration
...
Fixed race condition in OpenAIRealtimeLLMService
2026-01-28 07:42:35 -05:00
filipi87
bdae177125
Adding changelog entry for the OpenAiRealtimeLLMService fix.
2026-01-28 08:39:11 -03:00
filipi87
468e159f9b
Fixed race condition in OpenAIRealtimeLLMService that could cause an error when truncating the conversation.
2026-01-28 08:36:31 -03:00
ssillerom
a4acafd3be
feature: added event handlers in constructor and call func in each _handle_* func
2026-01-28 10:54:26 +01:00
ssillerom
105824a372
Merge main into feature/genesys_serializer
...
Incorporates latest changes from main branch including:
- AIC filter and VAD updates
- STT service improvements
- Base serializer changes
- Various bug fixes
2026-01-28 10:48:56 +01:00
ssillerom
55e0d4ecc4
ruff fixes done
2026-01-28 08:59:28 +01:00
ssillerom
9102e81cb8
added tests to the PR
2026-01-27 23:39:43 +01:00
ssillerom
d7d8e93a3d
feature: added custom params in closed message to genesys, simplified create_* functions, simplified constructor method and simplified opened message
2026-01-27 23:36:47 +01:00
Mark Backman
bf9b166464
Merge pull request #3575 from pipecat-ai/mb/fix-turn-stopped-event-end-cancel-frame
...
Emit on_assistant_turn_stopped and on_user_turn_stopped from EndFrame…
2026-01-27 14:55:34 -05:00
Mark Backman
e80e0eab29
Emit on_assistant_turn_stopped and on_user_turn_stopped from EndFrame or CancelFrame
2026-01-27 14:50:10 -05:00
Mark Backman
61242e6575
Merge pull request #3574 from pipecat-ai/mb/fix-websocket-close-message-handling
...
Fix WebsocketService infinite loop on graceful server disconnect
2026-01-27 13:53:26 -05:00
Aleix Conchillo Flaqué
8841387121
Merge pull request #3560 from pipecat-ai/aleix/serializer-base-objects
...
FrameSerializer: subclass from BaseObject so we can add events
2026-01-27 09:58:44 -08:00
Aleix Conchillo Flaqué
ee695ae9fe
FrameSerializer: subclass from BaseObject so we can add events
2026-01-27 09:53:46 -08:00
Mark Backman
52012b0fb2
Fix WebsocketService infinite loop on graceful server disconnect
2026-01-27 12:41:28 -05:00
Mark Backman
f7a1c6b719
Merge pull request #3408 from ai-coustics/aic-v2
...
Add ai-coustics AIC SDK v2 support with model downloading
2026-01-27 10:38:26 -05:00
Gökmen Görgen
6aa77ccc13
group aic related changes in changelog.
2026-01-27 16:22:54 +01:00
Gökmen Görgen
45b7ec4e2c
re-enable 07zd-interruptible-aicoustics.py in release evals.
2026-01-27 16:18:56 +01:00
Mark Backman
1c434c6ad5
Merge pull request #3562 from speechmatics/fix/smx-ttfs-finals
...
Support TTFS for Speechmatics STT
2026-01-27 08:35:34 -05:00
Mark Backman
4591affba9
Merge pull request #3568 from pipecat-ai/mb/changelog-3536
2026-01-27 07:14:41 -05:00
Sam Sykes
91346f5f37
Add support for self.request_finalize() for Pipecat-based VAD.
2026-01-27 10:44:35 +00:00
Filipi da Silva Fuchter
6a66ebe332
Merge pull request #3541 from pipecat-ai/filipi/audio_buffer
...
Refactoring AudioBufferProcessor to fix audio track synchronization.
2026-01-27 05:32:41 -05:00
Filipi da Silva Fuchter
c1d4180042
Merge pull request #3567 from pipecat-ai/filipi/openai_realtime_audio_duration
...
Fixed race condition in OpenAIRealtimeBetaLLMService
2026-01-27 05:30:33 -05:00
Gökmen Görgen
81a53c699c
handle AIC processor init errors gracefully and ensure _aic_ready reflects readiness
2026-01-27 11:28:05 +01:00
Sam Sykes
60168f7f69
remove comment
2026-01-26 23:16:43 +00:00
Sam Sykes
23d7608e5f
changelog update
2026-01-26 23:15:30 +00:00
Sam Sykes
99242c0a93
linting updates
2026-01-26 23:14:40 +00:00
Sam Sykes
3a71865cf4
removed old metrics
2026-01-26 23:11:25 +00:00
Mark Backman
ecf2e69f3f
Merge pull request #3536 from surapuramakhil/main
...
LLMAssistantAggregator: preserve non-ASCII characters in JSON output
2026-01-26 16:42:05 -05:00
Mark Backman
febd52274d
Add changelog fragment for PR 3536
2026-01-26 16:42:00 -05:00
Mark Backman
1542d922e7
Merge pull request #3546 from pipecat-ai/pk/changelog-fragment-for-pr-3406
...
Added a changelog fragment for PR 3406
2026-01-26 16:31:57 -05:00
Paul Kompfner
15d5d1159e
Added a changelog fragment for PR 3406
2026-01-26 16:27:33 -05:00
Mark Backman
884630a6bd
Merge pull request #3559 from pipecat-ai/aleix/transport-broadcast-fixes
...
transports: fix broadcast_frame_class reference
2026-01-26 16:25:31 -05:00
Mark Backman
1cf137c6a8
Merge pull request #3565 from pipecat-ai/markbackman-patch-1
2026-01-26 15:49:35 -05:00
filipi87
98fcfd7c91
Adding changelog entry for the OpenAiRealtimeBetaLLMService fix.
2026-01-26 17:19:08 -03:00
filipi87
2f23f2e39c
Fixed race condition in OpenAIRealtimeBetaLLMService that could cause an error when truncating the conversation.
2026-01-26 17:08:27 -03:00
Mark Backman
9c6b11cecf
Update README links to use absolute URLs
2026-01-26 13:03:39 -05:00
Sam Sykes
fc1444c9d6
Updated changelog
2026-01-26 16:25:37 +00:00
Sam Sykes
ea94939add
update dependency
2026-01-26 16:24:56 +00:00
Sam Sykes
0c69ae6371
Changelog entry.
2026-01-26 16:07:59 +00:00
Sam Sykes
8b88280bb1
Default to using EXTERNAL mode.
2026-01-26 15:52:42 +00:00
Sam Sykes
960d0faea5
support is_eou for final segment in utterance
2026-01-26 15:48:04 +00:00
Luke Payyapilli
b9390ccb1b
Address review: remove UserStartedSpeakingFrame, add explanatory comment
2026-01-26 10:08:17 -05:00
Mark Backman
061a0dc43d
Merge pull request #3498 from pipecat-ai/mb/azure-tts-8khz-workaround
...
AzureTTSService 8khz workaround
2026-01-26 09:48:22 -05:00
Mark Backman
328bbe069f
Merge pull request #3554 from pipecat-ai/mb/simplify-stt-ttfb
...
Simplify STT finalize handling
2026-01-26 08:00:04 -05:00
Mark Backman
dc32ecc872
Merge pull request #3555 from pipecat-ai/mb/speechmatics-stt-ttfb
...
Align Speechmatics STT TTFB metrics with STT classes
2026-01-26 07:59:34 -05:00
Gökmen Görgen
ca2eb1904f
Merge remote-tracking branch 'origin/aic-v2' into aic-v2
2026-01-26 10:16:23 +01:00
Gökmen Görgen
4bce58f270
update changelog and remove outdated dependency notes
2026-01-26 10:15:15 +01:00
Gökmen Görgen
7572d63f8f
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 10:06:40 +01:00
Gökmen Görgen
3c463c9416
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 10:06:33 +01:00
Gökmen Görgen
bd618d64e3
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 10:06:16 +01:00
Gökmen Görgen
a824660df7
add unit tests for AICVADAnalyzer and AICFilter.
2026-01-26 09:56:36 +01:00
Gökmen Görgen
58b9019852
bump aic-sdk to 2.0.1 in optional dependencies.
2026-01-26 09:14:16 +01:00
Gökmen Görgen
afcdef8c81
docstring clarification.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
bd92104fb3
clarify voice confidence method behavior in AIC VAD.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
34e9f224a8
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
dca7f3b5b0
add changelog.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
70a85cd192
use path for keeping the consistency between the parameters.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
91e86658b7
force developer to set a license key, it's required.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
0a8588669c
address feedback.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
0e99400148
two dots are rust specific thinks, I'm not sure if it's familiar for Python developers.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
648f20db6d
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
09b5b6b12d
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
0e6a423955
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
dc8972cd94
log optimal number of frames for given sample rate in AICFilter.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
e4e2231958
Update src/pipecat/audio/vad/aic_vad.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
18b3ee743b
replace os with pathlib.Path in AICFilter for path handling consistency.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
65b8e0e89c
rename enabled to bypass in AICFilter for clarity.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
b77f8b065f
remove voice gain.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
5fd43faec3
add min speech duration.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
abebcf37bd
address feedback.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
ca4e3c79f9
Update pyproject.toml
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
e8d1bec03b
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Andres O. Vela <andresovela@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
f0cc54589e
remove enhancement level parameter from AICFilter.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
22b9aac2ff
use quail model in the example.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
7f86f4ac27
fix class name.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
dcab79753b
even the parameters are fixed, keep aic ready for processing.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
bdded9b026
set SDK ID for telemetry in AIC filter.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
1e1e275fea
address feedback.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
effb6aa8f4
clean up unused imports in audio utils.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
a4a9bae79e
drop v1 support from aic.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
c943ef9261
keep uv.lock as it is.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
f05809520b
Remove outdated AIC Filter and VAD v2 files, migrate to consolidated implementations.
...
Added the new ACIFilter to the same module.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
ec17dc6626
aic-sdk-py v2.
...
# Conflicts:
# uv.lock
# Conflicts:
# examples/foundational/07zd-interruptible-aicoustics.py
# pyproject.toml
# src/pipecat/audio/filters/aic_filter.py
# src/pipecat/audio/vad/aic_vad.py
2026-01-26 08:44:17 +01:00
Gökmen Görgen
4e85e81d9b
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Tobias <76444201+Fl1tzi@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
a1cc88a233
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Tobias <76444201+Fl1tzi@users.noreply.github.com >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
61a230ec53
Update src/pipecat/audio/filters/aic_filter.py
...
Co-authored-by: Stephan Eckes <stephan@steck.tech >
2026-01-26 08:44:17 +01:00
Gökmen Görgen
a13380b574
clean up unused imports in audio utils.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
2a927189d9
reorganize imports.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
a90c15362c
drop v1 support from aic.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
d3bdd2d246
use new model id.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
465ae4f706
keep uv.lock as it is.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
a0d801b658
Remove outdated AIC Filter and VAD v2 files, migrate to consolidated implementations.
...
Added the new ACIFilter to the same module.
2026-01-26 08:44:17 +01:00
Gökmen Görgen
35919a84e3
aic-sdk-py v2.
...
# Conflicts:
# uv.lock
2026-01-26 08:44:17 +01:00
Aleix Conchillo Flaqué
f94a60f381
transports: fix broadcast_frame_class reference
2026-01-25 15:42:09 -08:00
ssillerom
a446bca72d
changes: added OutputTransportUrgentFrame to on closed, removed callback
2026-01-25 21:12:28 +01:00
Sergio Sillero
8ae834366b
Merge branch 'pipecat-ai:main' into feature/genesys_serializer
2026-01-25 21:04:27 +01:00
Mark Backman
a4acc12f91
Align Speechmatics STT TTFB metrics with STT classes
2026-01-24 18:26:34 -05:00
Mark Backman
e93112e76e
Simplify STT finalize handling
2026-01-24 15:28:27 -05:00
Mark Backman
680bcaac66
Merge pull request #3550 from pipecat-ai/mb/update-smart-turn-data-env-var
...
Update env var to PIPECAT_SMART_TURN_LOG_DATA
2026-01-24 13:52:36 -05:00
Mark Backman
d2ac9006a2
Update env var to PIPECAT_SMART_TURN_LOG_DATA
2026-01-24 12:50:42 -05:00
Mark Backman
bcb019e8ab
Add TTFB metrics for STT services ( #3495 )
2026-01-23 18:47:34 -05:00
kompfner
4ea546785f
Merge pull request #3406 from omChauhanDev/fix/openrouter-gemini-messages
...
fix(openrouter): handle multiple system messages for Gemini models
2026-01-23 14:53:59 -05:00
filipi87
f128cdd19a
Adding a changelog entry to the AudioBufferProcessor fix.
2026-01-23 16:16:01 -03:00
filipi87
7921bce4af
Refactoring AudioBufferProcessor to fix audio track synchronization.
2026-01-23 16:15:48 -03:00
Luke Payyapilli
cadced3f79
feat: handle server_content.interrupted for faster barge-in response
2026-01-23 10:41:04 -05:00
Aleix Conchillo Flaqué
8951442b8e
Merge pull request #3534 from pipecat-ai/aleix/claude-skills-pr-description
...
claude: add pr-description skill
2026-01-22 17:34:46 -08:00
Aleix Conchillo Flaqué
7e6e3031e7
claude: add pr-description skill
2026-01-22 13:41:50 -08:00
Akhil
3b3c7aa8cc
LLMAssistantAggregator: preserve non-ASCII characters in JSON output
...
Add ensure_ascii=False to json.dumps() calls for tool call arguments
and function call results to prevent unnecessary unicode escaping.
2026-01-22 15:37:44 -06:00
Aleix Conchillo Flaqué
308829f92b
Merge pull request #3533 from pipecat-ai/aleix/claude-skills-docstring
...
claude: add docstring skill
2026-01-22 12:58:38 -08:00
Aleix Conchillo Flaqué
82a799e63e
claude: add docstring skill
2026-01-22 12:53:38 -08:00
Cale Shapera
6b5bcae86f
change default Inworld TTS model to inworld-tts-1.5-max ( #3531 )
2026-01-22 14:21:15 -05:00
Mark Backman
836073849c
Merge pull request #3527 from weakcamel/patch-1
...
Update README.md - fix Google Imagen URL
2026-01-22 10:46:10 -05:00
Waldek Maleska
b13b65d6e2
Update README.md - fix Google Imagen URL
2026-01-22 15:17:41 +00:00
Mark Backman
3d545b718d
Merge pull request #3344 from omChauhanDev/fix/stt-dynamic-language-update
...
fix: treat language as first-class STT setting
2026-01-22 09:21:56 -05:00
marcus-daily
f2fa5d9733
Updating changelog
2026-01-22 14:17:59 +00:00
marcus-daily
76b774072c
Formatting fixes
2026-01-22 14:17:59 +00:00
marcus-daily
b6341ffaa5
Save Smart Turn input data if SMART_TURN_LOG_DATA is set
2026-01-22 14:17:59 +00:00
Mark Backman
29fae67c9e
Merge pull request #3523 from omChauhanDev/add-location-support-google-tts
...
feat(google): add location parameter to TTS services
2026-01-22 09:12:16 -05:00
Mark Backman
718ea1c15e
Merge pull request #3526 from pipecat-ai/mb/remove-logs
...
Remove application logs
2026-01-22 08:48:07 -05:00
Mark Backman
8e09d94614
Remove application logs
2026-01-22 08:28:52 -05:00
Aleix Conchillo Flaqué
de73e28563
Merge pull request #3510 from omChauhanDev/feat/add-reached-filter-methods
...
feat(task): add additive filter methods for frame monitoring
2026-01-21 21:05:33 -08:00
Aleix Conchillo Flaqué
55250b4f7e
Merge pull request #3521 from pipecat-ai/aleix/claude-changelog-skill
...
claude: initial /changelog skill
2026-01-21 20:50:47 -08:00
Om Chauhan
281145a991
added changelog
2026-01-22 09:55:57 +05:30
Om Chauhan
7bd32e2fe5
feat(google): add location parameter to TTS services
2026-01-22 09:49:19 +05:30
James Hush
8f05d95f50
feat: add video_out_codec parameter for DailyTransport ( #3520 )
...
* feat: add video_out_codec parameter for DailyTransport
Add video_out_codec parameter to TransportParams allowing configuration
of the preferred video codec (VP8, H264, H265) for video output.
When set, this passes the preferredCodec option to Daily's
VideoPublishingSettings during the join operation.
* chore: move video_out_codec parameter to changelog folder (#3522 )
* Initial plan
* Move video_out_codec parameter to changelog/3520.added.md
Co-authored-by: jamsea <614910+jamsea@users.noreply.github.com >
* Revert all CHANGELOG.md changes, keep only changelog/3520.added.md
Co-authored-by: jamsea <614910+jamsea@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: jamsea <614910+jamsea@users.noreply.github.com >
---------
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com >
Co-authored-by: jamsea <614910+jamsea@users.noreply.github.com >
2026-01-22 11:31:07 +08:00
Om Chauhan
87c12f3098
changed frame filter storage type from tuples to sets
2026-01-22 08:43:46 +05:30
Om Chauhan
9c0bf89247
added changelog
2026-01-22 08:43:46 +05:30
Om Chauhan
6e44a2ab49
feat(task): add additive filter methods for frame monitoring
2026-01-22 08:43:46 +05:30
Aleix Conchillo Flaqué
7aa7b86aed
claude: initial /changelog skill
2026-01-21 18:43:04 -08:00
Aleix Conchillo Flaqué
5ad9faeb4c
Merge pull request #3519 from pipecat-ai/aleix/embedded-rtvi-processor
...
automatically add RTVI to the pipeline
2026-01-21 18:17:26 -08:00
Aleix Conchillo Flaqué
9e8f8b45c6
added changelog files for #3519
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
0ee11ad333
tests: disable RTVI in tests by default
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
124a3c35af
RTVIObserver: don't handle some frames direction
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
054e504868
examples(foundational): remove RTVI (automatically added by PipelineTask)
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
e85a00cc0e
PipelineTask: automatically add RTVI processor and RTVI observer
...
If `enable_rtvi` is enabled (enabled by default) and RTVI processor will be
added automatically to the pipeline. Also, and RTVI observer will be
registered.
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
cc61cdbba3
RTVIProcessor: add create_rtvi_observer()
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
62f4708d43
transports: broadcast InputTransportMessageFrame frames
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
ba0ddb1832
FrameProcessor: copy kwargs when broadcasting frame
2026-01-21 18:14:17 -08:00
Aleix Conchillo Flaqué
eacd2a4b71
FrameProcessor: add broadcast_frame_instance()
2026-01-21 18:14:17 -08:00
Mark Backman
7ed110650d
Merge pull request #3516 from okue/minorpatch1
...
refactor(user_mute): remove unnecessary _bot_speaking assignment in _handle_bot_stopped_speaking
2026-01-21 10:33:59 -05:00
okue
4a724379fc
refactor(user_mute): remove unnecessary _bot_speaking assignment in _handle_bot_stopped_speaking
...
The _bot_speaking flag does not need to be set in this method,
so the redundant assignment has been removed.
2026-01-21 23:59:15 +09:00
Aleix Conchillo Flaqué
768d3958dd
Merge pull request #3512 from pipecat-ai/changelog-0.0.100
...
Release 0.0.100 - Changelog Update
2026-01-20 19:32:56 -08:00
aconchillo
5f9ff8bd58
Update changelog for version 0.0.100
2026-01-20 19:21:19 -08:00
Aleix Conchillo Flaqué
59ed422052
Merge pull request #3511 from pipecat-ai/aleix/camb-tts-client-on-start
...
CambTTSService: initialize client during StartFrame
2026-01-20 19:17:45 -08:00
Aleix Conchillo Flaqué
7e0ca113af
CambTTSService: initialize client during StartFrame
2026-01-20 19:07:12 -08:00
Aleix Conchillo Flaqué
13c52e0e6d
Merge pull request #3509 from pipecat-ai/aleix/nvidia-stt-tts-improvements
...
NVIDIA STT/TTS performance improvements
2026-01-20 16:39:12 -08:00
Aleix Conchillo Flaqué
a787fd9cd8
NVIDIATTSService: process incoming audio frame right away
...
Process audio as soon as we receive it from the generator. Previously, we were
reading from the generator and adding elements into a queue until there was no
more data, then we would process the queue.
2026-01-20 15:41:05 -08:00
Aleix Conchillo Flaqué
14495c425a
NVIDIASTTService: no need for additional queue and task
2026-01-20 13:50:17 -08:00
Aleix Conchillo Flaqué
461bd0a2e0
update changelog for #3494 and #3499
2026-01-20 13:26:40 -08:00
Aleix Conchillo Flaqué
bd45ce2b4e
Merge pull request #3499 from lukepayyapilli/fix/livekit-video-queue-memory-leak
...
fix(livekit): prevent memory leak when video_in_enabled is False
2026-01-20 13:21:21 -08:00
Aleix Conchillo Flaqué
a266644b06
Merge pull request #3494 from omChauhanDev/fix/uninterruptible-frame-handling
...
fix: preserve UninterruptibleFrames in __reset_process_queue
2026-01-20 13:19:40 -08:00
Mark Backman
03faadd7f9
Merge pull request #3508 from pipecat-ai/ss/log-daily-ids
...
Log Daily participant and meeting session IDs upon successful join in…
2026-01-20 15:43:48 -05:00
Aleix Conchillo Flaqué
bf43032652
Merge pull request #3504 from pipecat-ai/aleix/nvidia-stt-tts-error-handling
...
NVIDIA STT/TTS error handling
2026-01-20 09:41:08 -08:00
Sunah Suh
fa6f924b31
Log Daily participant and meeting session IDs upon successful join in Daily Transport
2026-01-20 11:31:17 -06:00
Aleix Conchillo Flaqué
a010a020fd
add changelog fo 3504
2026-01-20 09:03:30 -08:00
Aleix Conchillo Flaqué
655006aff5
NvidiaSegmentedSTTService: simplify exception handling
2026-01-20 08:58:14 -08:00
Aleix Conchillo Flaqué
671dc8cd9b
NvidiaSTTService: initialize client on StartFrame
...
Initialize client on StartFrame so errrors are reported within the pipeline.
2026-01-20 08:58:14 -08:00
Aleix Conchillo Flaqué
9a718ded1e
NvidiaTTSService: initialize client on StartFrame
...
Initialize client on StartFrame so errrors are reported within the pipeline.
2026-01-20 08:58:14 -08:00
Aleix Conchillo Flaqué
024809b39a
Merge pull request #3503 from pipecat-ai/aleix/ai-service-start-end-cancel
...
AIService: handle StartFrame/EndFrame/CancelFrame exceptions
2026-01-20 08:56:39 -08:00
Aleix Conchillo Flaqué
6cf0d53d00
AIService: handle StartFrame/EndFrame/CancelFrame exceptions
...
If AIService subclasses implement start()/stop()/cancel() and exception are not
handled, execution will not continue and therefore the originator frames will
not be pushed. This would cause the pipeline to not be started (i.e. StartFrame
would not be pushed downstream) or stopped properly.
2026-01-20 08:54:22 -08:00
kompfner
778dacc9a8
Merge pull request #3486 from pipecat-ai/pk/fix-nova-sonic-reset-conversation
...
Fix `AWSNovaSonicLLMService.reset_conversation()`
2026-01-20 10:07:38 -05:00
Paul Kompfner
06b3ecd2d6
In AWS Nova Sonic service, send the "interactive" user message (which triggers the bot response) only after sending the audio input start event, per the AWS team's recommendation
2026-01-20 09:56:25 -05:00
Paul Kompfner
b4d143e39b
Add CHANGELOG for fixing AWSNovaSonicLLMService.reset_conversation()
2026-01-20 09:56:25 -05:00
Paul Kompfner
c89083e72e
Improve 20e example to ask the bot to give a recap when loading a previous conversation from disk
2026-01-20 09:56:25 -05:00
Luke Payyapilli
1ac811ab32
chore: revert unrelated uv.lock changes
2026-01-20 09:19:43 -05:00
Luke Payyapilli
f6359d460e
chore: install livekit as optional extra in CI instead of dev dep
2026-01-20 09:16:16 -05:00
Aleix Conchillo Flaqué
f03a7175c7
Merge pull request #3501 from pipecat-ai/aleix/improve-eval-numerical-word-prompt
...
scripts(eval): give examples to numerical word answers
2026-01-19 20:22:06 -08:00
Aleix Conchillo Flaqué
aed44c863a
scripts(eval): give examples to numerical word answers
...
Some models need extra help.
2026-01-19 14:37:00 -08:00
ssillerom
fa5da3b0be
change comments
2026-01-19 20:49:23 +01:00
ssillerom
7e82a0cf49
feature: Genesys AudioHook WebSocket protocol serializer for Pipecat
2026-01-19 20:45:22 +01:00
Mark Backman
cddd6d5b0a
Merge pull request #3492 from pipecat-ai/mb/remove-unused-imports
...
Remove unused imports
2026-01-19 14:07:16 -05:00
Mark Backman
11cf891ac8
Manual updates for unused imports
2026-01-19 14:03:22 -05:00
Luke Payyapilli
c89ae717fe
style: fix ruff formatting
2026-01-19 11:13:41 -05:00
Luke Payyapilli
562bdd3084
test: add livekit to dev deps and improve test clarity
2026-01-19 11:11:54 -05:00
Mark Backman
cc4c3650e1
Merge pull request #3491 from pipecat-ai/mb/update-release-evals
...
Add Camb TTS to release evals
2026-01-19 11:04:05 -05:00
Luke Payyapilli
dfc1f09b77
fix(livekit): prevent memory leak when video_in_enabled is False
2026-01-19 11:00:23 -05:00
Mark Backman
0b1a4792b8
Bump to latest azure-cognitiveservices-speech version, 1.47.0
2026-01-19 09:52:28 -05:00
Mark Backman
14bd3b1b32
Set Azure TTS default prosody rate to None
2026-01-19 09:19:57 -05:00
Mark Backman
f733e77496
AzureTTS: work around word ordering issue at 8khz sample rate
2026-01-19 09:13:41 -05:00
Filipi da Silva Fuchter
5fc46cc450
Merge pull request #3493 from omChauhanDev/fix/globally-unique-pc-id
...
fix: make SmallWebRTCConnection pc_id globally unique
2026-01-19 09:04:48 -05:00
Om Chauhan
4a9eb82f92
fix: preserve UninterruptibleFrames in __reset_process_queue
2026-01-18 20:39:13 +05:30
Om Chauhan
990d8386e4
fix: make SmallWebRTCConnection pc_id globally unique
2026-01-18 19:41:51 +05:30
Mark Backman
ce7d823770
Remove unused imports
2026-01-18 08:22:22 -05:00
Mark Backman
0b93c3f900
Add Camb TTS to release evals
2026-01-17 16:27:16 -05:00
Mark Backman
829c5f4604
Merge pull request #3169 from Incanta/hathora
...
Add Hathora STT and TTS services
2026-01-17 16:25:12 -05:00
Mike Seese
dc8ea615d9
add hathora to run-release-evals.py
2026-01-17 10:33:58 -08:00
Mike Seese
a3d206050d
move hathora example as requested
2026-01-17 10:31:08 -08:00
Mike Seese
f48a567873
run the linter
2026-01-17 10:30:47 -08:00
Mark Backman
e69ccd8ea7
Merge pull request #3490 from pipecat-ai/mb/on-user-mute-events
...
Add on_user_mute_started and on_user_mute_stopped events
2026-01-17 11:05:15 -05:00
Mark Backman
11924bb980
Add on_user_mute_started and on_user_mute_stopped events
2026-01-17 11:01:46 -05:00
Mark Backman
af89154e96
Merge pull request #3489 from pipecat-ai/mb/fix-azure-tts-punctuation-spacing
...
fix: AzureTTSService punctuation spacing
2026-01-17 11:00:30 -05:00
Mark Backman
1485ea0831
Merge pull request #3488 from pipecat-ai/mb/on-user-turn-idle
...
Update on_user_idle to on_user_turn_idle
2026-01-17 11:00:16 -05:00
Mark Backman
e22bc777d8
Fix spacing for CJK languages
2026-01-17 09:04:50 -05:00
Mark Backman
043403fe23
fix: AzureTTSService punctuation spacing
2026-01-17 08:18:31 -05:00
Mark Backman
1e1160906e
Update on_user_idle to on_user_turn_idle
2026-01-17 07:04:27 -05:00
Aleix Conchillo Flaqué
f7d3e63063
Merge pull request #3474 from pipecat-ai/fix/optional-member-access-function-call-cancel
...
Fix Pylance reportOptionalMemberAccess in _handle_function_call_cancel
2026-01-16 22:06:45 -08:00
Paul Kompfner
6fa797c8e4
Fix AWS Nova Sonic reset_conversation(), which would previously error out.
...
Issues:
- After disconnecting, we were prematurely sending audio messages using the new prompt and content names, before the new prompt and content were created
- We weren't properly sending system instruction and conversation history messages to Nova Sonic with `"interactive": false`
2026-01-16 22:31:54 -05:00
Mark Backman
473d39791b
Merge pull request #3482 from pipecat-ai/mb/user-idle-in-user-aggregator
...
Add UserIdleController, deprecate UserIdleProcessor
2026-01-16 18:47:10 -05:00
Aleix Conchillo Flaqué
2114abb8c6
add changelog file for 3484
2026-01-16 15:46:29 -08:00
Aleix Conchillo Flaqué
4fb4c26f55
Merge pull request #3484 from amichyrpi/main
...
Remove async_mode parameter from Mem0 storage
2026-01-16 15:44:52 -08:00
Mark Backman
2e8e574ea5
Add UserIdleController, deprecate UserIdleProcessor
2026-01-16 18:44:19 -05:00
Aleix Conchillo Flaqué
84c7e97be2
Merge pull request #3483 from pipecat-ai/aleix/throttle-user-speaking-frame
...
throttle user speaking frame
2026-01-16 15:29:37 -08:00
Amory Hen
a6e7c99d55
Remove async_mode parameter from Mem0 storage
2026-01-17 00:26:38 +01:00
Aleix Conchillo Flaqué
ac3fa7f91f
BaseOuputTransport: minor cleanup
2026-01-16 15:15:49 -08:00
Aleix Conchillo Flaqué
6eadad53b2
BaseInputTransport: throttle UserSpeakingFrame
2026-01-16 15:15:49 -08:00
kompfner
b11150f31f
Merge pull request #3480 from pipecat-ai/pk/fix-grok-realtime-smallwebrtc
...
Fix an issue where Grok Realtime would error out when running with Sm…
2026-01-16 15:46:27 -05:00
Paul Kompfner
836cf60611
Fix an issue where Grok Realtime would error out when running with SmallWebRTC transport.
...
The underlying issue was related to the fact that we were sending audio to Grok before we had configured the Grok session with our default input sample rate (16000), so Grok was interpreting those initial audio chunks as having its default sample rate (24000). We didn't see this issue when using the Daily transport simply because in our test environments Daily took a smidge longer than a reflexive (localhost) pure WebRTC connection, so we would only send audio to Grok *after* we had configured the Grok session with the desired sample rate.
2026-01-16 15:41:33 -05:00
James Hush
1c13ad95a5
Fix Pylance reportOptionalMemberAccess in _handle_function_call_cancel
...
Extract dictionary value to local variable and check for None before
accessing cancel_on_interruption attribute, since the dictionary values
are typed as Optional[FunctionCallInProgressFrame].
2026-01-16 15:04:26 -05:00
Mark Backman
1e8516e91d
Merge pull request #3476 from pipecat-ai/mb/project-urls
...
Update project.urls for PyPI
2026-01-16 14:57:39 -05:00
Mark Backman
32c775311d
Merge pull request #3471 from pipecat-ai/mb/fix-pydantic-2.12-docs
...
Revert pydantic 2.12 extra type annotation
2026-01-16 14:57:24 -05:00
Mark Backman
28d0bb98de
Merge pull request #3472 from pipecat-ai/mb/whisker-dev
...
Add whisker_setup.py setup file to .gitignore
2026-01-16 14:55:48 -05:00
Aleix Conchillo Flaqué
a9a9f3aeaa
Merge pull request #3462 from pipecat-ai/aleix/fix-min-words-transcription-aggregation
...
MinWordsUserTurnStartStrategy: don't aggregate transcriptions
2026-01-16 11:18:23 -08:00
Aleix Conchillo Flaqué
c2a0735975
MinWordsUserTurnStartStrategy: don't aggregate transcriptions
...
If we aggregate transcriptions we will get incorrect interruptions. For example,
if we have a strategy with min_words=3 and we say "One" and pause, then "Two"
and pause and then "Three", this would trigger the start of the turn when it
shouldn't. We should only look at the incoming transcription text and don't
aggregate it with the previous.
2026-01-16 11:16:06 -08:00
Aleix Conchillo Flaqué
41cb53f6c2
Merge pull request #3479 from pipecat-ai/aleix/turns-mute-to-user-mute
...
turns: move mute to user_mute
2026-01-16 11:11:50 -08:00
Aleix Conchillo Flaqué
58552af8fd
examples(foundational): remote STTMuteFilter example
2026-01-16 11:07:20 -08:00
Aleix Conchillo Flaqué
c7ab87b0cc
turns: move mute to user_mute
2026-01-16 11:07:20 -08:00
Mark Backman
11ecc5fdee
Update project.urls for PyPI
2026-01-16 12:48:13 -05:00
kompfner
19fb3eed9f
Merge pull request #3466 from pipecat-ai/pk/fix-aws-nova-sonic-rtvi-bot-output
...
Fix realtime (speech-to-speech) services' RTVI event compatibility
2026-01-16 09:56:13 -05:00
Mark Backman
b292b32374
Merge pull request #3461 from glennpow/glenn/websocket-headers
...
Allow WebsocketClientTransport to send custom headers
2026-01-15 20:26:36 -05:00
Mark Backman
63d1393bb0
Add whisker_setup.py to .gitignore
2026-01-15 20:21:25 -05:00
Glenn Powell
37914cb062
Removed import and added changelog entry.
2026-01-15 16:47:15 -08:00
Mark Backman
ec40696854
Revert pydantic 2.12 extra type annotation
2026-01-15 19:16:15 -05:00
Mike Seese
2249f3d673
add requested changes from code review
2026-01-15 15:27:56 -08:00
Mike Seese
d2df324f29
fix some bugs after testing changes
2026-01-15 15:27:56 -08:00
Mike Seese
67fdb0b659
use parent _settings dict instead of self._params pattern
2026-01-15 15:27:56 -08:00
Mike Seese
e77bdf66f9
add can_generate_metrics functions
2026-01-15 15:27:56 -08:00
Mike Seese
1b3b67779c
switch hathora services to use InputParams pattern
2026-01-15 15:27:55 -08:00
Mike Seese
6c7e386391
remove traced_stt from run_stt
2026-01-15 15:27:55 -08:00
Mike Seese
ba25b279d6
fix issues with PR suggestions
2026-01-15 15:27:55 -08:00
Mike Seese
e7c83c19b6
port turn_start_strategies to the newer user_turn_strategies
2026-01-15 15:27:55 -08:00
Mike Seese
7be7fb49a3
remove turn_analyzer args from transport params
2026-01-15 15:27:54 -08:00
Mike Seese
bcccb4cbb3
put fallback sample_rate value in function arg
2026-01-15 15:27:54 -08:00
Mike Seese
e9f1d951d3
Apply suggestions from code review
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2026-01-15 15:27:54 -08:00
Mike Seese
e5632a9339
transition Hathora service to use the unified API and apply PR feedback
...
add Hathora to root files
Hathora run linter
added hathora changelog
2026-01-15 15:27:53 -08:00
Mike Seese
1510fb4fc0
add Hathora STT and TTS services
2026-01-15 15:26:52 -08:00
Mark Backman
64a1ad2649
Merge pull request #3470 from pipecat-ai/mb/fix-docs-0.0.99
...
Docs fixes after 0.0.99
2026-01-15 17:34:44 -05:00
Mark Backman
4458ca1d24
Mock FastAPI
2026-01-15 17:29:47 -05:00
Mark Backman
21aaa48e62
Fix pydantic issues impacting autodoc
2026-01-15 17:29:47 -05:00
Mark Backman
e75c241030
Merge pull request #3468 from pipecat-ai/mb/camb-cleanuo
...
Clean up CambTTSService
2026-01-15 17:16:28 -05:00
Mark Backman
60216048a8
Docs fixes after 0.0.99
2026-01-15 16:40:42 -05:00
Mark Backman
f3c2e29fb4
Clean up CambTTSService
2026-01-15 15:59:17 -05:00
Paul Kompfner
ce99924be4
Add CHANGELOG entry describing fix for the missing "bot-llm-text" RTVI event when using realtime (speech-to-speech) services
2026-01-15 15:55:39 -05:00
Paul Kompfner
5de80a60d4
Fix "bot-llm-text" not firing when using Grok Realtime
2026-01-15 15:30:00 -05:00
Paul Kompfner
5753762350
Fix "bot-llm-text" not firing when using OpenAI Realtime
2026-01-15 15:16:08 -05:00
Paul Kompfner
885b318b04
Fix "bot-llm-text" not firing when using Gemini Live
2026-01-15 15:03:45 -05:00
Paul Kompfner
7a22d58cf4
Fix "bot-llm-text" not firing when using AWS Nova Sonic
2026-01-15 14:56:50 -05:00
Mark Backman
c8e4b462c9
Merge pull request #3460 from pipecat-ai/mb/reorder-07-examples
...
Renumber the 07 foundational examples
2026-01-15 14:44:21 -05:00
Mark Backman
30a3f42255
Merge pull request #3349 from eRuaro/feat/camb-tts-integration
...
Add Camb.ai TTS integration with MARS models
2026-01-15 14:43:12 -05:00
Neil Ruaro
26ddb2de2f
minimal uv.lock update for camb-sdk
2026-01-16 03:18:01 +08:00
Neil Ruaro
f60eeaa212
reverted uv.lock, updated readthedocs.yaml, copyright year updates
2026-01-16 02:50:18 +08:00
Neil Ruaro
8cf72b36cb
manually add camb-sdk to uv.lock, exclude camb from docs build
2026-01-16 02:26:38 +08:00
Neil Ruaro
38c3bcef96
exclude camb from docs build
2026-01-16 02:20:26 +08:00
Neil Ruaro
80604ba7b6
remove _update_settings method
2026-01-16 02:00:48 +08:00
Neil Ruaro
256c70c631
use UserTurnStrategies
2026-01-16 01:32:08 +08:00
Glenn Powell
0e3532c529
Allow WebsocketClientTransport to send custom headers
2026-01-15 09:31:48 -08:00
Neil Ruaro
9942fcfeb2
updated per PR reviews
2026-01-16 01:20:17 +08:00
Neil Ruaro
003c24ca6e
Make model parameter explicit in docstring example
2026-01-16 01:18:37 +08:00
Neil Ruaro
ed120d014d
Add model-specific sample rates, transport example, and fix audio buffer alignment
2026-01-16 01:18:37 +08:00
Neil Ruaro
e76a3d04f0
Update Camb TTS to 48kHz sample rate
2026-01-16 01:18:37 +08:00
Neil Ruaro
641d17007f
Clean up Camb TTS service and tests
2026-01-16 01:18:37 +08:00
Neil Ruaro
9293b5f24a
Migrate Camb TTS service from raw HTTP to official SDK
...
- Replace aiohttp with camb SDK (AsyncCambAI client)
- Add support for passing existing SDK client instance
- Simplify API: no longer requires aiohttp_session parameter
- Update example to use simplified initialization
- Rewrite tests to mock SDK client instead of HTTP servers
2026-01-16 01:18:37 +08:00
Neil Ruaro
c1f3cbd1d4
Yield TTSAudioRawFrame directly instead of calling private method
2026-01-16 01:18:37 +08:00
Neil Ruaro
78fa2ab65e
Update default voice ID, fix MARS naming, and clean up example
2026-01-16 01:18:37 +08:00
Neil Ruaro
56da2caeed
Update Camb.ai TTS inference options
2026-01-16 01:18:37 +08:00
Neil Ruaro
a541d65255
Update MARS model names to mars-flash, mars-pro, mars-instruct
...
Rename model identifiers from mars-8-* to the new naming convention:
- mars-8-flash -> mars-flash (default)
- mars-8 -> removed
- mars-8-instruct -> mars-instruct
- Added mars-pro
2026-01-16 01:18:37 +08:00
Neil Ruaro
a3d7e9eafe
Address PR feedback: add --voice-id arg, remove test script
...
- Add --voice-id CLI argument to example (default: 2681)
- Remove test_camb_quick.py from examples/ (tests belong in tests/)
- Update docstring with new usage
2026-01-16 01:18:36 +08:00
Neil Ruaro
54933bea2a
Rename changelog to PR number
2026-01-16 01:18:36 +08:00
Neil Ruaro
fcab9899cc
Add changelog entry for Camb.ai TTS integration
2026-01-16 01:18:36 +08:00
Neil Ruaro
be098e85db
Remove non-working Daily/WebRTC example
...
The Daily transport example had authentication issues. Keeping the
local audio example (07zb-interruptible-camb-local.py) which works.
2026-01-16 01:18:36 +08:00
Neil Ruaro
ed0ff46a87
added local test
2026-01-16 01:18:36 +08:00
Neil Ruaro
7ae0d651d6
added cambai tts integration
2026-01-16 01:18:36 +08:00
Mark Backman
efd4432cfb
Renumber the 07 foundational examples
2026-01-15 10:26:17 -05:00
kompfner
24082b84f2
Merge pull request #3453 from pipecat-ai/pk/consistency-pass-on-user-started-stopped-speaking-frames
...
Do a consistency pass on how we're sending `UserStartedSpeakingFrame`…
2026-01-15 09:24:14 -05:00
Aleix Conchillo Flaqué
dcd5840341
Merge pull request #3455 from pipecat-ai/aleix/reset-user-turn-start-strategies
...
UserTurnController: reset user turn start strategies when turn triggered
2026-01-14 19:28:32 -08:00
Aleix Conchillo Flaqué
9e705ce768
UserTurnController: reset user turn start strategies when turn triggered
2026-01-14 18:20:29 -08:00
Mark Backman
965466cc09
Merge pull request #3454 from pipecat-ai/mb/external-turn-strategies-timeout
...
fix to make on_user_turn_stop_timeout work with ExternalUserTurnStrat…
2026-01-14 20:15:31 -05:00
Mark Backman
f3993f1775
fix to make on_user_turn_stop_timeout work with ExternalUserTurnStrategies
2026-01-14 20:10:56 -05:00
Paul Kompfner
e107902b14
Do a consistency pass on how we're sending UserStartedSpeakingFrames and UserStoppedSpeakingFrames. The codebase is now consistent in broadcasting both types of frames up and downstream.
2026-01-14 18:47:15 -05:00
kompfner
e7b5ff49f4
Merge pull request #3447 from pipecat-ai/pk/add-pr-3420-to-changelog
...
Add PR 3420 to CHANGELOG (it was missing)
2026-01-14 15:33:44 -05:00
Paul Kompfner
e33172c44e
Add PR 3420 to CHANGELOG (it was missing)
2026-01-14 15:33:07 -05:00
Mark Backman
3d858e8aa6
Merge pull request #3444 from pipecat-ai/mb/update-quickstart-0.0.99
...
Update quickstart example for 0.0.99
2026-01-14 10:29:55 -05:00
Mark Backman
eab059c49a
Merge pull request #3446 from pipecat-ai/mb/add-3392-changelog
...
Add PR 3392 to changelog, linting cleanup
2026-01-14 10:28:57 -05:00
Mark Backman
4aaff04fb3
Add PR 3392 to changelog, linting cleanup
2026-01-14 09:43:17 -05:00
Mark Backman
cb364f3cab
Update quickstart example for 0.0.99
2026-01-14 08:59:20 -05:00
Mark Backman
a9bfb090c3
Merge pull request #3287 from ashotbagh/feature/asyncai-multicontext-wss
...
Fix TTFB metric and add multi-context WebSocket support for Async TTS
2026-01-14 07:52:52 -05:00
Ashot
c4ae4025f3
Adjustments of Async TTS for multicontext websocket support
2026-01-14 16:33:30 +04:00
Ashot
15067c678d
adapt Async TTS to updated AudioContextTTSService
2026-01-14 15:45:27 +04:00
Ashot
5ae592f38e
Improve Async TTS interruption handling by using AudioContextTTSService class and add changelog fragments
2026-01-14 15:45:27 +04:00
Ashot
9cdbc56be3
Fix TTFB metric and add multi-context WebSocket support for Async TTS
2026-01-14 15:45:27 +04:00
Aleix Conchillo Flaqué
86ed485711
Merge pull request #3440 from pipecat-ai/changelog-0.0.99
...
Release 0.0.99 - Changelog Update
2026-01-13 17:02:41 -08:00
Aleix Conchillo Flaqué
7e1b4a4e90
update cosmetic changelog updates for 0.0.99
2026-01-13 16:59:46 -08:00
aconchillo
4531d517da
Update changelog for version 0.0.99
2026-01-14 00:49:15 +00:00
Aleix Conchillo Flaqué
6fd5847f84
Merge pull request #3439 from pipecat-ai/aleix/uv-lock-2026-01-13
...
uv.lock: upgrade to latest versions
2026-01-13 16:48:07 -08:00
Aleix Conchillo Flaqué
2015eba9b2
uv.lock: upgrade to latest versions
2026-01-13 16:45:44 -08:00
Mark Backman
84f16ee895
Merge pull request #3438 from pipecat-ai/mb/fix-26a
...
Fix 26a foundational
2026-01-13 19:43:50 -05:00
Aleix Conchillo Flaqué
5b2af03b16
Merge pull request #3437 from pipecat-ai/aleix/update-aggregator-logs
...
LLMContextAggregatorPair: make strategy logs less verbose
2026-01-13 16:39:29 -08:00
Mark Backman
b313395dc3
Fix 26a foundational
2026-01-13 19:31:24 -05:00
Aleix Conchillo Flaqué
0d6bdbee10
LLMContextAggregatorPair: make strategy logs less verbose
2026-01-13 15:11:22 -08:00
Aleix Conchillo Flaqué
248dac3a9d
Merge pull request #3420 from pipecat-ai/pk/fix-gemini-3-parallel-function-calls
...
Fix parallel function calling with Gemini 3.
2026-01-13 14:40:33 -08:00
Paul Kompfner
be49a54856
Fast-exit in the fix for parallel function calling with Gemini 3, if we can determine up-front that there's no work to do
2026-01-13 17:32:20 -05:00
Aleix Conchillo Flaqué
bd9ee0d646
Merge pull request #3434 from pipecat-ai/aleix/context-appregator-pair-tuple
...
context aggregator pair tuple
2026-01-13 14:12:51 -08:00
Mark Backman
442e0e582d
Merge pull request #3431 from pipecat-ai/mb/update-realtime-examples-transcript-handler
...
Update GeminiLiveLLMService to push thought frames, update 26a for new transcript events
2026-01-13 17:10:40 -05:00
kompfner
38194c0cff
Merge pull request #3436 from pipecat-ai/pk/remove-transcript-processor-reference
...
Remove dead import of `TranscriptProcessor` (which is now deprecated)
2026-01-13 17:06:17 -05:00
Paul Kompfner
0ebdaba03c
Remove dead import of TranscriptProcessor (which is now deprecated)
2026-01-13 17:02:57 -05:00
Aleix Conchillo Flaqué
ee82377d68
examples: fix 22d to push some CancelFrame and EndFrame
2026-01-13 14:01:53 -08:00
Aleix Conchillo Flaqué
861588e4a3
examples: update all examples to use the new LLMContextAggregatorPair tuple
2026-01-13 14:01:53 -08:00
Aleix Conchillo Flaqué
1ab3bf2ef6
LLMContextAggregatorPair: instances can now return a tuple
2026-01-13 14:01:53 -08:00
Mark Backman
bb00d223c9
Update 26a to use context aggregator transcription events
2026-01-13 17:01:10 -05:00
Aleix Conchillo Flaqué
86fbfaddd1
Merge pull request #3435 from pipecat-ai/aleix/fix-llm-context-create-audio-message
...
LLMContext: fix create_audio_message
2026-01-13 13:59:28 -08:00
Aleix Conchillo Flaqué
5612bf513b
LLMContext: fix create_audio_message
2026-01-13 13:53:34 -08:00
Mark Backman
87d0dc9e24
Merge pull request #3412 from pipecat-ai/mb/remove-41a-b
...
Remove foundational examples 41a and 41b
2026-01-13 16:45:26 -05:00
Paul Kompfner
30fbcfbf71
Rework fix for parallel function calling with Gemini 3
2026-01-13 16:33:59 -05:00
Mark Backman
5d90f4ea06
Merge pull request #3428 from pipecat-ai/mb/fix-tracing-none-values
...
Fix TTS, realtime LLM services could return unknown for model_name
2026-01-13 15:40:10 -05:00
kompfner
f6d09e1574
Merge pull request #3430 from pipecat-ai/pk/request-image-frame-fixes
...
Fix request_image_frame and usage
2026-01-13 15:36:44 -05:00
Mark Backman
b8e48dee7f
Merge pull request #3433 from pipecat-ai/mb/port-realtime-examples-transcript-events
...
Update examples to use transcription events from context aggregators
2026-01-13 15:36:06 -05:00
Mark Backman
a6ccb9ec69
Merge pull request #3427 from pipecat-ai/mb/add-07j-gladia-vad-example
...
Add 07j Gladia VAD foundational example, add to release evals
2026-01-13 15:35:24 -05:00
Mark Backman
66551ebdf5
Merge pull request #3426 from pipecat-ai/mb/changelog-3404
...
Add changelog fragments for PR 3404
2026-01-13 15:34:58 -05:00
Aleix Conchillo Flaqué
21534f7d83
added changelog file for #3430
2026-01-13 12:21:22 -08:00
Mark Backman
d591f9e108
Remove 28-transcription-processor.py
2026-01-13 15:20:59 -05:00
Mark Backman
aa2589d3be
Update examples to use transcription events from context aggregators
2026-01-13 15:19:47 -05:00
Aleix Conchillo Flaqué
9d6067fa78
examples(foundational): speak "Let me check on that" in 14d examples
2026-01-13 12:11:30 -08:00
Aleix Conchillo Flaqué
027e54425a
examples(foundational): associate image requests to function calls
2026-01-13 12:11:30 -08:00
Aleix Conchillo Flaqué
e268c73c41
LLMAssistantAggregator: cache function call requested images
2026-01-13 12:10:08 -08:00
Aleix Conchillo Flaqué
d3c57e2da0
UserImageRawFrame: don't deprecate request field
2026-01-13 11:56:13 -08:00
Aleix Conchillo Flaqué
02eace5a16
UserImageRequestFrame: don't deprecate function call related fields
2026-01-13 11:55:55 -08:00
Mark Backman
15bc1dd999
Update GeminiLiveLLMService to push Thought frames when thought content is returned
2026-01-13 14:13:00 -05:00
Paul Kompfner
b937956dc8
Fix request_image_frame and usage
2026-01-13 13:23:01 -05:00
Mark Backman
efbc0c8510
Fix TTS, realtime LLM services could return unknown for model_name
2026-01-13 12:12:15 -05:00
Himanshu Gunwant
d0f227189c
fix: openai llm model name is unknown ( #3422 )
2026-01-13 11:55:52 -05:00
Mark Backman
41eef5efc4
Add 07j Gladia VAD foundational example, add to release evals
2026-01-13 11:36:15 -05:00
Mark Backman
f00f9d9f1a
Add changelog fragments for PR 3404
2026-01-13 11:29:17 -05:00
Mark Backman
ae59b3ba36
Merge pull request #3404 from poseneror/feature/gladia-vad-events
...
feat(gladia): add VAD events support
2026-01-13 11:26:56 -05:00
Paul Kompfner
6668712f7b
Add evals for parallel function calling
2026-01-13 11:03:38 -05:00
Paul Kompfner
8812686b17
Fix parallel function calling with Gemini 3.
...
Gemini expects parallel function calls to be passed in as a single multi-part `Content` block. This is important because only one of the function calls in a batch of parallel function calls gets a thought signature—if they're passed in as separate `Content` blocks, there'd be one or more missing thought signatures, which would result in a Gemini error.
2026-01-13 11:03:38 -05:00
kompfner
8b0f0b5bb4
Merge pull request #3425 from pipecat-ai/pk/gemini-3-flash-new-thinking-levels
...
Add Gemini 3 Flash-specific thinking levels
2026-01-13 11:02:53 -05:00
Paul Kompfner
f5e8a04e3b
Bump aiortc dependency, which relaxes the constraint on av, which was pinned to 14.4.0, which no longer has all necessary wheels
2026-01-13 10:50:08 -05:00
Mark Backman
a298ce3b41
Merge pull request #3424 from pipecat-ai/mb/tts-append-trailing-space
...
Add append_trailing_space to TTSService to prevent vocalizing trailin…
2026-01-13 10:42:40 -05:00
Mark Backman
31daa889e8
Add append_trailing_space to TTSService to prevent vocalizing trailing punctuation; update DeepgramTTSService and RimeTTSService to use the arg
2026-01-13 10:38:54 -05:00
Paul Kompfner
76a058178e
Add Gemini 3 Flash-specific thinking levels
2026-01-13 09:50:59 -05:00
poseneror
3304b18ac2
Add should_interrupt + broadcast user events
2026-01-13 14:27:35 +02:00
poseneror
b95a6afe77
feat(gladia): add VAD events support
...
Add support for Gladia's speech_start/speech_end events to emit
UserStartedSpeakingFrame and UserStoppedSpeakingFrame frames.
When enable_vad=True in GladiaInputParams:
- speech_start triggers interruption and pushes UserStartedSpeakingFrame
- speech_end pushes UserStoppedSpeakingFrame
- Tracks speaking state to prevent duplicate events
This allows using Gladia's built-in VAD instead of a separate VAD
in the pipeline.
2026-01-13 14:27:35 +02:00
Mark Backman
f6ed7d7582
Merge pull request #3418 from pipecat-ai/mb/speechmatics-task-cleanup
2026-01-12 19:24:56 -05:00
Mark Backman
cd3290df1c
Small cleanup for task creation in SpeechmaticsSTTService
2026-01-12 16:00:32 -05:00
Mark Backman
2296caf529
Merge pull request #3414 from pipecat-ai/mb/changelog-3410
...
Update changelog for PR 3410.changed.md
2026-01-12 13:43:42 -05:00
Mark Backman
90ded6658d
Merge pull request #3403 from pipecat-ai/mb/inworld-tts-add-keepalive
...
InworldTTSService: Add keepalive task
2026-01-12 13:31:24 -05:00
Mark Backman
7e97fb80a5
Merge pull request #3392 from pipecat-ai/mb/websocket-service-connection-closed-error
...
Add reconnect logic to WebsocketService in the event of ConnectionClo…
2026-01-12 13:11:43 -05:00
Mark Backman
b58471fdb1
Add Exotel and Vonage to Serializers in README services list
2026-01-12 12:24:56 -05:00
Aleix Conchillo Flaqué
46b4f9f29b
Merge pull request #3413 from pipecat-ai/aleix/fix-assistant-thought-aggregation
...
LLMAssistantAggregator: reset aggregation after adding the thought, not before
2026-01-12 09:21:42 -08:00
Aleix Conchillo Flaqué
ec20d72aba
LLMAssistantAggregator: reset aggregation after adding the thought, not before
2026-01-12 09:18:13 -08:00
Mark Backman
5743e2a99b
Update changelog for PR 3410.changed.md
2026-01-12 12:15:40 -05:00
Mark Backman
2f429a2e76
Merge pull request #3410 from Vonage/feat/fastapi-ws-vonage-serializer
...
feat: update FastAPI WebSocket transport and add Vonage serializer
2026-01-12 12:10:57 -05:00
Varun Pratap Singh
3e982f7a4a
refactor: rename audio_packet_bytes to fixed_audio_packet_size
2026-01-12 22:11:39 +05:30
Mark Backman
89484e281d
Remove foundational examples 41a and 41b
2026-01-12 10:11:58 -05:00
Varun Pratap Singh
14a115f372
changelog: add fragments for PR #3410
2026-01-12 18:12:27 +05:30
Varun Pratap Singh
e96595fe59
feat: update FastAPI WebSocket transport and add Vonage serializer
2026-01-12 17:50:38 +05:30
Mark Backman
f58d21862b
WebsocketService: Add _maybe_try_reconnect and use for exception cases
2026-01-11 16:43:37 -05:00
Om Chauhan
38506f51f7
fix(openrouter): handle multiple system messages for Gemini models
2026-01-11 21:19:47 +05:30
Mark Backman
aac24ad2d4
InworldTTSService: Add keepalive task
2026-01-10 11:20:20 -05:00
Aleix Conchillo Flaqué
1df9575e20
Merge pull request #3400 from pipecat-ai/aleix/ensure-bot-speaking-flag-is-set
...
BaseOutputTransport: ensure bot speaking flag is set on time
2026-01-10 07:34:26 -08:00
Aleix Conchillo Flaqué
64609fe80f
BaseOutputTransport: ensure bot speaking flag is set on time
2026-01-09 20:40:25 -08:00
Aleix Conchillo Flaqué
533a54e111
Merge pull request #3399 from pipecat-ai/aleix/groq-switch-orpheus
...
GroqTTSService: switch to canopylabs/orpheus-v1-english
2026-01-09 20:39:56 -08:00
Aleix Conchillo Flaqué
b59c3eb470
GroqTTSService: switch to canopylabs/orpheus-v1-english
2026-01-09 18:14:48 -08:00
Aleix Conchillo Flaqué
0366fc35cb
Merge pull request #3398 from pipecat-ai/aleix/examples-foundational-fix-49c-transport
...
examples(foundational): add missing transport.output() to 49c
2026-01-09 17:39:54 -08:00
Aleix Conchillo Flaqué
d86ff4b1ee
Merge pull request #3397 from pipecat-ai/aleix/add-setup-pipeline-task
...
PipelineTask: add external pipeline task setup files
2026-01-09 17:39:25 -08:00
Aleix Conchillo Flaqué
f8040324e1
Merge pull request #3396 from pipecat-ai/aleix/add-pipeline-task-pipeline-property
...
PipelineTask: add pipeline property
2026-01-09 17:38:51 -08:00
Mark Backman
9c81acb159
Track websocket disconnecting status to improve error handling
2026-01-09 20:24:07 -05:00
Aleix Conchillo Flaqué
65395b1112
examples(foundational): add missing transport.output() to 49c
2026-01-09 16:44:04 -08:00
Aleix Conchillo Flaqué
d2696be03b
PipelineTask: add external pipeline task setup files
2026-01-09 16:42:27 -08:00
Aleix Conchillo Flaqué
2da4d420f9
PipelineTask: add pipeline property
2026-01-09 15:47:02 -08:00
Aleix Conchillo Flaqué
a992f95c02
clarify changelog with #3343 fix
2026-01-09 10:37:16 -08:00
Aleix Conchillo Flaqué
edd8e07df6
update changelog with #3343 fix
2026-01-09 10:31:29 -08:00
Aleix Conchillo Flaqué
c813d43da0
Merge pull request #3343 from omChauhanDev/fix/auto-resolve-function-result
...
fix: keeping the Aggregator and Service states synchronized.
2026-01-09 10:04:20 -08:00
Aleix Conchillo Flaqué
c973445ab7
Merge pull request #3385 from pipecat-ai/aleix/context-aggregator-turn-stop-messages
...
user and assistant aggregator turn events
2026-01-09 09:52:48 -08:00
Aleix Conchillo Flaqué
25f6ba76d6
add start timestamp to user and assistant turn messages
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
8f47c569f9
examples(foundational): add 28-user-assistant-turns.py
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
c16801e524
examples(foundational): update 49 series with on_assistant_thought
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
dafcd0448f
added changelog for new assistant turn events
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
24a52375c7
tests: added LLMAssistantAggregator unit tests
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
5f9e95038e
BaseObject: improve logging messages
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
5cbb21afb2
deprecate TranscriptProcessor and related dataclasses and frames
2026-01-09 09:50:21 -08:00
Aleix Conchillo Flaqué
119fab2996
LLMAssistantAggregator: allow thought aggregation without appending to context
2026-01-09 09:42:41 -08:00
Aleix Conchillo Flaqué
38d354c4ed
LLMAssistantAggregator: add assistant turn and thought events
2026-01-09 09:42:41 -08:00
Aleix Conchillo Flaqué
cdb1074e11
LLMAssistantAggregator: no need to use BotStoppedSpeakingFrame
...
The end of turn is already handle with interruptions or with
LLMFullResponseEndFrame. LLMFullResponseEndFrame should never be blocked,
otherwise the assistant would not work.
2026-01-09 09:42:41 -08:00
Aleix Conchillo Flaqué
4b61fd2d7d
LLMUserAggregator: add user turn stopped message argument
...
It is now possible to get the user aggregation when a `on_user_turn_stopped`
event is emitted.
2026-01-09 09:42:41 -08:00
Aleix Conchillo Flaqué
5a0a5c120b
Merge pull request #3394 from pipecat-ai/aleix/base-smart-turn-update-vad-start-secs
...
smartturn: rename on_vad_start_secs_updated to update_vad_start_secs
2026-01-09 09:40:29 -08:00
Aleix Conchillo Flaqué
d92926ae54
smartturn: rename on_vad_start_secs_updated to update_vad_start_secs
2026-01-09 09:34:15 -08:00
Aleix Conchillo Flaqué
b34af5da24
Merge pull request #3372 from pipecat-ai/aleix/add-user-turn-controller-processor
...
add new UserTurnController and UserTurnProcessor
2026-01-09 09:29:10 -08:00
Aleix Conchillo Flaqué
5da1f86575
scripts: add 53-concurrent-llm-evaluation.py to release evals
2026-01-09 09:26:38 -08:00
Aleix Conchillo Flaqué
b0185e3539
tests: improve LLMUserAggregator tests
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
7232da6ba1
tests: added unit tests for UserTurnProcessor
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
9dff75cd44
examples: add 53-concurrent-llm-evaluation.py
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
6038860be0
tests: added unit tests for UserTurnController
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
4653de9f03
tests: rename test_bot_turn_start_strategy to test_user_turn_stop_strategy
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
fef79651ef
turns: add UserTurnProcessor for advanced pipeline user turn management
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
3d54ca0a7c
LLMUserAggregator: user UserTurnController for user turn management
2026-01-09 09:21:28 -08:00
Aleix Conchillo Flaqué
199986815c
turns: add UserTurnController for user turn management
2026-01-09 09:21:28 -08:00
Filipi da Silva Fuchter
0a3c00f68b
Merge pull request #3391 from pipecat-ai/filipi/krisp_followup_improvements
...
Krisp VIVA follow-up improvements
2026-01-09 10:39:23 -05:00
marcus-daily
3e2467eb71
Fixing ruff formatting
2026-01-09 15:07:13 +00:00
marcus-daily
c4cc476c3d
Updating changelog
2026-01-09 15:07:13 +00:00
marcus-daily
cc6ff1ac54
Reverting quickstart to match main
2026-01-09 15:07:13 +00:00
marcus-daily
b075502c4c
Addressing code review comments
2026-01-09 15:07:13 +00:00
marcus-daily
35a99f92ab
Take into account VAD start_secs when passing audio data to Smart Turn, and add an extra 500ms of pre-speech audio for good measure
2026-01-09 15:07:13 +00:00
Mark Backman
4fe0836cf9
Add reconnect logic to WebsocketService in the event of ConnectionClosedError
2026-01-09 09:03:01 -05:00
filipi87
8b7cc65ae6
Mentioning the Krisp Viva improvements in the changelog.
2026-01-09 10:43:01 -03:00
filipi87
4d495ba74f
Fixing ruff format.
2026-01-09 10:32:36 -03:00
filipi87
de5de0b162
Fixed KrispVivaTurn to properly release the Krisp SDK.
2026-01-09 10:31:17 -03:00
filipi87
311da30802
Updating the Krisp Viva example to use Krisp turn model.
2026-01-09 10:19:13 -03:00
Garegin Harutyunyan
16819a5caa
Krisp VIVA SDK Filter and Turn support. ( #3261 )
...
* Krisp VIVA SDK Filter and Turn support.
* Reverted the krisp_filter.py as it's already deprectaed.
* enabled test with krisp_audio mock.
* More review comment fixes.
reverted the state logic in viva filter to be similar to the existing impl on main branch.
Fixed tests, ruff, etc.
* More review comments for Turn detection.
removed integration tests.
* Moved the SDK init/deinit into start/stop
2026-01-09 08:15:08 -05:00
Mark Backman
72a44c2fcd
Merge pull request #3386 from pipecat-ai/mb/deepgram-deprecate-vad-events
...
Deprecate support for vad_events in DeepgramSTTService
2026-01-09 07:56:03 -05:00
Mark Backman
7783b20b91
Merge pull request #3390 from dhruvladia-sarvam/update/sarvam-plugins
2026-01-09 07:11:13 -05:00
dhruvladia-sarvam
962ccbc0d7
fix
2026-01-09 14:26:28 +05:30
Mark Backman
4d61c5d7b2
Deprecate support for vad_events in DeepgramSTTService
2026-01-08 20:32:30 -05:00
Mark Backman
7ca4597ade
Merge pull request #3379 from lukepayyapilli/fix/fastapi-websocket-json-text-handling
...
Fix FastAPIWebsocketTransport to handle both binary and text messages
2026-01-08 17:26:35 -05:00
Luke Payyapilli
f1a22728ab
Add websocket extra to coverage workflow
2026-01-08 17:13:31 -05:00
Luke Payyapilli
ca88fc849f
Add websocket extra to CI for FastAPI test coverage
2026-01-08 17:09:27 -05:00
Luke Payyapilli
ccd795445f
Fix protobuf serializer test to compare attributes instead of frame objects
2026-01-08 17:00:40 -05:00
Luke Payyapilli
1874269a48
Remove FrameSerializerType enum and type property from serializers
2026-01-08 16:54:23 -05:00
Mark Backman
8b20373a8e
Merge pull request #3380 from pipecat-ai/mb/changelog-3366
...
Add changelog fragment for PR 3366
2026-01-08 14:49:47 -05:00
Aleix Conchillo Flaqué
15dcb77a0c
Merge pull request #3364 from pipecat-ai/rajneesh/add-daily-sip-provider-option
...
Add support for specifying sip provider.
2026-01-08 11:47:09 -08:00
Mark Backman
5d2fac9cd7
Add changelog fragment for PR 3366
2026-01-08 14:43:23 -05:00
Mark Backman
682b253760
Merge pull request #3366 from lukepayyapilli/fix/cartesia-allow-none-language
...
Allow language=None in CartesiaTTSService for auto-detection
2026-01-08 14:42:09 -05:00
Luke Payyapilli
f440de82e2
Handle None language in _process_word_timestamps_for_language
2026-01-08 13:59:21 -05:00
Mark Backman
5e0e6822c7
Merge pull request #3360 from pipecat-ai/mb/openai-realtime-send-image
...
Add video input (e.g. image input) support for OpenAI Realtime
2026-01-08 13:26:35 -05:00
Mark Backman
2aadac7a4d
Update OpenAIRealtime image to video to align with GeminiLive
2026-01-08 13:23:08 -05:00
Filipi da Silva Fuchter
1098394486
Merge pull request #3374 from pipecat-ai/filipi/external_turn_controllers_interruptions
...
External turn controllers improvements
2026-01-08 13:05:41 -05:00
Mark Backman
b90a34228f
Update 19c to remove pausing audio and input
2026-01-08 13:00:45 -05:00
Mark Backman
8bf8ebd34b
Remove start_audio_paused from OpenAI Realtime demos, and others
2026-01-08 13:00:45 -05:00
Mark Backman
673d88417c
Change Gemini Live and OpenAI Realtime logging to trace when sending a video frame
2026-01-08 13:00:45 -05:00
Mark Backman
3a7b489208
Add foundational 19c and add to evals
2026-01-08 13:00:45 -05:00
Mark Backman
7ae9eebc34
Add image input support for OpenAI Realtime
2026-01-08 13:00:44 -05:00
Mark Backman
8f83ba5878
Merge pull request #3376 from dhruvladia-sarvam/update/sarvam-plugins
...
headers update
2026-01-08 12:57:32 -05:00
filipi87
b8af3fa214
Improving should_interrupt docs for Speechmatics.
2026-01-08 14:53:29 -03:00
dhruvladia-sarvam
5ddec4f596
fix
2026-01-08 23:07:40 +05:30
dhruvladia-sarvam
8f4b4f4941
fix
2026-01-08 23:04:43 +05:30
dhruvladia-sarvam
953349f262
fix
2026-01-08 22:53:59 +05:30
Luke Payyapilli
b52ae0e56b
Fix FastAPIWebsocketTransport to handle both binary and text messages
2026-01-08 11:25:18 -05:00
dhruvladia-sarvam
893b448534
headers update
2026-01-08 21:09:41 +05:30
Mark Backman
973769b8bc
Merge pull request #3370 from pipecat-ai/mb/fix-azure-tts
...
AzureTTSService cleanup
2026-01-08 09:34:02 -05:00
filipi87
c8fa9d34e1
Adding a changelog entry for the new should_interrupt property.
2026-01-08 10:58:07 -03:00
filipi87
3069deb92f
Allows defining whether Speechmatics should send an interruption when the user’s turn has started.
2026-01-08 10:50:33 -03:00
filipi87
68c9c01747
Allows defining whether Flux should send an interruption when the user’s turn has started.
2026-01-08 10:44:53 -03:00
filipi87
5e8f0baa12
Allows defining whether Deepgram should send an interruption when the user’s turn has started.
2026-01-08 10:36:52 -03:00
Mark Backman
8d1286cc00
Merge pull request #3371 from speechmatics/fix/voice-version-bump
2026-01-08 07:21:43 -05:00
Aleix Conchillo Flaqué
bda4dd339a
Merge pull request #3373 from pipecat-ai/aleix/update-copyright-notices-2026
...
update examples and tests copyright and use a proper dash in 2024-2026
2026-01-07 20:36:40 -08:00
rajneeshksoni
f2e3034d24
Add support for specifying sip provider.
...
optional "provider" field in the RoomSipParams
2026-01-08 09:05:56 +05:30
Aleix Conchillo Flaqué
2626154a64
update examples and tests copyright and use a proper dash in 2024-2026
2026-01-07 19:32:22 -08:00
Sam Sykes
b770b2a419
Changelog
2026-01-07 16:56:56 -08:00
Sam Sykes
158c34b0f9
version bump
2026-01-07 16:54:53 -08:00
Mark Backman
d507c88d3e
Merge pull request #3369 from pipecat-ai/mb/copyright-2026
...
Update copyright date range to 2024-2026
2026-01-07 17:07:05 -05:00
Mark Backman
98f70b775f
Update copyright date range to 2024-2026
2026-01-07 16:58:13 -05:00
Mark Backman
54f4b824e4
Merge pull request #3356 from pipecat-ai/mb/gemini-live-user-transcript-timeout
...
Add timeout for handling user transcript messages
2026-01-07 16:47:23 -05:00
Mark Backman
2aa5307f0a
Add _push_user_transcription to unify the logic to push user transcripts from a single utility function
2026-01-07 16:43:48 -05:00
Mark Backman
6c10d6ef8a
Merge pull request #3367 from pipecat-ai/marcus/smart-turn-v3.2
...
Updated Smart Turn model weights to v3.2
2026-01-07 16:37:53 -05:00
Mark Backman
89b36f2b25
AzureTTSService: Restore metrics generation
2026-01-07 16:33:52 -05:00
Mark Backman
79a6adbcf3
AzureTTSService: Handle first chunk only for timestamps and TTFB metrics
2026-01-07 16:15:01 -05:00
Mark Backman
95f00a3c4b
AzureTTSService: Align error handling with Pipecat norms
2026-01-07 15:45:30 -05:00
Mark Backman
3f8373f76f
AzureTTSService: prevent word timestamp carryover on interruption
2026-01-07 15:39:37 -05:00
Mark Backman
23a9d3f4d7
Merge pull request #3334 from obata-kotobasamurai/fix/azure-tts-word-timestamp
...
Add word-level timestamp support to Azure TTS with race condition fix
2026-01-07 14:48:02 -05:00
Mark Backman
333279f45a
Merge pull request #3328 from speechmatics/fix/speectmatics-vad
...
Update to SpeechmaticsSTTService for `0.0.99`
2026-01-07 14:42:21 -05:00
yukiobata1
add5f51201
updated azure tts.py file
2026-01-08 03:14:37 +09:00
marcus-daily
d1bedef5b3
Updated Smart Turn model weights to v3.2
2026-01-07 17:23:11 +00:00
Mark Backman
54cf0116a8
Merge pull request #3363 from pipecat-ai/mb/update-audo-context-inheritance
...
Update AudioContextTTSService to inherit from WebsocketTTSService
2026-01-07 12:08:47 -05:00
Luke Payyapilli
6b252fb46e
Allow language=None in CartesiaTTSService for auto-detection
2026-01-07 11:50:21 -05:00
Sam Sykes
3e00a16f0f
Remove unused import and correction to docs.
2026-01-07 07:45:26 -08:00
Sam Sykes
ecfd93544a
Correction to UserStartedSpeakingFrame timing.
2026-01-07 07:43:47 -08:00
Sam Sykes
3ec89e49bf
Added changelog for split_sentences and code tidy for end of turn handling.
2026-01-07 07:41:49 -08:00
Mark Backman
8762506e9f
Update AudioContextTTSService to inherit from WebsocketTTSService
2026-01-07 09:10:27 -05:00
yukiobata1
7204bf9914
added changegelog
2026-01-07 13:32:31 +09:00
yukiobata1
f62c262f23
Call start_word_timestamps() when the first audio chunk arrives
2026-01-07 13:10:41 +09:00
Mark Backman
10aa784809
Merge pull request #3351 from okue/fix/stt-model-name-attribute
...
Fix STT model name attribute retrieval in tracing decorator
2026-01-06 16:10:56 -05:00
Filipi da Silva Fuchter
904f5dc183
Merge pull request #3338 from omChauhanDev/fix/smallwebrtc-mute-timeout-spam
...
fix(smallwebrtc): suppress timeout warnings when tracks are disabled
2026-01-06 09:07:52 -05:00
Mark Backman
c61a5e7173
Merge pull request #3346 from pipecat-ai/mb/cartesia-pronunciation-dict
...
Cartesia TTS: Add support for pronunciation_dict_id
2026-01-06 08:52:09 -05:00
Filipi da Silva Fuchter
81b28beef5
Merge pull request #3357 from pipecat-ai/filipi/live_avatar
...
Added support for using the HeyGen LiveAvatar API with the HeyGenTransport
2026-01-06 08:22:39 -05:00
filipi87
0d34356678
Adding a changelog entry for the HeyGen LiveAvatar API change.
2026-01-06 10:19:19 -03:00
filipi87
5412840a93
Added support for using the HeyGen LiveAvatar API with the HeyGenTransport.
2026-01-06 10:16:12 -03:00
yukiobata1
137bbb3d2c
updated tts.py to match mark's version
2026-01-06 21:16:13 +09:00
Mark Backman
5a40054ac2
Merge pull request #3216 from mayurdd/patch-1
...
Adding include_language_detection param to Elevenlabs Realtime STT
2026-01-05 17:01:02 -05:00
Mark Backman
be621fbc5c
Add timeout for handling user transcript messages
2026-01-05 16:58:14 -05:00
Mark Backman
9ab4836601
Merge pull request #3323 from pipecat-ai/mb/changelog-3322
...
Add changelog fragment for PR #3322
2026-01-05 16:55:52 -05:00
mayurdd
4671102833
Addressing the comments
2026-01-05 13:35:37 -08:00
Mayur Sirwani
67401a275b
Adding include_language_detection to Elevenlabs Realtime STT
...
Adding a param to the config while connecting to the session
2026-01-05 13:27:27 -08:00
Mark Backman
c422588071
Merge pull request #3345 from pipecat-ai/mb/avoid-tts-dot
...
Add trailing space to DeepgramTTSService text generation
2026-01-05 15:56:14 -05:00
kompfner
fb12fec899
Merge pull request #3354 from pipecat-ai/pk/fix-aws-nova-sonic-example-for-nova-2-sonic
...
Fix the 20e example to use the proper conversation-start pattern for …
2026-01-05 11:17:57 -05:00
Paul Kompfner
c53c49558f
Fix the 20e example to use the proper conversation-start pattern for the Nova 2 Sonic model
2026-01-05 10:56:08 -05:00
okue
1a26a2daa4
Fix STT model name attribute retrieval in tracing decorator
...
Changed getattr with default value to use 'or' operator for fallback.
This ensures proper model name retrieval when model_name attribute exists but is None or empty.
2026-01-05 17:20:48 +09:00
Mark Backman
d8be1282b5
Cartesia TTS: Add support for pronunciation_dict_id
2026-01-04 09:30:04 -05:00
Mark Backman
91bc5236b5
Add trailing space to DeepgramTTSService text generation
2026-01-04 08:53:48 -05:00
Om Chauhan
1ceb01665f
fix: treat language as first-class STT setting
2026-01-04 11:04:30 +05:30
Om Chauhan
b278957111
fix: broadcast FunctionCallResultFrame, on implicit return
2026-01-03 19:52:25 +05:30
Mark Backman
1c80c739d6
Merge pull request #3335 from pipecat-ai/mb/update-evals-07-variants
...
Add 07 example variants to release evals
2026-01-02 15:32:12 -05:00
Om Chauhan
700a94222b
fix(smallwebrtc): suppress timeout warnings when tracks are disabled
2026-01-01 22:00:08 +05:30
Sam Sykes
d5d2156689
Updated changelog.
2025-12-31 19:07:11 +00:00
Sam Sykes
8203ad08a8
Updated to have default as FIXED for Pipecat VAD.
2025-12-31 19:05:29 +00:00
Mark Backman
31907b90f0
Add 07 example variants to release evals
2025-12-31 09:11:00 -05:00
Mark Backman
7b595f10ce
Merge pull request #3329 from omChauhanDev/deepgram-tts-validation
...
added encoding validation in DeepgramTTSService
2025-12-31 08:20:40 -05:00
yukiobata1
4f93d331b7
Added await to self.start_word_timestamps()
2025-12-31 19:19:21 +09:00
yukiobata1
32c6dccebe
Add word-level timestamp support to Azure TTS with cumulative PTS fix
...
This commit adds word boundary support to AzureTTSService and fixes
the race condition that causes scrambled TTS output across multiple
sentences.
## Features Added
- Change AzureTTSService to inherit from WordTTSService
- Subscribe to Azure SDK's synthesis_word_boundary event
- Emit word-level text with timing information via _words_queue
- Add synthesis lock for sequential sentence processing
## Race Condition Fix
Previously, each sentence's word boundary timestamps reset to 0,
causing downstream components to interleave words when reordering
frames by PTS. This resulted in scrambled output like:
'Hello ! I What am questions AI have assistant...'
The fix adds cumulative audio offset tracking to ensure monotonically
increasing PTS across all sentences:
Sentence 1: pts = 0.1s, 0.5s, 0.8s (cumulative at end: 0.8s)
Sentence 2: pts = 0.9s, 1.2s, 1.5s (0.8s + relative offset)
## Key Changes
- _cumulative_audio_offset: tracks total audio duration
- _handle_word_boundary: adds cumulative offset to timestamps
- _handle_completed: accumulates audio duration for next sentence
- flush_audio: resets cumulative offset at end of LLM response
- _handle_interruption: resets state on user interruption
- run_tts: uses synthesis lock for sequential processing
Fixes #2918
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2025-12-31 18:49:48 +09:00
Aleix Conchillo Flaqué
cbdc2b7d2d
Merge pull request #3330 from pipecat-ai/aleix/update-turn-start-strategies-deprecations
...
update turn start strategies deprecations
2025-12-30 21:04:47 -08:00
Aleix Conchillo Flaqué
66a9dc70c7
LLMUserAggregator: fix turn strategies renaming
2025-12-30 20:59:48 -08:00
Aleix Conchillo Flaqué
846ca500d3
turns: update old turn_start_strategies deprecations
2025-12-30 19:50:10 -08:00
Om Chauhan
bd6afd445d
added changelog
2025-12-31 09:18:18 +05:30
Om Chauhan
0663bbc2fb
added encoding validation in DeepgramTTSService
2025-12-31 08:33:17 +05:30
Sam Sykes
8e7a951af8
updated changelog
2025-12-31 01:36:58 +00:00
Sam Sykes
ba1aeb8f7f
Changelog
2025-12-31 01:31:46 +00:00
Sam Sykes
f7c74cfa80
Updated VAD
2025-12-31 01:28:31 +00:00
Mark Backman
2e700c8576
Merge pull request #3324 from pipecat-ai/mb/bump-small-webrtc-prebuilt-version
...
Bump small-webrtc-prebuilt verison to 2.0.4, update uv.lock
2025-12-30 20:10:11 -05:00
Aleix Conchillo Flaqué
fd2efb3b3a
Merge pull request #3325 from pipecat-ai/aleix/rename-bot-turn-start-to-user-turn-stop
...
turns: rename bot turn start to user turn stop strategies
2025-12-30 14:36:02 -08:00
Aleix Conchillo Flaqué
eb5a797b12
turns: rename bot turn start to user turn stop strategies
2025-12-30 14:33:58 -08:00
Mark Backman
f4626a4fc4
Bump small-webrtc-prebuilt verison to 2.0.4, update uv.lock
2025-12-30 14:19:20 -05:00
Aleix Conchillo Flaqué
fb9a772e33
Merge pull request #3319 from pipecat-ai/aleix/openaillmcontext-backwards-compatibility
...
BaseInputTransport: fix OpenAILLMContext backwards compatibility
2025-12-30 09:35:43 -08:00
Aleix Conchillo Flaqué
4630e76942
ExternalUserTurnStartStrategy: disable interruptions
2025-12-30 09:32:31 -08:00
Aleix Conchillo Flaqué
4dba9ea329
BaseInputTransport: fix OpenAILLMContext backwards compatibility
2025-12-30 09:32:31 -08:00
Mark Backman
233bc23bf9
Merge pull request #3320 from joshwhiton/fix-changelog-numba-pin
...
Fix numba pin wording in changelog
2025-12-30 08:50:06 -05:00
Mark Backman
e0b40a330f
Add changelog fragment for PR #3322
2025-12-30 08:40:28 -05:00
Mark Backman
9c6d0f1be1
Merge pull request #3322 from gui217/fix/rnnoise_filter_handle_empty_audio
...
Fix/rnnoise filter handle empty audio
2025-12-30 08:39:42 -05:00
gui217
32c3298eff
Apply ruff formatting to test file
2025-12-30 13:39:36 +02:00
gui217
ec5fb392c4
Clean up test comments after rnnoise filter fix
2025-12-30 13:35:13 +02:00
gui217
bad8f8aa51
Fix rnnoise filter to handle empty audio
2025-12-30 13:32:36 +02:00
joshwhiton
6a7b6bcded
Fix numba pin wording in changelog
2025-12-30 11:26:22 +07:00
Aleix Conchillo Flaqué
00548769cb
Merge pull request #3318 from pipecat-ai/aleix/llm-user-aggregator-allow-interruptions
...
LLMUserAggregator: also read deprecated allow_interruptions
2025-12-29 18:11:57 -08:00
Aleix Conchillo Flaqué
0a0ab51cc7
LLMUserAggregator: also read deprecated allow_interruptions
2025-12-29 17:57:18 -08:00
Mark Backman
8339c2c2c7
Merge pull request #3317 from pipecat-ai/mb/add-changelog-other
...
Add 'other' changelog category
2025-12-29 20:46:18 -05:00
Aleix Conchillo Flaqué
ad4c22cf44
Merge pull request #3316 from pipecat-ai/aleix/llm-user-aggreagtor-enable-interruptions
...
turns(user): add support for enabling/disabling interruptions
2025-12-29 17:45:56 -08:00
Mark Backman
8ac6421988
Add 'other' changelog category
2025-12-29 20:43:24 -05:00
Aleix Conchillo Flaqué
9fe99ed880
add and update changelog entries
2025-12-29 17:35:10 -08:00
Aleix Conchillo Flaqué
97ab0d4f53
examples: added 52-live-translation without interruptions
2025-12-29 17:30:06 -08:00
Aleix Conchillo Flaqué
ffbbb1b3f5
turns(user): add support for enabling/disabling interruptions
2025-12-29 17:00:03 -08:00
Aleix Conchillo Flaqué
e22a6c9e4d
Merge pull request #3305 from omChauhanDev/fix_unregister_function
...
fix: missing key access by adding existence check
2025-12-29 14:52:41 -08:00
Aleix Conchillo Flaqué
09e79149ea
Merge pull request #3310 from omChauhanDev/fix-task-manager
...
fix: preserve asyncio.Task return value in create_task
2025-12-29 14:43:24 -08:00
Aleix Conchillo Flaqué
c799d63f8c
Merge pull request #3308 from pipecat-ai/aleix/external-turn-start-strategies
...
turns: add external user and bot turn start strategies
2025-12-29 14:42:38 -08:00
Aleix Conchillo Flaqué
bd9a316d7a
transports: don't use interruptions_allowed to avoid deprecation warning
2025-12-29 14:40:00 -08:00
Aleix Conchillo Flaqué
c8f47b4b22
turns: add UserTurnStartedParams and BotTurnStartedParams
2025-12-29 14:32:08 -08:00
Aleix Conchillo Flaqué
cf46431d92
update changelog file
2025-12-29 10:30:41 -08:00
Mark Backman
c28ed2206c
DeepgramSTTService pushes user started/stopped speaking and interruption frames
2025-12-29 10:17:35 -08:00
Mark Backman
30e6a33930
Update VoicemailDetector to use ExternalTurnStartStrategies
2025-12-29 10:17:35 -08:00
Aleix Conchillo Flaqué
46db8e58d6
LLMUserAggregator: fix backwards compatibility with ExternalTurnStartStrategies
2025-12-29 10:17:35 -08:00
Aleix Conchillo Flaqué
e757b4bf6f
tests: added external user and bot turn start strategies
2025-12-29 10:17:35 -08:00
Aleix Conchillo Flaqué
c821e9f8fd
turns: add external user and bot turn start strategies
...
External strategies are strategies where the logic for user turn start and turn
end come from a different processors (e.g. an STT).
2025-12-29 10:17:35 -08:00
Mark Backman
01ce06c756
Merge pull request #3288 from pipecat-ai/mb/inworld-cleanup
...
Inworld TTS service clean up
2025-12-29 13:07:20 -05:00
Mark Backman
4bc490c843
Merge pull request #3289 from pipecat-ai/mb/audio-context-tts-service-base
...
Add AudioContextTTSService base class, update AudioContextWordTTSServ…
2025-12-29 13:05:06 -05:00
Mark Backman
345885fe7d
Merge pull request #3271 from pipecat-ai/mb/changelog-3268
...
Update fragment name for 3268
2025-12-29 13:04:03 -05:00
Mark Backman
6475077fc8
Merge pull request #3313 from pipecat-ai/mb/ultravox-s2s-readme
...
Update Ultravox README link
2025-12-29 13:03:39 -05:00
Mark Backman
d646ca594b
Update Ultravox README link
2025-12-29 11:43:28 -05:00
Mark Backman
7c0d897aa3
Merge pull request #3300 from omChauhanDev/nvidia-expose-use_ssl-param
...
exposed use_ssl param in nvidia services
2025-12-29 09:18:26 -05:00
Aleix Conchillo Flaqué
0e8e3afc85
Merge pull request #3307 from pipecat-ai/aleix/simplify-turns-package-imports
...
turns: simplify imports and don't require full strategy module path
2025-12-28 18:51:23 -08:00
Aleix Conchillo Flaqué
db85043841
Merge pull request #3297 from pipecat-ai/aleix/deprecate-allow-interruptions
...
deprecate allow interruptions
2025-12-28 18:50:15 -08:00
Om Chauhan
a181e01310
fixed: create_task to return coroutine result
2025-12-29 07:46:15 +05:30
Aleix Conchillo Flaqué
5496aa722f
turns: simplify imports and don't require full strategy module path
2025-12-28 16:20:15 -08:00
Aleix Conchillo Flaqué
053f59ed6e
FrameProcessor: deprecated interruptions_allowed
2025-12-28 08:27:02 -08:00
Aleix Conchillo Flaqué
5b93fb9609
PipelineTask: deprecate allow_interruptions parameter
2025-12-28 08:27:02 -08:00
Aleix Conchillo Flaqué
192ede6e34
Merge pull request #3298 from pipecat-ai/aleix/push-user-started-speaking-first
...
push UserStartedSpeakingFrame before interruption
2025-12-28 08:24:50 -08:00
Aleix Conchillo Flaqué
956f004424
Merge pull request #3296 from pipecat-ai/aleix/move-turn-start-strategies-to-aggregator
...
LLMUserAggregator: move turn_start_strategies from PipelineTask
2025-12-28 08:19:23 -08:00
Aleix Conchillo Flaqué
8b861d9143
LLMUserAggregator: move turn_start_strategies from PipelineTask
2025-12-28 08:16:34 -08:00
Aleix Conchillo Flaqué
e5bd55d1d5
Merge pull request #3292 from pipecat-ai/aleix/initial-user-mute-strategies
...
initial user mute strategies
2025-12-28 08:14:48 -08:00
Aleix Conchillo Flaqué
094d9fd7d7
turns(mute): make strategies available in __init__
2025-12-28 08:12:44 -08:00
Aleix Conchillo Flaqué
c7589663b5
deprecate STTMuteFilter in favor of LLMUSerAggregator user mute strategies
2025-12-28 08:12:44 -08:00
Om Chauhan
0f144f48cb
fix: missing key access by adding existence check
2025-12-28 10:28:37 +05:30
Aleix Conchillo Flaqué
a962c4eeba
STTMuteFilter: use FunctionCallsStartedFrame and support multiple function calls
2025-12-27 13:52:30 -08:00
Aleix Conchillo Flaqué
43fc26cf0e
tests: add user mute strategies tests to user aggregator
2025-12-27 13:49:31 -08:00
Aleix Conchillo Flaqué
53b450c1d1
added changelog entry for user mute strategies
2025-12-27 13:49:31 -08:00
Aleix Conchillo Flaqué
0efa36a04e
examples(foundational): added 24-user-mute-strategy.py example
2025-12-27 13:49:31 -08:00
Om Chauhan
edc7db22b6
renamed changelog
2025-12-26 22:21:24 +05:30
Om Chauhan
2c2317de5d
added changelog
2025-12-25 20:23:27 +05:30
Om Chauhan
604384b3ce
exposed use_ssl param
2025-12-25 20:09:42 +05:30
Aleix Conchillo Flaqué
260b7e7959
push UserStartedSpeakingFrame before interruption
2025-12-24 15:33:44 -08:00
Aleix Conchillo Flaqué
0abaae2f07
LLMUserAggregator: no need to reset strategies
...
Turn start strategies are already reset when triggered, so there's no need to
reset them again.
2025-12-24 11:08:43 -08:00
Aleix Conchillo Flaqué
30922d365f
minor turn start strategies cleanup
2025-12-24 11:08:43 -08:00
Aleix Conchillo Flaqué
c33c8d2195
LLMUserAggregator: add support for user mute strategies
2025-12-24 11:08:43 -08:00
Aleix Conchillo Flaqué
5a4236bc71
tests: add user mute strategy tests
2025-12-24 11:08:43 -08:00
Aleix Conchillo Flaqué
1d70275574
initial user mute strategies
2025-12-24 11:08:43 -08:00
Aleix Conchillo Flaqué
ee35ea0966
Merge pull request #3291 from pipecat-ai/aleix/llm-user-aggregator-timeout
...
LLMUserAggregator bot turn start strategies timeout fallback
2025-12-23 18:34:57 -08:00
Aleix Conchillo Flaqué
ffb5895404
tests: add initial tests for universal LLMUserAggregator
2025-12-23 15:51:06 -08:00
Aleix Conchillo Flaqué
1f0357ae5e
LLMUserAggregator: add bot turn start strategies timeout fallback
2025-12-23 15:42:57 -08:00
Mark Backman
44a698cbcc
Add AudioContextTTSService base class, update AudioContextWordTTSService inheritance
2025-12-23 10:36:17 -05:00
Mark Backman
74ab68cb58
Add changelog fragment
2025-12-23 10:15:50 -05:00
Mark Backman
5038ebf205
Clean up _receive_messages to use WebsocketService class
2025-12-23 09:44:21 -05:00
Mark Backman
1da215f576
Inworld TTS service clean up
2025-12-23 09:24:29 -05:00
Aleix Conchillo Flaqué
40493e8ce8
Merge pull request #3286 from pipecat-ai/aleix/improve-turn-analyzer-bot-turn-start-strategy
...
improve turn analyzer bot turn start strategy
2025-12-22 21:46:48 -08:00
Aleix Conchillo Flaqué
4017bfa769
LLMUserAggregator: improve turn_analyzer warning
2025-12-22 21:44:49 -08:00
Aleix Conchillo Flaqué
480a9d092c
TurnAnalyzerBotTurnStartStrategy: make sure to use turn analyzer state
2025-12-22 16:29:48 -08:00
Aleix Conchillo Flaqué
b5fe1c9cd8
fix old interruption/speaking strategies docstrings
2025-12-22 16:19:25 -08:00
Mark Backman
49b53d72a9
Merge pull request #3276 from pipecat-ai/mb/grok-realtime-cleanup
...
GrokRealtimeLLMService cleanup
2025-12-22 18:13:23 -05:00
Aleix Conchillo Flaqué
ae9ee33af9
Merge pull request #3284 from pipecat-ai/aleix/min-words-bot-not-speaking
...
MinWordsUserTurnStartStrategy: single word interrupt if bot not speaking
2025-12-22 15:07:36 -08:00
Mark Backman
01466c19fc
Merge pull request #3285 from pipecat-ai/mb/revert-changes-quickstat
...
Revert turn strategies changes to quickstart
2025-12-22 18:07:30 -05:00
Mark Backman
93689827e9
Revert turn strategies changes to quickstart
2025-12-22 18:05:05 -05:00
Aleix Conchillo Flaqué
a0d5ee3873
MinWordsUserTurnStartStrategy: single word interrupt if bot not speaking
2025-12-22 14:32:21 -08:00
Mark Backman
08a9b434c1
Merge pull request #3277 from pipecat-ai/mb/fix-deprecation-warning-LLMContextAssistantTimestampFrame
...
fix: Separate LLMContextAssistantTimestampFrame from OpenAILLMContext…
2025-12-22 13:51:26 -05:00
Mark Backman
2910b683a4
Fix STT services that rely on VAD stop speaking status to finalize the transcript ( #3283 )
...
Updates to AssemblyAISTTService, CartesiaSTTService, DeepgramSageMakerSTTService, DeepgramSTTService to use VADUser*SpeakingFrame
2025-12-22 12:54:06 -05:00
Mark Backman
0958c658db
Merge pull request #3279 from pipecat-ai/mb/fix-11labs-realtime-stt-vad-speaking
...
fix: Update ElevenLabsRealtimeSTTService to use VADUser speaking frames
2025-12-22 12:11:18 -05:00
Mark Backman
00bb08bacc
fix: Update ElevenLabsRealtimeSTTService to use VADUser speaking frames
2025-12-21 15:57:42 -05:00
Mark Backman
65f23adf4a
fix: Separate LLMContextAssistantTimestampFrame from OpenAILLMContextAssistantTimestampFrame
2025-12-21 09:06:35 -05:00
Mark Backman
5ad8e5436d
Add Grok Voice Agent to README services list
2025-12-20 08:11:41 -05:00
Mark Backman
845b4ad20e
Add 51 foundational to evals
2025-12-20 08:07:25 -05:00
Mark Backman
32c4f914c4
Add event handling and class for response.function_call_arguments.delta
2025-12-20 08:06:39 -05:00
Mark Backman
348fa5a719
Improve SessionProperties initialization: remove voice from args, set default for TurnDetection
2025-12-20 08:02:48 -05:00
Mark Backman
0576783c5e
Improve sample_rate handling in GrokRealtimeLLMService
2025-12-20 07:46:31 -05:00
Mrunmay Chichkhede
d7d979dde1
feat: Add GrokRealtimeLLMService for xAI Grok Voice Agent API ( #3267 )
2025-12-20 07:04:12 -05:00
Sam Sykes
76bae6e699
Update SpeechmaticsSTTService to use the python voice SDK
2025-12-19 19:59:18 -05:00
Mark Backman
f31416c5e4
Update fragment name for 3268
2025-12-19 17:55:10 -05:00
Aleix Conchillo Flaqué
5c779abad2
Merge pull request #3045 from pipecat-ai/aleix/redesign-interruption-strategies
...
introducing user and bot turn start strategies
2025-12-19 14:51:33 -08:00
Aleix Conchillo Flaqué
ec7a7ed048
add RNNoiseFilter to changelog and update pyrnnoise to 0.4.1
2025-12-19 14:48:06 -08:00
Aleix Conchillo Flaqué
bf791527dc
update CHANGELOG for new user/bot turn start strategies
2025-12-19 14:48:06 -08:00
Aleix Conchillo Flaqué
5816f960cc
LLMUserAggregator: add on_user_turn_started/on_bot_turn_started events
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
9bf6668b52
LLMUserAggregator: show error if using turn analyzer in transport
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
4a32aa5266
TurnAnalyzerBotTurnStartStrategy: don't use text on interim transcriptions
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
c9048d3a0f
LLMUserAggregator: prevent consecutive user/bot turn starts
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
9e56d1ac65
TurnStartStrategies: set user and bot strategies defaults if None
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
d22e1f18bb
examples: update with new user and bot turn start strategies
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
83263a30af
llm_response: deprecate old LLMUserAggregatorParams and LLMAssistantAggregatorParams
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
169fc0b568
frames: deprecate emulated field in UserStartedSpeakingFrame/UserStoppedSpeakingFrame
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
a9cca0b934
LLMAssistantAggregatorParams: copy to llm_response_universal
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
dff6b5402a
LLMUserAggregator: use TranscriptionUserTurnStartStrategy for emulated interruptions
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
2cdf207227
turns: add TranscriptionUserTurnStartStrategy
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
a388ff927c
LLMUserAggregator: broadcast user started/stopped speaking frames
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
222ccbb471
SegmentedSTTService: use VAD user started/stopped speaking frames
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
49ebe34599
BaseInputTransport: broadcast SpeechControlParamsFrame
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
c4c4b4107b
TurnAnalyzerBotTurnStartStrategy: broadcast SpeechControlParamsFrame
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
7e6b0839b0
examples(foundational): don't use legacy LLMUserAggregatorParams
2025-12-19 14:47:02 -08:00
Aleix Conchillo Flaqué
d33c72a8b0
LLMUserAggregator: allow external user started/stopped speaking frames
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
962eb73cc4
frames: deprecated EmulateUserStartedSpeakingFrame/EmulateUserStoppedSpeakingFrame
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
3d62b9c203
tests: added user turn start strategies unit tests
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
7e69288898
tests: added bot turn start strategies unit tests
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
76561da850
TranscriptionBotTurnStartStrategy: improve by using interim transcriptions
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
355fcf3282
BaseInputTransport: deprecate the use of turn analyzer in transport
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
359ac302f5
audio(interruptions): deprecate MinWordsInterruptionStrategy
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
223052e6e7
LLMUserAggregator: use new user and bot turn start strategies
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
0f6668d41b
PipelineTask: pass turn start strategies to StartFrame
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
6a62c8d6da
FrameProcessor: add user and bot turn start strategies
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
5dd3af25ac
frames: add turn start strategies to StartFrame
2025-12-19 14:47:01 -08:00
Aleix Conchillo Flaqué
76c79a7dfa
introduce new user and bot turn start strategies
2025-12-19 14:47:01 -08:00
Mark Backman
fac1a05eb5
Merge pull request #3268 from fixie-ai/mike/ttfb
...
Add ttfb tracking for Ultravox
2025-12-19 15:49:57 -05:00
kompfner
917c512aa8
Merge pull request #3263 from pipecat-ai/pk/deprecate-openai-llm-context
...
Deprecate `OpenAILLMContext` and associated things
2025-12-19 13:19:48 -05:00
Mike Depinet
5ec08ff1d8
Add ttfb tracking for Ultravox
2025-12-19 09:26:11 -08:00
Paul Kompfner
9b6f5853cf
Deprecate OpenAILLMContext and associated things
2025-12-19 11:23:06 -05:00
Mark Backman
5e94b20562
Merge pull request #3233 from jaydamani/jay/improve-elevenlabs-services
...
Improve Elevenlabs realtime transcription service
2025-12-18 19:07:43 -05:00
Mark Backman
f6785de120
Merge pull request #3262 from pipecat-ai/mb/renumber-ultravox-foundational
...
Move Ultravox foundational example to 50, add to release evals
2025-12-18 14:31:46 -05:00
Mark Backman
56c58f7302
Move Ultravox foundational example to 50, add to release evals
2025-12-18 13:38:12 -05:00
Aleix Conchillo Flaqué
7f53483f6b
Merge pull request #3257 from pipecat-ai/aleix/daily-transcriptions-track-type
...
add transport source to Daily transcriptions
2025-12-17 19:29:22 -08:00
Aleix Conchillo Flaqué
274db3e05c
DailyTransport: add transport_source to transcription frames
2025-12-17 19:24:08 -08:00
Aleix Conchillo Flaqué
fb6c30156a
pyproject: udpate daily-python to 0.23.0
2025-12-17 19:24:08 -08:00
Aleix Conchillo Flaqué
6c0e4be4ac
Merge pull request #3205 from gui217/feat/rnnoise
...
Feat/rnnoise
2025-12-17 18:22:22 -08:00
Mark Backman
9623575b78
Merge pull request #3255 from pipecat-ai/mb/use-uv-ruff
2025-12-17 17:20:03 -05:00
Mark Backman
31b3bd737a
Update linting scripts to use ruff version installed by uv
2025-12-17 16:31:14 -05:00
Aleix Conchillo Flaqué
f9fef78070
Merge pull request #3253 from pipecat-ai/changelog-0.0.98
...
Release 0.0.98 - Changelog Update
2025-12-17 11:22:35 -08:00
Aleix Conchillo Flaqué
92970c7873
changelog: add PR prefix to PR link
2025-12-17 14:20:34 -05:00
aconchillo
491d298c10
Update changelog for version 0.0.98
2025-12-17 11:16:03 -08:00
Aleix Conchillo Flaqué
c46a20328d
changelog: fix 3230 entry
2025-12-17 11:06:57 -08:00
Aleix Conchillo Flaqué
7e4dbf42e8
Merge pull request #3252 from pipecat-ai/aleix/vision-response-frames
...
add vision response and text frames
2025-12-17 11:01:06 -08:00
Aleix Conchillo Flaqué
159e403ae4
MoondreamService: yield vision response and text frames
2025-12-17 10:42:08 -08:00
Aleix Conchillo Flaqué
d3d50ac580
frames: added vision response and text frames
2025-12-17 10:42:08 -08:00
jay
614d5e0d19
add changelog
2025-12-18 00:08:30 +05:30
jay
83a3295a39
update error handling based on code review
2025-12-18 00:03:47 +05:30
Aleix Conchillo Flaqué
e03e5f3a59
Merge pull request #3251 from pipecat-ai/aleix/more-evals-prompt-eng
...
scripts(evals): more eval prompts improvements
2025-12-17 10:29:50 -08:00
Mark Backman
65e4719cec
Merge pull request #3250 from pipecat-ai/mb/add-pr-link-to-changelog-lines
...
Add PR link to the changelog line item
2025-12-17 12:58:48 -05:00
Aleix Conchillo Flaqué
d07b37b288
scripts(evals): more eval prompts improvements
2025-12-17 09:55:12 -08:00
Mark Backman
ca97d9dc4b
Merge pull request #3249 from pipecat-ai/mb/cleanup-pipecat-version
...
Clean up use of pipecat version
2025-12-17 12:17:53 -05:00
Mark Backman
4c20483a7e
Add PR link to the changelog line item
2025-12-17 12:12:05 -05:00
Mark Backman
6d84f36d05
Merge pull request #3214 from pipecat-ai/mb/update-run-inference
...
Update run_inference to use the provided LLM configuration params
2025-12-17 12:03:50 -05:00
Mark Backman
0b6e8f5bca
Merge pull request #3246 from pipecat-ai/mb/changelog-3245
...
Add changelog fragment for PR 3245
2025-12-17 11:55:54 -05:00
Paul Kompfner
cdd6f5aa6a
Fix Anthropic LLM's run_inference so that it works even when extended thinking is enabled
2025-12-17 11:55:46 -05:00
Mark Backman
f1a0d547ce
Clean up use of pipecat version
2025-12-17 11:49:54 -05:00
mattie ruth backman
b1b7fc6357
Bump the RTVI version to 1.1.0 and add pipecat versioning to the botReady about field
2025-12-17 11:48:02 -05:00
Mark Backman
b3403e884d
Merge pull request #3247 from pipecat-ai/mb/strip-whitespace-simple-text-agg
...
SimpleTextAggregator: Strip whitespace in the returned aggregation
2025-12-17 11:43:37 -05:00
Mark Backman
16e304016d
SimpleTextAggregator: Strip whitespace in the returned aggregation
2025-12-17 11:33:39 -05:00
Mark Backman
21a55f6aae
Update run_inference to use the provided LLM configuration params
2025-12-17 10:58:05 -05:00
Mark Backman
310df33de6
Add changelog fragment for PR 3245
2025-12-17 08:45:16 -05:00
Mark Backman
c8a86059fb
Merge pull request #3245 from simopot/add-soniox-language-hints-strict
...
Add language_hints_strict parameter to SonioxSTTService
2025-12-17 08:43:25 -05:00
Mark Backman
c537d7bafb
Merge pull request #3235 from pipecat-ai/mb/dev-runner-daily-pstn-dialin
...
Added Daily PSTN dial-in support to the development runner
2025-12-17 08:31:42 -05:00
Simo Potinkara
1fce68cef1
Add language_hints_strict parameter to SonioxSTTService
...
Add support for the language_hints_strict parameter in Soniox STT
configuration. When set to true, this parameter strictly enforces
language hints, restricting transcription to only the specified
languages.
2025-12-17 13:24:26 +02:00
Aleix Conchillo Flaqué
ecd9ec4ad2
Merge pull request #3241 from pipecat-ai/aleix/evals-remove-idle-timeout
...
evals remove idle timeout and prompt improvements
2025-12-16 18:04:52 -08:00
Aleix Conchillo Flaqué
db983cb693
BaseObject: log file and line number for uncaught exceptions
2025-12-16 17:29:14 -08:00
Aleix Conchillo Flaqué
5b30f1b1ef
scripts(evals): improve prompts
2025-12-16 17:26:50 -08:00
Aleix Conchillo Flaqué
5f7dbfe775
scripts(evals): don't use on_idle_timeout
2025-12-16 17:26:42 -08:00
Aleix Conchillo Flaqué
2bb6ba59fc
Merge pull request #3240 from pipecat-ai/aleix/cartesia-ensure-word-timestamps-started
...
WordTTSService: make sure word timestamps are always started
2025-12-16 14:02:55 -08:00
Aleix Conchillo Flaqué
ac7b06faba
WordTTSService: make sure word timestamps are always started
2025-12-16 14:00:52 -08:00
Mark Backman
afa7573834
Merge pull request #3239 from pipecat-ai/mb/update-inworld-tts
...
Inworld TTS services: Add websocket TTS class, add word-timestamp ali…
2025-12-16 16:26:43 -05:00
Mark Backman
f2eb9eeb56
Merge pull request #3232 from pipecat-ai/mb/changelog-3230
...
Add changelog fragment for PR 3230
2025-12-16 16:23:17 -05:00
kompfner
9e49e09360
Merge pull request #3226 from pipecat-ai/filipi/elevenlabs_http_voice_settings
...
Fixed an issue where ElevenLabsHttpTTSService was not updating voice settings
2025-12-16 16:07:34 -05:00
kompfner
b5221cd2c1
Merge pull request #3234 from hwuiwon/hw/bugfix-llmcontext
...
Fix LLM context tool audio content handling
2025-12-16 16:04:16 -05:00
Hwuiwon Kim
796f3aeff3
fix
2025-12-16 15:56:08 -05:00
Mark Backman
de94790b94
Merge pull request #3236 from pipecat-ai/mb/websocket-stt-services
...
Update websocket STT services to use the WebsocketSTTService base class
2025-12-16 13:59:52 -05:00
Mark Backman
bd3bf9a00e
Inworld TTS services: Add websocket TTS class, add word-timestamp alignment
2025-12-16 13:47:24 -05:00
kompfner
92f934031d
Merge pull request #3224 from pipecat-ai/pk/simplify-gemini-thinking
...
Clean up logic related to applying Gemini thought signatures to conte…
2025-12-16 13:35:17 -05:00
Mark Backman
11b92d89d0
Add session ID to GladiaSTTService logs, reset bytes_sent counter
2025-12-16 10:06:16 -05:00
Mark Backman
0d1a122582
Add changelog for PR 3236
2025-12-16 09:48:47 -05:00
Mark Backman
24b5efb9d8
Update SonioxSTTService to use WebsocketSTTService
2025-12-16 09:46:35 -05:00
Mark Backman
eeb3b85e39
Update AWSTranscribeSTTService to use WebsocketSTTService
2025-12-16 09:37:31 -05:00
Mark Backman
8255770b6c
Update AssemblyAISTTService to use WebsocketSTTService
2025-12-16 09:30:03 -05:00
Mark Backman
d3f918eb58
Update GladiaSTTService to use WebsocketSTTService
2025-12-16 09:20:53 -05:00
Mark Backman
36c6549426
Added Daily PSTN dial-in support to the development runner
2025-12-15 19:10:00 -05:00
Aleix Conchillo Flaqué
88d909d468
Merge pull request #3231 from pipecat-ai/aleix/improve-evals-assert-on-exit
...
evals: use EndFrame reason field to provide eval result
2025-12-15 13:23:29 -08:00
Aleix Conchillo Flaqué
21e346abe2
scripts(evals): improve eval prompts
2025-12-15 13:21:40 -08:00
Aleix Conchillo Flaqué
70a80847a7
scripts(evals): use future instead of a queue to store eval result
2025-12-15 13:21:28 -08:00
Hwuiwon Kim
27647fc067
Fix LLM context tool conversion and audio content handling
2025-12-15 13:43:57 -05:00
Mark Backman
85fe6d4c34
Add changelog fragment for PR 3230
2025-12-15 13:02:01 -05:00
Mark Backman
4cd971e4bd
Merge pull request #3230 from kstonekuan/fix/smallwebrtcrequesthandler-return-type
...
Fix return type for SmallWebRTCRequestHandler.handle_web_request
2025-12-15 13:01:45 -05:00
jay
7e424d750e
improve error handling to log all error types
2025-12-15 23:18:44 +05:30
jay
59c3abeb92
fix issue with infinite loop when websocket disconnects
2025-12-15 23:06:35 +05:30
Paul Kompfner
54926f390d
Make image writing to and reading from LLMContext more robust; let's allow storing in context image types other than JPEG, meaning not lossily and unnecessarily re-encoding non-JPEG images as JPEG.
2025-12-15 10:39:36 -05:00
Kingston Kuan
50362ca37e
Merge branch 'pipecat-ai:main' into fix/smallwebrtcrequesthandler-return-type
2025-12-15 16:41:59 +08:00
Aleix Conchillo Flaqué
a14c911fb2
scripts(evals): improve eval assertion on exit
2025-12-14 12:37:05 -08:00
Aleix Conchillo Flaqué
a5e42337a4
frames: EndFrame and CancelFrame reason is now Any
2025-12-14 12:16:14 -08:00
Aleix Conchillo Flaqué
4f848e9631
Merge pull request #3227 from fixie-ai/mike/upstream
...
Add Ultravox service
2025-12-13 18:29:02 -08:00
Kingston
93df7044fa
fix return type for SmallWebRTCRequestHandler
2025-12-13 22:11:06 +08:00
Paul Kompfner
e604e9b490
Support conversations with Gemini 3 Pro Image (model "gemini-3-pro-image-preview").
...
Prior to this change, after the model generated an image the conversation would not be able to progress. It would stall out because we were never storing the image in context, so the model would never realize it already did the work of generating an image. We didn't run into issues with Gemini 2.5 Flash Image, because that model always followed up an image with a text message.
2025-12-12 18:20:17 -05:00
Mike Depinet
2e4fa3f8db
PR comments
...
Also satisfy some Pyright complaints and update default model
2025-12-12 15:03:31 -08:00
Mark Backman
5f6448a8a4
Merge pull request #3228 from pipecat-ai/mb/gemini-live-update
...
Update GeminiLiveLLMService model to gemini-2.5-flash-native-audio-pr…
2025-12-12 14:32:45 -05:00
Mark Backman
6cda357ce8
Remove timestamp check from TestThoughtTranscription
2025-12-12 14:28:39 -05:00
Mark Backman
7e87f61d17
Update GeminiLiveLLMService model to gemini-2.5-flash-native-audio-preview-12-2025
2025-12-12 14:18:57 -05:00
Mike Depinet
ccdf83800b
Rename changelog entries
2025-12-12 10:21:56 -08:00
Mike Depinet
4b81be7acf
Add Ultravox service ( #1 )
...
Adds support for using Ultravox Realtime as a speech-to-speech service.
Also removes the deprecated Ultravox speech-to-text vllm model integration to avoid confusion.
2025-12-12 10:16:15 -08:00
Paul Kompfner
abc2ad8cbc
Avoid printing out entire thought signatures in logs
2025-12-12 13:01:45 -05:00
Paul Kompfner
64471d65f8
Clean up logic related to applying Gemini thought signatures to context messages
2025-12-12 12:53:11 -05:00
Filipi Fuchter
3c4991a41f
Mentioning the ElevenLabsHttpTTSService voice settings fix in the changelog.
2025-12-12 14:48:32 -03:00
Filipi Fuchter
71d6516a14
Fixed an issue where ElevenLabsHttpTTSService was not updating voice settings when receiving a TTSUpdateSettingsFrame.
2025-12-12 14:46:24 -03:00
Filipi da Silva Fuchter
22288648e6
Merge pull request #3210 from pipecat-ai/filipi/heygen_liveavatar
...
Adding support for the HeyGen LiveAvatar API
2025-12-12 09:19:58 -03:00
Filipi Fuchter
a6ee040d82
Adding the changelog mentioning the HeyGen changes.
2025-12-12 08:58:48 -03:00
Filipi Fuchter
87fc860cd5
Changing the HeyGenVideoService example to use the live avatar API.
2025-12-12 08:52:10 -03:00
Filipi Fuchter
b25ad21941
Refactoring HeyGenVideoService and HeyGenTransport to work with both APIs.
2025-12-12 08:51:35 -03:00
Filipi Fuchter
debcea3baa
Adding the new HEYGEN_LIVE_AVATAR_API_KEY to the requested environment's variables.
2025-12-12 08:51:01 -03:00
Filipi Fuchter
c2abe42a64
Adding support for the HeyGen LiveAvatar API.
2025-12-12 08:49:52 -03:00
Filipi Fuchter
56dee06a29
Refactored the Interactive Avatar API to extend the HeyGen base API.
2025-12-12 08:49:16 -03:00
Filipi Fuchter
60cc14cafd
Created HeyGen base API to support both Interactive Avatar and LiveAvatar.
2025-12-12 08:48:39 -03:00
kompfner
1e98094394
Merge pull request #3175 from pipecat-ai/pk/thinking-exploration
...
Additional functionality related to thinking, for Google and Anthropic LLMs.
2025-12-11 17:15:37 -05:00
Paul Kompfner
ccdd6cde52
Fix a couple of typos in comments
2025-12-11 17:05:09 -05:00
Paul Kompfner
12979293ad
Add thinking examples to eval suite
2025-12-11 15:58:48 -05:00
Paul Kompfner
28248e9b00
Split up thinking examples so that there isn't an llm command-line arg for controlling which LLM to use. This change is preparation for adding these examples to our suite of evals.
2025-12-11 15:07:35 -05:00
Paul Kompfner
0e88ad672e
Add ThoughtTranscriptionMessage.role, which is always "assistant"
2025-12-11 14:41:16 -05:00
kompfner
f41c3dcbc3
Merge pull request #3212 from pipecat-ai/pk/nova-2-sonic
...
Nova 2 Sonic support
2025-12-11 09:36:50 -05:00
Mark Backman
645e1802f8
Merge pull request #3219 from pipecat-ai/mb/deprecate-fal-smart-turn
2025-12-10 13:13:44 -05:00
Mark Backman
6636da682c
Merge pull request #3085 from rimelabs/feature/rimeNonJsonTTsservice
...
Add RimeNonJsonTTSService for non-JSON WebSocket API support
2025-12-10 10:38:39 -05:00
Mark Backman
10a32c943f
deprecate: FalSmartTurnAnalyzer and LocalSmartTurnAnalyzer
2025-12-10 08:14:28 -05:00
Gokul Js
455579ffcc
Refactor RimeNonJsonTTSService to extend InterruptibleTTSService, removing dependency on WebsocketTTSService and streamlining audio interruption handling.
2025-12-10 04:56:52 +05:30
Paul Kompfner
c37da6ab78
In the AWS Nova Sonic example, shorten the simulated weather function call delay
2025-12-09 16:53:18 -05:00
Paul Kompfner
1892854516
In the AWS Nova Sonic example, send back "location" from the weather-fetching function to help the model associate a tool response with a tool call...if you interrupt the model while more than one function call is outbound, it seemingly can get confused about which tool result goes which call.
2025-12-09 16:27:23 -05:00
Mark Backman
735e597bf2
Merge pull request #3209 from pipecat-ai/hush/07n-prompt
...
Update system prompt in Gemini example to be more instructive
2025-12-09 15:45:46 -05:00
Vanessa Pyne
52980a69c5
Merge pull request #3215 from pipecat-ai/vp-user-bot-latency-observer-internal-var-change
...
user-bot-latency log observer internal var change
2025-12-09 13:03:29 -06:00
vipyne
ff2f1dac82
user-bot-latency log observer internal var change
2025-12-09 12:34:38 -06:00
Paul Kompfner
3cbfbb997e
Added CHANGELOG for AWS Nova 2 Sonic-related changes
2025-12-09 12:57:19 -05:00
Paul Kompfner
3e66cb50e0
Update AWS Nova Sonic example to showcase async tool calling
2025-12-09 12:44:21 -05:00
Paul Kompfner
b821dd2507
Fix a bug in AWSNovaSonicLLMService where we would mishandle cancelled tool calls in context
2025-12-09 12:12:55 -05:00
Paul Kompfner
0c5bccd1f1
Changes related to Nova 2 Sonic's support for the model speaking first
2025-12-09 11:55:23 -05:00
Paul Kompfner
926514ca18
Add support to AWSNovaSonicLLMService for new "endpointingSensitivity" parameter.
2025-12-09 11:26:43 -05:00
Paul Kompfner
ca5e668f4a
Update AWSNovaSonicLLMService docstring with more (and more up-to-date) info
2025-12-09 10:14:27 -05:00
Paul Kompfner
53de6c0b9a
Update list of supported regions in 40-aws-nova-sonic.py
2025-12-09 09:46:53 -05:00
Paul Kompfner
b22ac8292f
Update default model in AWSNovaSonicLLMService to "amazon.nova-2-sonic-v1:0"
2025-12-09 09:38:47 -05:00
James Hush
83877ab1e6
Update system prompt in Gemini example to be more instructive
...
Changed the on_client_connected system message from a direct greeting to
an instruction that tells the AI to introduce itself, giving the LLM more
flexibility in how it starts the conversation.
2025-12-09 09:04:10 +01:00
gui217
1c0e25a90d
fix unit tests
2025-12-09 09:56:20 +02:00
Gokul Js
2a6a0d83db
Update docstring in RimeNonJsonTTSService to clarify the focus on the current plain text protocol and note potential future support for JSON WebSocket.
2025-12-09 02:49:37 +05:30
Gokul Js
6ca117a3c1
Remove unused import of 'language' in tts.py to clean up the code and improve readability.
2025-12-09 02:45:17 +05:30
Gokul Js
4fcb099fd7
Add RimeNonJsonTTSService to support non-JSON streaming mode, enabling WebSocket streaming for the Arcana model.
2025-12-09 02:43:57 +05:30
Paul Kompfner
c5ff5cc219
Update CHANGELOG
2025-12-08 16:09:59 -05:00
Aleix Conchillo Flaqué
88289f578a
Merge pull request #3208 from pipecat-ai/thor/add-client-identification
...
add Gemini client identification
2025-12-08 13:05:04 -08:00
Paul Kompfner
229ff794d6
Better handle Gemini non-function thought signatures
2025-12-08 15:56:40 -05:00
Aleix Conchillo Flaqué
096db3eb6c
Merge pull request #3207 from pipecat-ai/aleix/voicemail-conversation-detected-event
...
VoicemailDetector: add on_conversation_detected event
2025-12-08 11:59:45 -08:00
Aleix Conchillo Flaqué
cfd1cada8c
VoicemailDetector: add on_conversation_detected event
2025-12-08 11:57:14 -08:00
Aleix Conchillo Flaqué
ee435b6f1e
update CHANGELOG
2025-12-08 11:54:09 -08:00
Aleix Conchillo Flaqué
d289b38ba7
tests(google): mock the new pipecat.version()
2025-12-08 11:51:01 -08:00
Aleix Conchillo Flaqué
b0f63c3785
pipecat: add version() function
2025-12-08 11:51:01 -08:00
Paul Kompfner
1249ee3de3
Better handle Gemini non-function thought signatures
2025-12-08 13:07:25 -05:00
Vanessa Pyne
b09d8bd595
Merge pull request #3206 from pipecat-ai/vp-update-bot-latency-observer
...
use VADUserStarted/StoppedSpeakingFrame s in user_bot_latency_log_observer.py
2025-12-08 11:37:56 -06:00
vipyne
540a48b1b6
use VADUserStarted/StoppedSpeakingFrame s in user_bot_latency_log_observer.py
2025-12-08 11:37:31 -06:00
Paul Kompfner
aa0529ff82
Update comments for accuracy
2025-12-08 11:47:06 -05:00
Paul Kompfner
7e92597c0e
Remove LLMThoughtSignatureFrame in favor of using the more generic LLMMessagesAppendFrame
2025-12-08 11:10:05 -05:00
Gokul Js
99f89351fa
Add support for non-JSON streaming mode in RimeTTSService, enabling both JSON and raw audio WebSocket streaming for enhanced performance and flexibility.
2025-12-08 21:32:50 +05:30
Gokul Js
0b4d984be6
Standardize error handling in RimeNonJsonTTSService by replacing specific error messages with a generic "Unknown error occurred" format, enhancing consistency in error reporting.
2025-12-08 21:24:30 +05:30
Paul Kompfner
17203ba3e6
Change FunctionInProgressFrame.llm_specific_extra to a more generic FunctionInProgressFrame.append_extra_context_messages.
2025-12-08 10:50:19 -05:00
Gokul Js
924831089c
Enhance error handling in RimeNonJsonTTSService by standardizing error messages for improved clarity and consistency in reporting.
2025-12-08 21:17:01 +05:30
Gokul Js
329b8ac426
Refactor error handling in RimeNonJsonTTSService to provide a more generic error message, improving clarity in error reporting.
2025-12-08 21:06:48 +05:30
Paul Kompfner
61674d7758
Add process_thought constructor argument to TranscriptProcessor to control whether to handle thoughts in addition to assistant utterances. Defaults to False.
2025-12-08 10:27:36 -05:00
Gokul Js
b9990811b5
Merge branch 'main' into feature/rimeNonJsonTTsservice
2025-12-08 20:54:01 +05:30
Paul Kompfner
8ccc2cbf31
Add unit tests for ThoughtTranscriptProcessor
2025-12-08 10:14:31 -05:00
Gokul Js
f4e33fc8dd
Update docstrings in RimeNonJsonTTSService for clarity and consistency, specifying 'Non-JSON' in relevant descriptions.
2025-12-08 20:32:13 +05:30
Gokul Js
5bfea84bd5
Refactor RimeNonJsonTTSService to extend WebsocketTTSService, enhancing WebSocket functionality and improving code clarity
2025-12-08 20:30:46 +05:30
Paul Kompfner
ef703e9d16
Get rid of ThoughtTranscriptProcessor, moving its logic into AssistantTranscriptProcessor instead
2025-12-08 09:59:32 -05:00
Paul Kompfner
44aa11737b
Minor docstring update for accuracy
2025-12-08 09:29:10 -05:00
Paul Kompfner
49f1f7d6a2
Added CHANGELOG entry describing new thinking-related functionality
2025-12-08 09:29:10 -05:00
Paul Kompfner
4ea51ff67c
Slight refactor of handling thought-signature-containing special context messages in the Gemini adapter
2025-12-08 09:29:10 -05:00
Paul Kompfner
747bd4f737
Tweak the prompt of the thinking + functions example to not confuse Gemini as much (Gemini found the original prompt a bit ambiguous, it seems)
2025-12-08 09:29:10 -05:00
Paul Kompfner
15f5583fd2
Simplify, at the expense of a bit of not-yet-needed flexibility: rather than associating a loose thought_metadata with each thought, use a signature. Thought signatures are the only "thought metadata" we use today.
2025-12-08 09:29:10 -05:00
Paul Kompfner
c8c6f424cd
Add support for Gemini 3 Pro non-function-call-related thought signatures
2025-12-08 09:29:10 -05:00
Paul Kompfner
0cdf0c4504
Bump Google GenAI library version to at least 1.51.0, as that's the version where thinking_level—required for controlling Gemini 3 Pro thinking—is introduced
2025-12-08 09:29:10 -05:00
Paul Kompfner
217f03b9cc
Add additional functionality related to "thinking", for Google and Anthropic LLMs.
...
Thinking, sometimes called "extended thinking" or "reasoning", is an LLM process where the model takes some additional time before giving an answer. It's useful for complex tasks that may require some level of planning and structured, step-by-step reasoning. The model can output its thoughts (or thought summaries, depending on the model) in addition to the answer. The thoughts are usually pretty granular and not really suitable for being spoken out loud in a conversation, but can be useful for logging or prompt debugging.
Here's what's added:
1. New typed input parameters for Google and Anthropic LLMs that control the models' thinking behavior (like how much thinking to do, and whether to output thoughts or thought summaries).
2. New frames for representing thoughts output by LLMs.
3. A generic mechanism for associating extra LLM-specific data with a function call in context, used specifically to support Google's function-call-related "thought signatures", which are necessary to ensure thinking continuity between function calls in a chain (where the model thinks, makes a function call, thinks some more, etc.)
4. A generic mechanism for recording LLM thoughts to context, used specifically to support Anthropic, whose thought signatures are expected to appear alongside the text of the thoughts within assistant context messages.
5. An expansion of `TranscriptProcessor` to process LLM thoughts in addition to user and assistant utterances.
2025-12-08 09:29:01 -05:00
Gokul Js
12093fcffc
Update default sample_rate parameter in RimeNonJsonTTSService to None for flexibility
2025-12-08 19:50:38 +05:30
Gokul Js
e5fb643cf5
Improve docstring formatting in RimeNonJsonTTSService for better readability
2025-12-08 19:45:13 +05:30
Mark Backman
4517475db7
Merge pull request #3197 from pipecat-ai/mb/cartesia-stt-cleanup
...
Clean up CartesiaSTTService
2025-12-08 08:53:40 -05:00
gui217
c48858742a
clean up
2025-12-08 11:51:20 +02:00
gui217
90ef758522
align uv.lock
2025-12-08 11:40:08 +02:00
gui217
3974937352
align uv.lock
2025-12-08 11:39:40 +02:00
gui217
d64ab08bc4
chore: update uv.lock
2025-12-08 11:39:17 +02:00
gui217
6603ecfe29
chore: update uv.lock with pyrnnoise and restore revision 3
2025-12-08 11:39:10 +02:00
gui217
d3ae0b6a14
rebase
2025-12-08 11:36:44 +02:00
Aleix Conchillo Flaqué
92b6e8d66b
Merge pull request #3189 from pipecat-ai/aleix/introduce-uninterruptible-frames
...
introduce uninterruptible frames
2025-12-07 14:02:35 -08:00
Aleix Conchillo Flaqué
3be1a7afaa
Merge pull request #3202 from pipecat-ai/aleix/remove-manta
...
README: remove manta badge
2025-12-07 14:00:13 -08:00
thorwebdev
15df3c06e8
chore: add test.
2025-12-06 22:36:04 -05:00
Aleix Conchillo Flaqué
f0af0a6b96
README: remove manta badge
2025-12-05 16:16:19 -08:00
Mark Backman
4cefe1357c
Merge pull request #3201 from pipecat-ai/changelog-0.0.97
...
Release 0.0.97 - Changelog Update
2025-12-05 18:49:15 -05:00
markbackman
4df0a9bf73
Update changelog for version 0.0.97
2025-12-05 18:47:21 -05:00
Mark Backman
9ef139d020
Merge pull request #3200 from pipecat-ai/mb/improve-changelog-template
...
Fix newlines between sections in changlelog template
2025-12-05 18:42:52 -05:00
Mark Backman
9103d4ae05
Fix newlines between sections in changlelog template
2025-12-05 18:40:49 -05:00
Aleix Conchillo Flaqué
bd63b6cefa
Merge pull request #3198 from pipecat-ai/aleix/examples-14i-new-model
...
examples(foundational): update 14i-fireworks with new serverless model
2025-12-05 15:33:12 -08:00
Aleix Conchillo Flaqué
4d03270bc3
examples(foundational): update 14i-fireworks with new serverless model
2025-12-05 15:31:29 -08:00
Mark Backman
0debcee761
Clean up CartesiaSTTService
2025-12-05 18:12:11 -05:00
Mark Backman
6aee72c5b4
Merge pull request #3196 from pipecat-ai/mb/docs-cleanup-prep-0.0.97
...
Docs cleanup before 0.0.97 release
2025-12-05 15:16:36 -05:00
Mark Backman
8d62cfb1b6
Merge pull request #3195 from ivaaan/add-hume-header
...
Add tracking headers to Hume service
2025-12-05 14:50:18 -05:00
ivaaan
41214236ab
add changelog
2025-12-05 20:47:04 +01:00
Mark Backman
b25963a63b
Docs cleanup before 0.0.97 release
2025-12-05 14:19:26 -05:00
ivaaan
8c6ef21d84
add stop, cancel
2025-12-05 20:13:58 +01:00
thorwebdev
f729b1625b
chore: move into services file.
2025-12-05 13:31:58 -05:00
ivaaan
0ffaa09c95
add tracking headers to Hume service
2025-12-05 19:00:47 +01:00
Aleix Conchillo Flaqué
f6e31b7e89
Merge pull request #3185 from pipecat-ai/fix/websocket-service-cancelled-error-handling
...
fix(websocket): handle CancelledError to prevent reconnection on shutdown
2025-12-05 09:25:49 -08:00
Aleix Conchillo Flaqué
49b2b12e04
frames: change function call frame base types
2025-12-05 09:22:29 -08:00
Aleix Conchillo Flaqué
7ad3969690
introduce UninterruptibleFrame frames
2025-12-05 09:21:36 -08:00
thorwebdev
af089a65ae
feat: add Gemini client identification.
2025-12-05 12:06:28 -05:00
Aleix Conchillo Flaqué
48422dd442
WebsocketService: avoid reconnection on shutdown
2025-12-05 09:03:04 -08:00
Vanessa Pyne
fed6a8b669
Merge pull request #3187 from pipecat-ai/vp-mcp-filter-followup
...
add mcp filter example and changelog
2025-12-05 10:58:19 -06:00
vipyne
82e0253a62
add mcp filter example and changelog
2025-12-05 10:56:59 -06:00
Vanessa Pyne
a7f26dca60
Merge pull request #3152 from RuiDaniel/mcp_client_filters
...
Add filters to MCP Client
2025-12-05 10:50:27 -06:00
Vanessa Pyne
459ef27f3f
Merge pull request #3079 from pipecat-ai/vp-add-exact-model-version-function
...
set full model name for base openai models
2025-12-05 10:48:53 -06:00
Mark Backman
464cfa5ccb
Merge pull request #3188 from pipecat-ai/mb/improve-changelog-process
...
Auto-generate changelog from fragments
2025-12-05 11:42:25 -05:00
Mark Backman
9289881a80
Remove 3120.added.md
2025-12-05 11:35:50 -05:00
Mark Backman
34033cd454
Add new changelog entries
2025-12-05 11:35:50 -05:00
Mark Backman
47c21c9579
Delete README.md in changelog
2025-12-05 11:35:50 -05:00
Mark Backman
3b0bcf0b66
Validate fragment types match the expected types
2025-12-05 11:35:50 -05:00
Mark Backman
c4a8308027
Fail when no changelog fragments are available
2025-12-05 11:35:50 -05:00
Mark Backman
e9f76dcaf2
Set the date automatically when the workflow runs, leaving an optional override
2025-12-05 11:35:50 -05:00
Mark Backman
21b2229b2b
Auto-generate changelog from fragments
2025-12-05 11:35:49 -05:00
Aleix Conchillo Flaqué
11aa9c9e68
update CHANGELOG, remove wait_for_all
2025-12-05 08:34:07 -08:00
Aleix Conchillo Flaqué
9f4680e9bd
Merge pull request #3190 from pipecat-ai/aleix/no-need-wait-for-all
...
LLMService: let's not introduce wait_for_all for now
2025-12-05 08:31:44 -08:00
Aleix Conchillo Flaqué
04443a3820
LLMService: let's not introduce wait_for_all for now
2025-12-05 08:26:04 -08:00
Mark Backman
1571cc58ac
Merge pull request #3192 from pipecat-ai/mb/cartesia-stt-timestamp
...
Add full transcript result for CartesiaSTTService
2025-12-05 10:37:06 -05:00
Mark Backman
dea80cf946
Add full transcript result for CartesiaSTTService
2025-12-05 10:25:46 -05:00
Mark Backman
91dec044c4
Merge pull request #3171 from LaurentMazare/gradium
...
Gradium integration.
2025-12-05 09:43:44 -05:00
laurent
8cf4267d87
Switch to a debug.
2025-12-05 15:37:17 +01:00
Mark Backman
0ee7cab6c6
Merge pull request #3184 from ashotbagh/feat/asyncai-multilingual-addons
...
Added new languages support for AsyncAI
2025-12-05 08:42:09 -05:00
Ashot
74c2039bfb
Updated changelog.
2025-12-05 16:54:38 +04:00
Ashot
66088837cd
Fixed defualt language issue in async tts
2025-12-05 16:51:05 +04:00
laurent
07ebf8534a
Add the example.
2025-12-05 10:51:22 +01:00
laurent
fce4cfba15
Changelog update.
2025-12-05 10:46:01 +01:00
laurent
af52833ca0
Update the readme and env.example.
2025-12-05 10:44:30 +01:00
laurent
9fdf756375
Fix.
2025-12-05 10:38:35 +01:00
laurent
283bbb385c
And remove the request-id.
2025-12-05 10:35:19 +01:00
laurent
8c6b2edb25
Various code review tweaks.
2025-12-05 10:33:48 +01:00
Laurent Mazare
6ab30f9b87
Apply suggestions from code review
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-12-05 10:25:47 +01:00
Aleix Conchillo Flaqué
3d93285bdf
Merge pull request #3176 from pipecat-ai/aleix/exception-filename-line-number
...
log file name and line number when exception occurs
2025-12-04 11:08:32 -08:00
Aleix Conchillo Flaqué
7261cd28f2
log file name and line number when exception occurs
2025-12-04 11:06:45 -08:00
vipyne
33eeb8ce44
Use _full_model_name in llm trace if available
2025-12-04 11:54:45 -06:00
vipyne
ebda94ca98
set full model name for base openai models
2025-12-04 11:54:45 -06:00
Mark Backman
40b17cff8f
Merge pull request #3186 from pipecat-ai/mb/11labs-fix-metrics-tracking
...
fix: ElevenLabsTTSService character usage metrics
2025-12-04 12:36:39 -05:00
marcus-daily
7ba0ebba11
Smart Turn analyzer now uses the full context of the turn rather than just the audio since VAD last triggered ( fixes #3094 )
2025-12-04 16:40:08 +00:00
Mark Backman
b39087027c
fix: ElevenLabsTTSService character usage metrics
2025-12-04 09:41:18 -05:00
Ashot
e65974c870
Added new languages support for AsyncAI
2025-12-04 16:15:28 +04:00
marcus-daily
b1e5d68d97
Updating changelog
2025-12-04 11:32:16 +00:00
marcus-daily
39bca074d7
Smart Turn v3.1
2025-12-04 11:32:16 +00:00
Aleix Conchillo Flaqué
b5e79f9dc5
Merge pull request #3181 from pipecat-ai/aleix/sync-to-utils-sync
...
move pipecat.sync to pipecat.utils.sync
2025-12-03 19:41:18 -08:00
Aleix Conchillo Flaqué
613b96819f
Merge pull request #3180 from pipecat-ai/aleix/deepgram-tts-service-fix
...
DeepgramTTSService: fix websocket header logging
2025-12-03 19:40:43 -08:00
Mark Backman
57c24670ea
Merge pull request #3132 from pipecat-ai/mb/normalize-llm-text-frame-output
...
Add split_text_by_spaces string util, normalize aggregator input
2025-12-03 22:05:14 -05:00
Mark Backman
d79dd94019
Make aggregate return an AsyncIterator, other clean up
2025-12-03 22:00:34 -05:00
Mark Backman
fa8e7458e1
Clean up
2025-12-03 22:00:04 -05:00
Mark Backman
4d66191963
fix: PatternPairAggregator to process patterns only once
2025-12-03 22:00:04 -05:00
Mark Backman
7e9d67002e
SkipTagsAggregator and PatternPairAggregator now subclass SimpleTextAggregator
2025-12-03 22:00:04 -05:00
Mark Backman
ffbb6e5937
Update SimpleTextAggregator to handle character by character input, use a buffer to handle ambiguous EOS scenarios, and add a flush method to all aggregators
2025-12-03 22:00:02 -05:00
Mark Backman
535b85cf90
Add split_text_by_spaces string util
2025-12-03 21:55:30 -05:00
Aleix Conchillo Flaqué
8dc9872ed5
deprecate pipecat.sync package
2025-12-03 18:44:41 -08:00
Aleix Conchillo Flaqué
f37a53cc25
utils(sync): move sync to utils.sync
2025-12-03 18:20:12 -08:00
Aleix Conchillo Flaqué
9cce28c64c
DeepgramTTSService: use websocket response headers for logging
2025-12-03 18:16:25 -08:00
Aleix Conchillo Flaqué
3ca94363ec
Merge pull request #3168 from pipecat-ai/aleix/dont-override-skip-tts
...
LLMTextFrame: don't override skip_tts
2025-12-03 18:15:50 -08:00
Rpcd
9dd882ecf8
Update src/pipecat/services/mcp_service.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-12-03 17:28:37 +00:00
Rpcd
0bbb14eb9b
Update src/pipecat/services/mcp_service.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-12-03 17:28:29 +00:00
Mark Backman
050f287ec4
Merge pull request #3072 from jjmaldonis/deepgram/add-deepgram-request-ids-to-debug-logs
...
deepgram: added request IDs to debug logs
2025-12-03 09:37:25 -05:00
Jason Maldonis
e6f5561785
updated changelog
2025-12-03 08:18:09 -06:00
Jason Maldonis
2df91f4b37
fixed linting
2025-12-03 08:09:16 -06:00
Jason Maldonis
7db49b9067
deepgram: added request IDs to debug logs
...
Deepgram request IDs are necessary for investigating behavior at the
request level. This commit adds DEBUG logs that print Deepgram request
IDs when using Deepgram's STT or TTS.
2025-12-03 08:09:13 -06:00
Vanessa Pyne
7c497bdc89
Merge pull request #3130 from pipecat-ai/vp-nvidia-docs
...
update nvidia services naming
2025-12-02 13:04:16 -06:00
vipyne
1aa4247d2b
remove nim from pyproject.toml
2025-12-02 12:55:13 -06:00
laurent
1ffa9ff51f
Gradium integration.
2025-12-02 13:34:51 +01:00
Rpcd
435b53f1a0
Update src/pipecat/services/mcp_service.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-12-02 09:22:08 +00:00
Rpcd
406bdfad0d
Update src/pipecat/services/mcp_service.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-12-02 09:21:59 +00:00
vipyne
acba544e6f
pr notes for nvidia service name change
2025-12-01 22:41:17 -06:00
vipyne
5d93c64ee5
typo fixes and uv.lock update
2025-12-01 22:41:17 -06:00
vipyne
de10bc8803
changelog for riva,nim -> nvidia name change
2025-12-01 22:41:17 -06:00
vipyne
36f5c1722d
deprecate riva and nim service paths in favor of nvidia
2025-12-01 22:41:17 -06:00
vipyne
a8280522e5
examples: rename nvidia foundational examples
2025-12-01 22:41:17 -06:00
vipyne
05d65dfdd3
Update NVIDIA NIM and Riva services to Nvidia
...
- pip install pipecat-ai[nim]
- pip install pipecat-ai[riva]
+ pip install pipecat-ai[nvidia]
and
- from pipecat.services.nim.llm import NimLLMService
+ from pipecat.services.nvidia.llm import NvidiaLLMService
- from pipecat.services.riva.stt import RivaSTTService
+ from pipecat.services.nvidia.stt import NvidiaSTTService
- from pipecat.services.riva.tts import RivaTTSService
+ from pipecat.services.nvidia.tts import NvidiaTTSService
2025-12-01 22:41:17 -06:00
Aleix Conchillo Flaqué
a3962e3b47
LLMTextFrame: don't override skip_tts
2025-12-01 18:37:07 -08:00
Aleix Conchillo Flaqué
cd231cf829
Merge pull request #3120 from pipecat-ai/aleix/function-calls-wait-for-all
...
allow waiting for all function calls to complete
2025-12-01 18:35:53 -08:00
Aleix Conchillo Flaqué
9fafc1692d
update uv.lock
2025-12-01 18:32:00 -08:00
Aleix Conchillo Flaqué
7648d0436c
examples(19): linting
2025-12-01 18:30:34 -08:00
Aleix Conchillo Flaqué
bff8747e38
LLMService: allow waiting for all function calls to complete
2025-12-01 18:30:25 -08:00
Mark Backman
d227c0c097
Merge pull request #3155 from pipecat-ai/mb/fix-sarvam-tts-not-flushing
...
fix: flush audio in SarvamTTSService
2025-12-01 17:22:33 -05:00
Mark Backman
9ccde60521
fix: flush audio in SarvamTTSService
2025-12-01 17:18:34 -05:00
Mark Backman
b84a40666c
Merge pull request #3156 from pipecat-ai/mb/deepgram-stt-stopped-frame
...
fix: DeepgramTTSService, let the base class push TTSStoppedFrame
2025-12-01 17:18:19 -05:00
Mark Backman
e72b135a4c
fix: DeepgramTTSService, let the base class push TTSStoppedFrame
2025-12-01 17:15:51 -05:00
Aleix Conchillo Flaqué
2235d8f5a2
CHANGELOG formatting
2025-12-01 10:24:42 -08:00
Mark Backman
6e20a50a4b
Merge pull request #3153 from pipecat-ai/mb/fix-aws-stt-region
...
fix: AWSTranscribeSTTService always set to us-east-1
2025-12-01 13:07:22 -05:00
Mark Backman
89d9ca045a
fix: AWSTranscribeSTTService always set to us-east-1
2025-12-01 13:02:08 -05:00
Mark Backman
4b95ee92eb
Merge pull request #3166 from pipecat-ai/mb/update-changelog-AWSBedrockAgentCoreProcessor
...
Retroactively add changelog to 0.0.96 for AWSBedrockAgentCoreProcessor
2025-12-01 11:51:47 -05:00
Mark Backman
d481ac6cc6
Retroactively add changelog to 0.0.96 for AWSBedrockAgentCoreProcessor
2025-12-01 11:49:00 -05:00
Mark Backman
e5a91296b5
Merge pull request #3162 from ai-coustics/add-stt-optimized-model
...
Add Quail STT as default model for `AICFilter`
2025-11-30 09:59:37 -05:00
Corvin Jaedicke
d8d10a0685
add changelog entry
2025-11-28 15:24:19 +01:00
Corvin Jaedicke
6dd9ed03b1
bump version to include new STT model, noise gate deprecation warning
2025-11-28 15:14:43 +01:00
Filipi da Silva Fuchter
d486c80804
Merge pull request #3151 from pipecat-ai/filipi/fix_runner_ice_servers
...
Fixing runner ICE servers to be compatible with what is expected by the mobile SDKs.
2025-11-27 10:24:02 -03:00
Filipi Fuchter
dedea7c420
Fixing runner ICE servers to be compatible with what is expected by the mobile SDKs.
2025-11-27 09:27:26 -03:00
Aleix Conchillo Flaqué
b78eb5de6b
Merge pull request #3148 from pipecat-ai/aleix/pipecat-0.0.96-update
...
update CHANGELOG for 0.0.96 with proper date
2025-11-26 17:21:31 -08:00
Aleix Conchillo Flaqué
95aa13beb1
update CHANGELOG for 0.0.96 with proper date
2025-11-26 17:16:54 -08:00
Mark Backman
88ce85342c
Merge pull request #3147 from pipecat-ai/mb/fix-sagemaker-error-handling
...
Fix error handling in DeepramSageMakerSTTService
2025-11-26 20:15:45 -05:00
Mark Backman
bedd40ae8b
Fix error handling in DeepramSageMakerSTTService
2025-11-26 20:12:31 -05:00
Mark Backman
fda327b3ee
Merge pull request #3146 from pipecat-ai/mb/fix-aws-bedrock-region
...
fix: AWSBedrockLLMService was always set to us-east-1
2025-11-26 19:56:09 -05:00
Mark Backman
ace95b6e6d
fix: AWSBedrockLLMService was always set to us-east-1
2025-11-26 19:52:04 -05:00
Aleix Conchillo Flaqué
26c5c28c5c
Merge pull request #3145 from pipecat-ai/aleix/simli-enable-logging-param
...
SimliVideoService: add enable_logging input parameter
2025-11-26 16:49:12 -08:00
Aleix Conchillo Flaqué
81f862749d
SimliVideoService: add enable_logging input parameter
2025-11-26 16:36:06 -08:00
Aleix Conchillo Flaqué
b8bf7b4132
Merge pull request #3143 from pipecat-ai/aleix/pipecat-0.0.96
...
update CHANGELOG for 0.0.96
2025-11-26 16:31:44 -08:00
Aleix Conchillo Flaqué
d90121ef3b
update CHANGELOG for 0.0.96
2025-11-26 15:30:06 -08:00
Filipi da Silva Fuchter
d0b7b4fb0a
Merge pull request #3144 from pipecat-ai/filipi/fix_flux_reconnection_issue
...
Fixed an issue with DeepgramFluxSTTService where it sometimes failed to reconnect.
2025-11-26 20:29:41 -03:00
Filipi Fuchter
4acc317923
Fixed an issue with DeepgramFluxSTTService where it sometimes failed to reconnect.
2025-11-26 20:23:03 -03:00
Filipi da Silva Fuchter
7caf5751ee
Merge pull request #3084 from pipecat-ai/filipi/improve_error_handler
...
Improving error handler.
2025-11-26 18:40:44 -03:00
Filipi Fuchter
1330ef3ad6
Enhanced error handling across the framework.
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-11-26 18:34:25 -03:00
Mark Backman
9efb21d61e
Merge pull request #3115 from pipecat-ai/mb/deepgram-websocket-tts
...
Update DeepgramTTSService to use Deepgram's Websocket TTS API
2025-11-26 13:30:52 -05:00
Mark Backman
6d93b8e9d8
Update DeepgramTTSService to use Deepgram's Websocket TTS API
2025-11-26 13:25:34 -05:00
Aleix Conchillo Flaqué
6f527e509e
update CHANGELOG with FishAudioTTSService s1 model update
2025-11-26 10:22:59 -08:00
Aleix Conchillo Flaqué
6cf1d0417e
Merge pull request #3136 from kcui5/patch-1
...
Update Fish Audio default model to s1
2025-11-26 10:19:26 -08:00
Mark Backman
19d8b0dfc2
Merge pull request #3011 from thsunkid/feat/add-cached-reasoning-tokens-metrics-to-opentel-spans
2025-11-26 07:45:33 -05:00
Kyle Cui
7fa0cbf2a9
Update Fish Audio default model to s1
...
Update default model from speech-1.5 to s1 for Fish Audio TTS service
2025-11-26 01:50:38 -08:00
Thu Nguyen
36c4bc2df2
Update changelog
2025-11-26 13:01:48 +07:00
Thu Nguyen
42be0183af
Merge branch 'main' into feat/add-cached-reasoning-tokens-metrics-to-opentel-spans
2025-11-26 12:59:43 +07:00
RuiDaniel
7961f8a664
same behaviour on error
2025-11-25 18:35:59 +00:00
RuiDaniel
4ca143e8af
add mcp filters to client
2025-11-25 18:27:22 +00:00
Mark Backman
2607699664
Merge pull request #3125 from pipecat-ai/mb/fix-sagemaker-imports
...
fix: remove stt_sagemaker import from deepgram/__init__.py
2025-11-24 21:31:31 -05:00
Mark Backman
47fa3b8556
Merge pull request #3108 from fbarril/livekit-transport-helper
...
add livekit helper
2025-11-24 20:13:13 -05:00
Mark Backman
fa0100c38b
fix: remove stt_sagemaker import from deepgram/__init__.py
2025-11-24 20:04:18 -05:00
kompfner
e5142c1210
Merge pull request #3113 from pipecat-ai/pk/agentcore-processor
...
Initial implementation of `AWSBedrockAgentCoreProcessor`
2025-11-24 19:10:44 -05:00
Paul Kompfner
5907b51c7d
In AWSBedrockAgentCoreProcessor use self.create_task()/self.cancel_task() instead of using asyncio directly.
2025-11-24 18:53:39 -05:00
Paul Kompfner
9e4ec4f7f3
Implement AWSBedrockAgentCoreProcessor
2025-11-24 18:53:35 -05:00
fbarril
e2161ea63d
add pyjwt as a livekit dependency
2025-11-24 23:30:11 +00:00
fbarril
7c81f66241
Merge remote-tracking branch 'origin/main' into livekit-transport-helper
...
# Conflicts:
# CHANGELOG.md
# uv.lock
2025-11-24 23:29:22 +00:00
fbarril
60da466379
add pyjwt as a livekit dependency
2025-11-24 23:27:32 +00:00
fbarril
12c29b71f3
add entry to CHANGELOG.md
2025-11-24 23:27:13 +00:00
Mark Backman
b52b108932
Merge pull request #3118 from pipecat-ai/mb/deepgram-stt-sagemaker
...
Add SageMaker BiDi client and DeepgramSageMakerSTTService
2025-11-24 16:47:25 -05:00
Mark Backman
a357ff0205
Alphabetize the project.optional-dependencies
2025-11-24 16:43:44 -05:00
Mark Backman
0ece8b5894
Add 07c Deepgram SageMaker example
2025-11-24 16:41:01 -05:00
Mark Backman
782b257bbb
Add DeepgramSageMakerSTTService
2025-11-24 16:41:01 -05:00
Mark Backman
ab8dcd6ede
Add SageMaker BiDi client
2025-11-24 16:41:00 -05:00
Mark Backman
012c2f7dde
Merge pull request #3106 from pipecat-ai/mb/update-11labs-realtime-stt
...
Fix sample_rate issue in ElevenLabsRealtimeSTTService, add timestamps…
2025-11-24 08:10:30 -05:00
Mark Backman
87fdd8f006
Fix MiniMax changelog entries
2025-11-24 08:07:20 -05:00
Mark Backman
7bdac02837
Fix sample_rate issue in ElevenLabsRealtimeSTTService, add timestamps and logging
2025-11-24 08:06:33 -05:00
Mark Backman
861567bc59
Merge pull request #3119 from pipecat-ai/aleix/changelog-formatting
...
format CHANGELOG
2025-11-24 08:05:11 -05:00
Aleix Conchillo Flaqué
d0ff43134a
format CHANGELOG
2025-11-23 17:48:57 -08:00
Dante Noguez
3458b74fc9
Fix 11labs realtime dynamic updates ( #3117 )
2025-11-22 10:02:37 -05:00
mattie ruth backman
a6202c4d1a
Fixed CHANGELOG post rebase
2025-11-21 17:16:10 -05:00
mattie ruth backman
3c3141796a
Overlooked Changelog updates
2025-11-21 17:16:10 -05:00
mattie ruth backman
8b8b57b09c
Introduced new bot-output RTVI event to provide...
...
a best effort version of the bot's output
- The `RTVIObserver` now emits `bot-output` messages based off
the new `AggregatedTextFrame`s (`bot-tts-text` and
`bot-llm-text` are still supported and generated, but
`bot-transcript` is now deprecated in lieu of this new, more
thorough, message).
- The new `RTVIBotOutputMessage` includes the fields:
- `spoken`: A boolean indicating whether the text was spoken by TTS
- `aggregated_by`: A string representing how the text was aggregated
("sentence", "word", "my custom aggregation")
- Introduced new fields to `RTVIObserver` to support the new
`bot-output` messaging:
- `bot_output_enabled`: Defaults to True. Set to false to disable
bot-output messages.
- `skip_aggregator_types`: Defaults to `None`. Set to a list of
strings that match aggregation types that should not be included
in bot-output messages. (Ex. `credit_card`)
2025-11-21 17:16:10 -05:00
mattie ruth backman
4f30a48ecd
Rime and Cartesia TTS Updates:
...
`CartesiaTTSService`:
- Modified use of custom default text_aggregator to avoid deprecation warnings and push users
towards use of transformers or the `LLMTextProcessor`
- Added convenience methods for taking advantage of Cartesia's SSML tags: spell, emotion,
pauses, volume, and speed.
`RimeTTSService`:
- Modified use of custom default text_aggregator to avoid deprecation warnings and push users
towards use of transformers or the `LLMTextProcessor`
- Added convenience methods for taking advantage of Rime's customization options: spell,
pauses, pronunciations, and inline speed control.
2025-11-21 17:16:10 -05:00
mattie ruth backman
ecbc41045c
Added ability to transform text just-in-time before it gets sent to the TTS
2025-11-21 17:16:10 -05:00
mattie ruth backman
e1528d0f0c
Added support to TTS services to skip sending text to the...
...
the actual TTS service to be spoken based on its aggregation type.
2025-11-21 17:16:10 -05:00
mattie ruth backman
6b6d760cf1
Introduced LLMTextProcessor and deprecatd custom text_aggregators in TTS
...
Introduced `LLMTextProcessor`: A new processor meant to allow customization for how
LLMTextFrames should be aggregated and considered. It's purpose is to turn
`LLMTextFrame`s into `AggregatedTextFrame`s. By default, a TTSService will still
aggregate `LLMTextFrame`s by sentence for the service to consume. However, if you
wish to override how the llm text is aggregated, you should no longer override the
TTS's internal text_aggregator, but instead, insert this processor between your LLM
and TTS in the pipeline.
2025-11-21 17:16:10 -05:00
mattie ruth backman
7a4372a909
Introduced a new AggregatedTextFrame Frame type that TTSTextFrame inherits from
...
This frame introduces an `aggregated_by` field to describe the type of text included
in the frame and allows unspoken groupings of text to be pushed through the pipeline
and treated similar to TTSTextFrames.
2025-11-21 17:16:10 -05:00
mattie ruth backman
0e820a01b9
Introduce append_to_context to TextFrames
...
Adding support for setting whether or not the text in the TextFrame
should be added to the LLM context (by the LLM assistant aggregator).
Defaults to `True`.
2025-11-21 17:16:10 -05:00
mattie ruth backman
24266c238f
Augmented PatternPairAggregator so that matched patterns can...
...
be treated as their own aggregation, taking advantage of the new
ability to assign a type to an aggregation
2025-11-21 17:16:10 -05:00
mattie ruth backman
dcc20f86e1
Updated the BaseTextAggregator to categorize aggregations
...
Modified the BaseTextAggregator type so that when text gets aggregated, metadata can
be associated with it. Currently, that just means a `type`, so that the aggregation
can be classified or described. Changes made to support this:
- **IMPORTANT**: Aggregators are now expected to strip leading/trailing white space
characters before returning their aggregation from `aggregation()` or `.text`. This
way all aggregators have a consistent contract allowing downstream use to know how
to stitch aggregations back together
- Introduced a new `Aggregation` dataclass to represent both the aggregated `text` and
a string identifying the `type` of aggregation (ex. "sentence", "word", "my custom
aggregation")
- **BREAKING**: `BaseTextAggregator.text` now returns an `Aggregation` (instead of `str`).
To update: `aggregated_text = myAggregator.text` -> `aggregated_text = myAggregator.text.text`
- **BREAKING**: `BaseTextAggregator.aggregate()` now returns `Optional[Aggregation]`
(instead of `Optional[str]`). To update:
```
aggregation = myAggregator.aggregate(text)
if (aggregation):
print(f"successfully aggregated text: {aggregation.text}") // instead of {aggregation}
```
- `SimpleTextAggregator`, `SkipTagsAggregator`, `PatternPairAggregator` updated to
produce/consume `Aggregation` objects.
- All uses of the above Aggregators have been updated accordingly.
2025-11-21 17:16:10 -05:00
fbarril
ec8964425a
add livekit helper
2025-11-21 00:27:57 +00:00
Vanessa Pyne
26918728df
Merge pull request #3096 from pipecat-ai/vp-minimax-2962-v2
...
minimax 2962 language updates
2025-11-20 10:41:35 -06:00
vipyne
954849379b
cleanup
2025-11-20 10:41:09 -06:00
vipyne
06542a2dbc
Update CHANGELOG
2025-11-20 10:41:09 -06:00
Vanessa Pyne
59d40eac45
Update src/pipecat/services/minimax/tts.py
...
Co-authored-by: Mark Backman <mark@daily.co >
add warning
2025-11-20 10:41:09 -06:00
vipyne
17cf6c56cf
minimax updates
...
some `debug`s -> `trace`s
add western US base_url to docs
ensure error_message is defined
add deprecation warning for `english_normalization` param
2025-11-20 10:41:09 -06:00
minimax
616e6ba351
docs(minimax): add API endpoint comment for west US region
2025-11-20 10:41:08 -06:00
minimax
f3cb5e0106
feat(minimax): comprehensive updates to TTS service
...
- Add support for speech-2.6-hd and speech-2.6-turbo models
- Add 16 new languages (total 40): Afrikaans, Bulgarian, Catalan, Danish, Persian, Filipino, Hebrew, Croatian, Hungarian, Malay, Norwegian, Nynorsk, Slovak, Slovenian, Swedish, Tamil
- Add new emotions: calm and fluent
- Add new parameters: text_normalization (renamed from english_normalization), latex_read, force_cbr, exclude_aggregated_audio, subtitle_enable, subtitle_type
- Extract trace_id from response headers for all requests
- Improve error handling for non-streaming error responses
- Add detailed extra_info logging (audio_length, audio_size, usage_characters, word_count)
- Add validation warnings for language/model compatibility
- Fix silent error issue where HTTP 200 responses with errors were ignored
BREAKING CHANGE: Renamed parameter english_normalization to text_normalization
2025-11-20 10:41:08 -06:00
Aleix Conchillo Flaqué
c89f230c99
fix CHANGELOG
2025-11-20 08:40:30 -08:00
Aleix Conchillo Flaqué
69cd5716cd
Merge pull request #3102 from pipecat-ai/aleix/daily-python-0.22.0
...
pyproject: update daily-python to 0.22.0
2025-11-20 08:35:39 -08:00
Mark Backman
ab58f72322
Merge pull request #3101 from hwuiwon/hw/inworld-talking-speed
...
feat: Add speaking rate control to Inworld TTS service.
2025-11-20 09:50:55 -05:00
Hwuiwon Kim
ead361f665
fix
2025-11-20 07:45:13 -05:00
Aleix Conchillo Flaqué
fa6b8851ed
pyproject: update daily-python to 0.22.0
2025-11-19 21:56:38 -08:00
Hwuiwon Kim
1cc69d475d
feat: Add speaking rate control to Inworld TTS service & fix param cases
2025-11-19 22:57:53 -05:00
Mark Backman
51bdd8b728
Merge pull request #3097 from hwuiwon/fix-typo
...
Fix typo in STT event handler documentation
2025-11-19 17:10:32 -05:00
Hwuiwon Kim
30ff488714
Fix typo in event handler documentation
2025-11-19 17:04:07 -05:00
Gokul Js
0707141998
fix
2025-11-20 01:36:35 +05:30
Gokul Js
cc861d6b70
Refactor WebSocket connection code in RimeNonJsonTTSService for improved readability
2025-11-19 22:46:36 +05:30
Gokul Js
de4e9c54f6
Increase WebSocket max size limit in RimeNonJsonTTSService to enhance data handling capacity
2025-11-19 22:44:50 +05:30
Gokul Js
da671cd232
Fix whitespace inconsistency in audio flushing method of RimeNonJsonTTSService
2025-11-19 22:19:36 +05:30
Gokul Js
1d9696e614
Add audio flushing after sending text in RimeNonJsonTTSService
...
This update ensures that audio is flushed immediately after sending bare text to the WebSocket, improving the responsiveness of the Text-to-Speech service.
2025-11-19 22:19:00 +05:30
Vanessa Pyne
510f3df6b7
Merge pull request #3091 from pipecat-ai/vp-fix-mcp-examples
...
update MCP foundational examples
2025-11-19 10:35:08 -06:00
vipyne
68292bd75f
rename MCP foundational examples
2025-11-19 10:34:13 -06:00
vipyne
42423bff41
update MCP foundational examples
2025-11-19 10:29:18 -06:00
Aleix Conchillo Flaqué
c3d2a25229
Merge pull request #3082 from pipecat-ai/aleix/pipecat-0.0.95
...
update CHANGELOG for 0.0.95
2025-11-18 21:17:07 -08:00
Aleix Conchillo Flaqué
cf1a9c1548
update CHANGELOG for 0.0.95
2025-11-18 21:14:27 -08:00
Aleix Conchillo Flaqué
51ba245e10
scripts(evals): fix EVAL_CONVERSATION/EVAL_WEATHER eval
2025-11-18 21:14:27 -08:00
Aleix Conchillo Flaqué
39b4e61837
SimliVideoService: fix connection issue
2025-11-18 19:41:47 -08:00
Aleix Conchillo Flaqué
ceaf53fdb0
LLMContext: async create_image_message/create_audio_message fixes
2025-11-18 19:41:13 -08:00
Aleix Conchillo Flaqué
f93276c64f
Merge pull request #3090 from pipecat-ai/revert_function_calling_pr
...
Reverting: Ensure that the function call results respect the previous LLM context
2025-11-18 19:40:58 -08:00
Mark Backman
62a0f0c0f5
Merge pull request #3070 from ivaaan/hume-timestamps
2025-11-18 19:56:20 -05:00
Filipi Fuchter
793aca6b8b
Revert "Ensure that the function call results respect the previous LLM context."
...
This reverts commit a510b276e6 .
2025-11-18 21:38:49 -03:00
Filipi Fuchter
1fcaf3a4bf
Revert "Searching in both _function_calls_context_messages and context messages when updating the result."
...
This reverts commit fccc91e923 .
2025-11-18 21:38:49 -03:00
Gokul Js
afeef94900
Remove unused audio_format parameter from extra settings in RimeNonJsonTTSService
2025-11-19 04:55:14 +05:30
Gokul Js
860d9c4f29
Refactor _update_settings method in RimeNonJsonTTSService for improved readability and maintainability
2025-11-19 04:53:27 +05:30
Gokul Js
4393191166
Add method to update settings in RimeNonJsonTTSService
2025-11-19 04:53:21 +05:30
Gokul Js
88daad524e
Refactor whitespace in RimeNonJsonTTSService to improve code readability
2025-11-19 03:43:49 +05:30
Gokul Js
66c58f8155
fix
2025-11-19 03:40:59 +05:30
Gokul Js
7bbb5be910
format fix
2025-11-19 03:35:54 +05:30
Gokul Js
0dcb65bd56
add run tts methos for rimeNonJsonTTs
2025-11-19 03:34:58 +05:30
Gokul Js
2784b0f438
Add RimeNonJsonTTSService for non-JSON WebSocket API support
...
This commit introduces the RimeNonJsonTTSService class, enabling Text-to-Speech synthesis over WebSocket endpoints that require plain text messages. The service includes configuration parameters for language, segmentation, and audio settings, and handles WebSocket connections for raw audio byte transmission. Limitations include the lack of support for word-level timestamps and context IDs.
2025-11-19 03:24:57 +05:30
ivaaan
6484855139
fix changelog
2025-11-18 21:47:46 +01:00
ivaaan
771469b834
fix changelog
2025-11-18 21:39:29 +01:00
kompfner
a60618b0ca
Merge pull request #3080 from pipecat-ai/pk/assistant-aggregator-handles-mixed-includes-inter-frame-spaces-text
...
`LLMAssistantAggregator` now properly aggregates text that might be a…
2025-11-18 15:24:27 -05:00
Paul Kompfner
3d21faaac2
LLMAssistantAggregator now properly aggregates text that might be a mix of includes_inter_frame_spaces=True and includes_inter_frame_spaces=False frames
2025-11-18 15:12:25 -05:00
ivaaan
f325eeb95b
rm TranscriptProcessor 2
2025-11-18 20:41:10 +01:00
ivaaan
4c3fd42b1c
fix changelog
2025-11-18 20:36:45 +01:00
ivaaan
c2309efd7e
rm TranscriptProcessor
2025-11-18 20:35:09 +01:00
Ivan A
4ae1819645
Update src/pipecat/services/hume/tts.py
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-11-18 20:30:44 +01:00
Ivan A
a38f208135
Update examples/foundational/07ae-interruptible-hume.py
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-11-18 20:30:28 +01:00
Mark Backman
d1eb837890
Merge pull request #3081 from pipecat-ai/mb/fix-30-tts-text-frame-log
...
Fix foundational 30 example to output TTSTextFrames synced to audio
2025-11-18 14:10:56 -05:00
Mark Backman
153201542b
Fix foundational 30 example to output TTSTextFrames synced to audio
2025-11-18 13:29:06 -05:00
Filipi da Silva Fuchter
9137e50043
Merge pull request #3053 from pipecat-ai/filipi/function_calls
...
Ensure that the function call results respect the previous LLM context.
2025-11-18 14:59:01 -03:00
Ivan A
8dbe119a73
Merge branch 'main' into hume-timestamps
2025-11-18 18:38:24 +01:00
ivaaan
26f96d0be8
upd example
2025-11-18 18:31:38 +01:00
ivaaan
9944e6faf0
upd service based on Mark's suggestions
2025-11-18 18:25:53 +01:00
Aleix Conchillo Flaqué
c1573c1f76
Merge pull request #3078 from pipecat-ai/aleix/llm-context-create-image-audio-async
...
LLMContext: create_image_message/create_audio_message are now async
2025-11-18 09:06:51 -08:00
Aleix Conchillo Flaqué
9f45ad4d2e
LLMContext: create_image_message/create_audio_message are now async
2025-11-18 09:04:40 -08:00
Filipi Fuchter
fccc91e923
Searching in both _function_calls_context_messages and context messages when updating the result.
2025-11-18 11:50:28 -03:00
Filipi Fuchter
a510b276e6
Ensure that the function call results respect the previous LLM context.
2025-11-18 11:37:57 -03:00
Mark Backman
6481094638
Merge pull request #3058 from pipecat-ai/mb/add-camera-screen-support-smallwebrtc
...
Add camera and screen capture support to dev runner for SmallWebRTC
2025-11-18 09:22:36 -05:00
Mark Backman
3132e12265
Add camera and screen capture support to dev runner for SmallWebRTC
2025-11-18 09:19:13 -05:00
Aleix Conchillo Flaqué
12af3f79d0
Merge pull request #3060 from pipecat-ai/aleix/consumer-queue-frames
...
ConsumerProcessor: queue frames internally instead of pushing them
2025-11-18 00:54:18 -08:00
Aleix Conchillo Flaqué
4835617b16
ConsumerProcessor: queue frames internally instead of pushing them
2025-11-17 23:52:09 -08:00
Aleix Conchillo Flaqué
9283108240
Merge pull request #3073 from pipecat-ai/aleix/base-text-filter-only-filter
...
BaseTextFilter: only require subclasses to implement filter()
2025-11-17 23:29:26 -08:00
kompfner
515eaeeb1a
Merge pull request #3074 from pipecat-ai/pk/tweak-moondream-example
...
Update Moondream example so that Moondream service output makes it in…
2025-11-17 16:52:18 -05:00
Paul Kompfner
5095fc6a64
Update Moondream example so that Moondream service output makes it into the context, even if the TTS service is disabled
2025-11-17 15:16:19 -05:00
Aleix Conchillo Flaqué
7eedb33d50
BaseTextFilter: only require subclasses to implement filter()
2025-11-17 11:23:47 -08:00
Filipi da Silva Fuchter
47f78df497
Merge pull request #3071 from pipecat-ai/filipi/small_webrtc_custom_data
...
Passing the custom request_data to the SmallWebRTCRunnerArguments body.
2025-11-17 15:50:11 -03:00
Filipi Fuchter
74154b26a2
Mentioning the SmallWebRTCTransport fix in the readme.
2025-11-17 15:39:07 -03:00
Filipi Fuchter
0c3c26b7b8
Passing the custom request_data to the SmallWebRTCRunnerArguments body.
2025-11-17 15:20:09 -03:00
kompfner
64417ef4ff
Merge pull request #3061 from pipecat-ai/pk/greatly-simplify-inter-frame-spaces-logic
...
D'oh! My TTS "inter-frame-spaces" logic was *way* overcomplicated (an…
2025-11-17 10:47:56 -05:00
Paul Kompfner
f3b254e335
D'oh! My TTS "inter-frame-spaces" logic was *way* overcomplicated (and fundamentally mistaken, though it happened to work)
...
Now:
- For TTS word-by-word output and `TTSSpeakFrames`: `TTSTextFrame`s' have `includes_inter_frame_spaces=False`.
- For all other TTS output: `TTSTextFrame` pass through the received text frames' `includes_inter_frame_spaces` value. So far, this value has always been `True`: LLMs send text chunks already containing all necessary spaces.
- `LLMTextFrame`s set `includes_inter_frame_spaces=False` at init time, per the aforementioned assumption.
2025-11-17 10:14:28 -05:00
Filipi da Silva Fuchter
f27119a712
Merge pull request #3069 from pipecat-ai/filipi/fix_riva
...
Fixing RivaTTSService error handler.
2025-11-17 11:48:15 -03:00
ivaaan
2a51d0f1e5
add changelog
2025-11-17 15:20:06 +01:00
ivaaan
9156e21727
fix formatting
2025-11-17 14:00:03 +01:00
Filipi da Silva Fuchter
a5145be16e
Merge pull request #3038 from pipecat-ai/filipi/flux_improvements
...
Deepgram Flux improvements
2025-11-17 09:57:43 -03:00
Filipi Fuchter
b104a59b10
Mentioning the Deepgram Flux improvements in the changelog.
2025-11-17 09:54:39 -03:00
Filipi Fuchter
04dbbabc03
Introduced a minimum confidence parameter in DeepgramFluxSTTService to avoid generating transcriptions below a defined threshold.
2025-11-17 09:54:30 -03:00
Filipi Fuchter
19cc0177b8
Refactored DeepgramFluxSTTService to automatically reconnect if sending a message fails.
2025-11-17 09:54:20 -03:00
Filipi Fuchter
77cd106795
Extracted the logic for retrying connections, and create a new send_with_retry method inside WebSocketService.
2025-11-17 09:54:08 -03:00
ivaaan
71869a116d
fix errors
2025-11-17 13:51:04 +01:00
ivaaan
2f2bde9856
add timestamps to example
2025-11-17 13:40:03 +01:00
ivaaan
7de8838deb
add word-level timestamp support to Hume service
2025-11-17 13:25:12 +01:00
Filipi Fuchter
9bf88bbf14
Fixing RivaTTSService error handler.
2025-11-17 07:43:30 -03:00
Mark Backman
35ff44b799
Merge pull request #3059 from pipecat-ai/mb/remove-llm-tracing-fallback
2025-11-14 14:07:40 -05:00
Angad Singh
d1116d149e
feat: Add ErrorFrame emission to TTS/STT services for pipeline error detection ( #2881 )
...
* feat: Add ErrorFrame emission to TTS/STT services for pipeline error detection
- Add ErrorFrame emission to all major TTS/STT services during initialization and runtime failures
- Services updated: Cartesia, ElevenLabs, Deepgram, AssemblyAI, Rime, Azure
- ErrorFrame objects emitted with fatal=False for graceful degradation
- Enables on_pipeline_error event handler to detect service failures programmatically
- Add comprehensive pytest test suite to verify ErrorFrame emission
- Fixes issue where services failed gracefully but didn't emit ErrorFrame objects
This allows developers to implement real-time error monitoring and alerting
using the on_pipeline_error event handler introduced in v0.0.90.
* Update STT and TTS services to use consistent error handling pattern
- Improves error handling consistency across all services
* Add changelog entry for STT/TTS error handling improvements
* Linting issues Resolved
* Azure STT ErrorFrames added with consistent patterns
* Cartesia STT and Deepgram STT; additional fixes made
* Removed Fatal Flags across services, removed duplication
* Moving the changelog entry to the correct place.
* Refactoring some classes to use yield instead of push_error directly.
* Fixing ruff format.
---------
Co-authored-by: Filipi Fuchter <filipi87@gmail.com >
2025-11-14 15:03:05 -03:00
Mark Backman
d01876ee60
Remove fallbacks in traced_llm
2025-11-14 12:13:49 -05:00
Mark Backman
74a0e8c88d
Merge pull request #3050 from ai-coustics/aic-vad-analyzer
...
feat(ai-coustics): add ai-coustics integrated VAD
2025-11-14 08:11:15 -05:00
Corvin Jaedicke
fbbad27d37
add changelog info
2025-11-14 13:30:06 +01:00
kompfner
e83ac82bf3
Merge pull request #3042 from pipecat-ai/pk/follow-up-inter-frame-spaces
...
Follow-up to #3041
2025-11-13 11:03:06 -05:00
Mark Backman
d78d38ce44
Merge pull request #3039 from pipecat-ai/mb/update-google-gemini-tts
...
Update GeminiTTSService for streaming, other Google TTS improvements
2025-11-13 10:33:46 -05:00
Mark Backman
edbf96b3c5
Update GeminiTTSService for streaming, other Google TTS improvements
2025-11-13 10:22:34 -05:00
Paul Kompfner
8851d18f92
Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.
2025-11-13 10:02:33 -05:00
Mark Backman
d823a3edec
Merge pull request #3040 from pipecat-ai/mb/11labs-realtime-stt
...
Add ElevenLabsRealtimeSTTService
2025-11-13 09:53:34 -05:00
Mark Backman
0e37658f8d
Add ElevenLabsRealtimeSTTService
2025-11-13 09:49:05 -05:00
Corvin Jaedicke
2fab3e2286
fix formatting
2025-11-13 14:39:26 +01:00
Corvin Jaedicke
a7b2052b38
add ai-coustics VAD
2025-11-13 14:20:35 +01:00
Mark Backman
6d0e99c3b8
Merge pull request #3044 from rimelabs/rime-hin-lanaguge-support
...
Add support for Hindi language in RIme TTS service
2025-11-12 21:13:01 -05:00
gokuljs
fe25465987
changelog update
2025-11-13 07:16:36 +05:30
gokuljs
498e9ca4f6
Add support for Hindi language in RIme TTS service
2025-11-13 04:33:22 +05:30
Paul Kompfner
1802f949ef
Fix an issue with some examples where punctuation was missing from the LLM output, by tweaking the LLM prompt.
2025-11-12 17:12:03 -05:00
Paul Kompfner
1ad6405ebb
Override includes_inter_frame_spaces in:
...
- `GoogleHttpTTSService`
- `OpenAITTSService`
The reason I skipped this work in an earlier PR was because these services seemed to be emitting long, punctuation-free text frames. It turns out that the issue was with the LLM prompt, though, resulting in the LLM nondeterministically excluding all punctuation. An upcoming commit will address that prompt issue.
2025-11-12 17:07:43 -05:00
kompfner
4c25555396
Merge pull request #3041 from pipecat-ai/pk/apply-includes-inter-frame-spaces-wherever-necessary
...
Apply `includes_inter_frame_spaces = True` in all LLM and TTS service…
2025-11-12 16:49:14 -05:00
Paul Kompfner
5222ff99de
Apply includes_inter_frame_spaces = True in all LLM and TTS services that need it.
...
Note that for `LLMTextFrame`s, the right behavior is pretty much always `includes_inter_frame_spaces = True`. I decided *not* to go ahead and make that the default for `LLMTextFrame`s, though, simply to not introduce a subtle behavior change for creative/unexpected use-cases that were relying on text in hand-crafted `LLMTextFrame`s being handled a certain way. Ditto for `TTSTextFrame`s.
Also, fix an issue in `NeuphonicTTSService` where it wasn't pushing `TTSTextFrame`s.
Also, fix the broken `SarvamHttpTTSService` example.
Also, add a couple of missing examples.
2025-11-12 15:10:11 -05:00
Mark Backman
203a627707
Merge pull request #3028 from sam-s10s/fix/smx-tts-retry
...
SpeechmaticsTTS - Support for retry when 503 error to TTS API.
2025-11-12 09:26:07 -05:00
James Hush
2006a64def
Fix Langfuse tracing for GoogleLLMService with universal LLMContext ( #3025 )
...
* Fix Langfuse tracing for GoogleLLMService with universal LLMContext
- Fixed issue where input appeared as null in Langfuse dashboard for GoogleLLMService
- Added fallback to use adapter's get_messages_for_logging() for universal LLMContext
- Ensures proper message format conversion for Google/Gemini services
- Handles system message conversion to system_instruction format
- Also fixes serialization of empty message lists ([] now serializes correctly)
This fix ensures Langfuse tracing works correctly for Google services using
both OpenAILLMContext/GoogleLLMContext and the universal LLMContext.
* Add unit tests for Langfuse tracing with GoogleLLMService
- Test that tracing correctly captures messages with universal LLMContext
- Test that empty message lists are properly serialized
- Test that adapter's get_messages_for_logging is used instead of context method
- All tests verify that input is correctly added to Langfuse spans
* Fix test mocking to patch opentelemetry.trace.get_tracer correctly
The tests were failing in CI because they were trying to patch
'pipecat.utils.tracing.service_decorators.trace' which doesn't exist as
an attribute. The trace module is imported from opentelemetry, so we need
to patch 'opentelemetry.trace.get_tracer' instead.
* Skip tracing tests when opentelemetry is not installed
The tracing dependencies (opentelemetry) are optional in Pipecat and not
installed in the CI environment. Added a skipif marker to skip these tests
when opentelemetry is not available, preventing CI failures while still
allowing the tests to run when tracing dependencies are installed locally.
* Install tracing dependencies in GitHub Actions CI
Instead of skipping the tracing tests, install the 'tracing' extra
(opentelemetry) in the CI environment so the tests can run properly.
Removed the skipif condition from the tests since opentelemetry will
now be available in CI.
* Use the context type to determine which messages to use, fix tool_count and tools (#3032 )
---------
Co-authored-by: Mark Backman <mark@daily.co >
2025-11-12 14:58:00 +01:00
Corvin Jaedicke
3c76917c1e
use async process function
2025-11-12 13:48:22 +01:00
Filipi da Silva Fuchter
eb36a1bc91
Merge pull request #3033 from pipecat-ai/filipi/deepgram_flux_urlencode_changelog
...
Mentioning DeepgramFluxSTTService URL encode fix in changelog.
2025-11-11 17:29:07 -03:00
Filipi Fuchter
fff8aac18c
Mention DeepgramFluxSTTService URL encode fix in changelog.
2025-11-11 17:25:40 -03:00
Filipi da Silva Fuchter
ec4bd8db10
Merge pull request #3014 from julienvantyghem/fix/deepgramflux-keyterm-encoding
...
fix(deepgram-flux): urlencode keyterm and tag parameters
2025-11-11 17:24:17 -03:00
Filipi da Silva Fuchter
4cc298d616
Merge pull request #3029 from pipecat-ai/filipi/heygen_keep_alive
...
Preventing HeyGenVideoService from disconnecting.
2025-11-11 15:25:43 -03:00
Sam Sykes
8d21b54ef3
Revert to ErrorFrame.
2025-11-11 18:24:08 +00:00
Sam Sykes
217d7e9953
Fix for max attempts.
2025-11-11 18:05:06 +00:00
Sam Sykes
41cf9adef4
Updated for max retries.
2025-11-11 18:00:27 +00:00
Sam Sykes
501744d7da
Update CHANGELOG.
2025-11-11 17:53:31 +00:00
Sam Sykes
60bc77c795
Update debugging messages.
2025-11-11 17:50:06 +00:00
Sam Sykes
0febfc62ec
Updated to use backoff utility function.
2025-11-11 17:45:22 +00:00
Filipi Fuchter
b76b25a6e1
Mentioning the HeyGen fix in the changelog.
2025-11-11 11:58:31 -03:00
Filipi Fuchter
62caadfc7c
Preventing HeyGenVideoService from disconnecting.
2025-11-11 11:37:46 -03:00
Sam Sykes
41ac43cf71
updated docs
2025-11-11 13:56:45 +00:00
Sam Sykes
adf5198423
Support for retry when 503 error to TTS API.
2025-11-11 13:49:14 +00:00
Mark Backman
54e8d29615
Merge pull request #3022 from pipecat-ai/mb/changelog-0.0.94
...
Prep for 0.0.94 hotfix
2025-11-10 16:52:38 -05:00
Mark Backman
ee494918a9
Prep for 0.0.94 hotfix
2025-11-10 16:50:58 -05:00
Mark Backman
aa8a50bc61
Merge pull request #3015 from pipecat-ai/mb/deprecate-krisp
...
Deprecate KrispFilter
2025-11-10 16:38:06 -05:00
Mark Backman
20857ac19a
Merge pull request #3010 from pipecat-ai/mb/gemini-live-ar-xa
...
Add ar-XA language code for Gemini Live
2025-11-10 16:36:33 -05:00
Mark Backman
421a1b5389
Merge pull request #3021 from pipecat-ai/mb/add-sarvam-stt-readme
...
Add Sarvam STT to README list
2025-11-10 16:36:03 -05:00
Mark Backman
8dd45af5b7
Deprecate KrispFilter
2025-11-10 16:35:11 -05:00
kompfner
66c903276a
Merge pull request #3020 from pipecat-ai/pk/make-explicit-adding-spaces-when-concatenating-tts-text
...
Make the mechanism of adding spaces when concatenating TTS (or speech…
2025-11-10 14:34:10 -05:00
Mark Backman
588dcf2ab9
Add Sarvam STT to README list
2025-11-10 14:29:54 -05:00
Paul Kompfner
913194844e
Make the mechanism of adding spaces when concatenating TTS (or speech-to-speech LLM) output text explicit and deterministic, rather than heuristic-based.
...
This fixes a bug where spaces were sometimes missing from assistant messages in context.
2025-11-10 14:22:32 -05:00
Vanessa Pyne
c2ce143e6c
Merge pull request #3017 from pipecat-ai/vp-rm-livekit-serializer
...
remove LivekitFrameSerializer
2025-11-10 11:56:47 -06:00
vipyne
c1c7a561ed
remove LivekitFrameSerializer
2025-11-10 11:06:12 -06:00
kompfner
05311dcfbf
Merge pull request #3016 from pipecat-ai/mb/revert-concat-aggregated-text
...
Revert "Merge pull request #3004 from pipecat-ai/mb/improve-concat-ag…
2025-11-10 10:49:38 -05:00
Mark Backman
2300941bb8
Revert "Merge pull request #3004 from pipecat-ai/mb/improve-concat-aggregated-text"
...
This reverts commit 5e7f59a0b0 , reversing
changes made to 2ad4122b77 .
2025-11-10 09:58:19 -05:00
Julien Vantyghem
c38055dbdd
fix(deepgram-flux): urlencode keyterm and tag parameters
2025-11-09 19:17:19 +01:00
Thu Nguyen
35593b8574
Add cached and reasoning token metrics to OpenTelemetry spans
2025-11-09 00:38:30 +07:00
Mark Backman
0df75b0915
Add ar-XA language code for Gemini Live
2025-11-08 08:24:55 -05:00
Aleix Conchillo Flaqué
16e2d5b998
Merge pull request #3007 from pipecat-ai/aleix/pipecat-0.0.93
...
update CHANGELOG for 0.0.93
2025-11-07 13:25:25 -08:00
Aleix Conchillo Flaqué
4cf9e1409e
update CHANGELOG for 0.0.93
2025-11-07 13:17:44 -08:00
Aleix Conchillo Flaqué
0ed430e7e2
examples(foundational): use DeepgramSTTService in 07
2025-11-07 11:34:11 -08:00
Aleix Conchillo Flaqué
342a8b121b
pyproject: update simli to 0.1.25
2025-11-07 11:30:41 -08:00
Aleix Conchillo Flaqué
5729722dcd
SimliVideoService: check exception initializing simli client
2025-11-07 11:30:41 -08:00
Aleix Conchillo Flaqué
38aac44a1e
scripts(evals): 26c should be a camera eval
2025-11-07 11:30:41 -08:00
Aleix Conchillo Flaqué
4f1468e0fa
scripts(evals): improve eval prompt
2025-11-07 10:05:46 -08:00
Aleix Conchillo Flaqué
9b1192ca9b
Merge pull request #3001 from pipecat-ai/pk/openai-realtime-toolsschema-support
...
Added support for passing in a `ToolsSchema` in lieu of a list of pro…
2025-11-07 09:37:43 -08:00
Mark Backman
5e7f59a0b0
Merge pull request #3004 from pipecat-ai/mb/improve-concat-aggregated-text
...
Improve concatenate_aggregated_text string utility
2025-11-07 12:37:12 -05:00
Aleix Conchillo Flaqué
2ad4122b77
Merge pull request #3006 from pipecat-ai/aleix/vision-image-backwards-compatibility
...
restore vision/image backwards compatibility
2025-11-07 09:19:38 -08:00
Mark Backman
5950f734f5
Merge pull request #3002 from pipecat-ai/mb/clarify-model-openai-realtime
...
Clarify how to set model in OpenAIRealtimeLLMService
2025-11-07 12:05:10 -05:00
Aleix Conchillo Flaqué
8d0364b630
restore vision/image backwards compatibility
2025-11-07 08:53:58 -08:00
kompfner
bfe031604a
Merge pull request #3005 from pipecat-ai/pk/add-missing-comments
...
Add missing explanatory comments to AWS and Anthropic that are presen…
2025-11-07 11:50:41 -05:00
kompfner
9bfde61183
Merge pull request #3003 from pipecat-ai/pk/fix-deprecation-warning-always-printed-on-set-bot-ready
...
Fix a deprecation warning printed every time `RTVIProcessor.set_bot_r…
2025-11-07 11:50:30 -05:00
Paul Kompfner
cb40a39a01
Add missing explanatory comments to AWS and Anthropic that are present in the other LLM services
2025-11-07 11:44:44 -05:00
Mark Backman
03001f8047
Update TranscriptProcessor unit tests
2025-11-07 11:44:04 -05:00
Paul Kompfner
10f1c314b6
Fix a deprecation warning printed every time RTVIProcessor.set_bot_ready() is called
2025-11-07 11:27:58 -05:00
Mark Backman
4d1d6465fc
Support OpenAI Realtime and Gemini Live single word edge cases in concatenate_aggregated_text
2025-11-07 11:26:38 -05:00
Paul Kompfner
359d220162
Document a OpenAIRealtimeLLMService gotcha in an example.
2025-11-07 10:32:27 -05:00
Mark Backman
6feecf05f7
Merge pull request #2994 from Toprak2/patch-1
...
Fix incorrect docstring in FrameProcessorQueue.__init__
2025-11-07 10:21:11 -05:00
Paul Kompfner
c3306bb4f2
Support for passing in a ToolsSchema in lieu of a list of provider-specific dicts when updating OpenAIRealtimeLLMService using LLMUpdateSettingsFrame.
2025-11-07 10:18:29 -05:00
Mark Backman
07a4aae248
Clarify how to set model in OpenAIRealtimeLLMService
2025-11-07 09:58:12 -05:00
Paul Kompfner
925a6cc2ef
Added support for passing in a ToolsSchema in lieu of a list of provider-specific dicts when initializing OpenAIRealtimeLLMService.
...
I chose to go the somewhat hacky route of adding the `ToolsSchema` support into the `events.SessionProperties` model itself—even though we should never serialize that type when creating events—because the alternative seemed to be to create a new type for `OpenAIRealtimeLLMService` initialization parameters and then we'd have to contend with backward compatibility, which seemed like a bigger headache.
2025-11-07 09:50:26 -05:00
Mark Backman
613ad74103
Merge pull request #3000 from pipecat-ai/mb/clarify-openai-realtime-model-docs
2025-11-07 06:36:30 -05:00
Muhammed Ali Toprak
2ab6b71890
Shorten docstring for clarity
2025-11-07 11:24:06 +03:00
Toprak2
c2bd8d22a0
Merge branch 'pipecat-ai:main' into patch-1
2025-11-07 11:19:08 +03:00
Mark Backman
eda12f56e6
Add clarifying documentation about OpenAI Realtime model use
2025-11-06 19:42:35 -05:00
Aleix Conchillo Flaqué
3daa1b7850
Merge pull request #2998 from pipecat-ai/aleix/transport-params-audio-out-end-silence-secs
...
BaseOutputTransport: send silence when EndFrame is received
2025-11-06 12:18:33 -08:00
Aleix Conchillo Flaqué
4c8c44ecc3
BaseOutputTransport: send silence when EndFrame is received
2025-11-06 12:16:05 -08:00
Aleix Conchillo Flaqué
8c34e1efba
Merge pull request #2996 from pipecat-ai/aleix/broadcast-frame
...
FrameProcessor: add new broadcast_frame() method
2025-11-06 12:13:15 -08:00
Aleix Conchillo Flaqué
f6916428b1
FrameProcessor: add new broadcast_frame() method
2025-11-06 12:11:48 -08:00
Marcus
a14d00b806
Improved LocalSmartTurnAnalyzerV3 performance on systems with a low CPU count ( #2982 )
2025-11-06 14:42:05 -05:00
Mark Backman
927cf751c0
Merge pull request #2997 from pipecat-ai/mb/google-stt-409
...
GoogleSTTService: Add more robust handling of 409 errors
2025-11-06 14:39:51 -05:00
Mark Backman
1fb6d6bd23
GoogleSTTService: Add more robust handling of 409 errors
2025-11-06 14:35:53 -05:00
Mark Backman
94a3306679
Merge pull request #2995 from pipecat-ai/mb/fix-stt-mute-filter-stt-muting
...
fix: STTMuteFilter no longer sends STTMuteFrame
2025-11-06 14:35:07 -05:00
Mark Backman
16bd1fe32d
Merge pull request #2984 from pipecat-ai/mb/11labs-pronunciation-dictionary
...
Add ElevenLabs pronunciation dictionary support
2025-11-06 14:23:49 -05:00
Mark Backman
58b552171d
Add pronunciation_dictionary_locators to ElevenLabs TTS Services
2025-11-06 14:13:51 -05:00
Mark Backman
4732a442d4
Merge pull request #2992 from pipecat-ai/mb/metrics-log-observer
...
Add MetricsLogObserver
2025-11-06 14:04:55 -05:00
Mark Backman
accdddce95
Add MetricsLogObserver
2025-11-06 14:01:20 -05:00
Mark Backman
daf9da823c
Merge pull request #2993 from pipecat-ai/mb/fix-gemini-token-counting
...
fix: correct GoogleLLMService token counting
2025-11-06 13:47:51 -05:00
Mark Backman
f6b6aa8766
fix: STTMuteFilter no longer sends STTMuteFrame
2025-11-06 11:53:32 -05:00
Toprak2
935eb58951
Update docstring for FrameProcessorQueue
...
Clarify the docstring for FrameProcessorQueue initialization.
2025-11-06 19:18:15 +03:00
Mark Backman
9f2ddcc5f4
Merge pull request #2927 from pipecat-ai/marcus/2025-10-28_sample_rtvi_fix
...
Add RTVIProcessor to foundational example 38b
2025-11-06 10:19:10 -05:00
Mark Backman
961e28517e
Remove arg from RTVIProcessor
2025-11-06 10:16:31 -05:00
Mark Backman
34d6f3fa00
fix: correct GoogleLLMService token counting
2025-11-06 10:01:37 -05:00
Filipi da Silva Fuchter
616abfd96c
Merge pull request #2987 from pipecat-ai/filipi/fix_start_endpoint
...
Fixing the runner start endpoint when enableDefaultIceServers is enabled.
2025-11-06 09:32:01 -03:00
Mark Backman
d7774ac599
Merge pull request #2991 from pipecat-ai/mb/fix-deepgram-is-connected
2025-11-06 06:35:51 -05:00
Mark Backman
c8c13ecee2
fix: DeepgramSTTService await is_connected()
2025-11-05 21:42:15 -05:00
Vanessa Pyne
314acc104e
Merge pull request #2990 from pipecat-ai/vp-fix-riva-default-voice
...
fix default riva tts voice_id
2025-11-05 18:40:41 -06:00
vipyne
1dfa59257d
fix default riva tts voice_id
2025-11-05 18:30:05 -06:00
Mark Backman
376dcc34f7
Merge pull request #2986 from pipecat-ai/mb/docs-0.0.93
...
Fix docstrings for 0.0.93 release, fix classmethod placement in Reque…
2025-11-05 15:49:09 -05:00
Filipi Fuchter
5ee8c56899
Fixing the runner start endpoint when enableDefaultIceServers is enabled.
2025-11-05 17:36:24 -03:00
kompfner
4397deddc7
Merge pull request #2970 from pipecat-ai/pk/tool-registration-improvements
...
Assorted tool registration improvements
2025-11-05 15:31:15 -05:00
Paul Kompfner
13d6078ea0
Minor tweak to an example for clarity.
2025-11-05 15:30:01 -05:00
Paul Kompfner
61aec08794
CHANGELOG item ordering tweak
2025-11-05 15:29:58 -05:00
Paul Kompfner
0f69d4aea3
Fixed an issue where GeminiLiveLLMService wasn't consistent in what it would do if if it received an LLMContextFrame (triggered by an LLMRunFrame, say) and there were no user messages in the initial context:
...
- If the context contained a system message, that message would be converted to a user message and the LLM would respond
- If the system message was provided as a constructor argument, though, no user messages would be sent to the LLM, and the LLM would therefore not respond
Not adding this fix to the CHANGELOG since `GeminiLiveLLMService`'s ability to properly handle context-provided tools and system instruction hasn't been published yet.
2025-11-05 15:29:04 -05:00
Paul Kompfner
84ba628dfb
Fix a bug in GeminiLiveLLMService where if only *one* of tools or system instruction was provided in the context, the other wouldn't fall back to using the value provided in the constructor.
...
Not adding this fix to the CHANGELOG since `GeminiLiveLLMService`'s ability to properly handle context-provided tools and system instruction hasn't been published yet.
2025-11-05 15:29:04 -05:00
Paul Kompfner
9ce33f23b9
Add an example demonstrating MCP usage with a speech-to-speech service (GeminiLiveLLMService) using the pattern of passing in tools in the constructor
2025-11-05 15:29:04 -05:00
Paul Kompfner
75245e1daa
Fix a bug in GeminiLiveLLMService where in some circumstances it wouldn't respond after a tool call
2025-11-05 15:29:04 -05:00
Paul Kompfner
24365aeefe
CHANGELOG wording fix
2025-11-05 15:29:04 -05:00
Paul Kompfner
29ef0f419f
Add typing formalizing MCPClient support for registering tools on an LLMSwitcher in addition to an LLMService.
2025-11-05 15:29:01 -05:00
Paul Kompfner
a9d78bd956
Make it possible to get a ToolsSchema out of an MCPClient without passing in an LLM service.
...
This allows folks to use `MCPClient` alongside the pattern of passing in tools at LLM init time, a pattern supported by speech-to-speech services such as `GeminiLiveLLMService`.
2025-11-05 15:28:23 -05:00
Paul Kompfner
e6f881bb08
Remove the "needs alternate schema" mechanism in MCPClient, moving the necessary schema massaging into GeminiLLMAdapter instead.
...
This does a couple of things:
- Makes the `MCPClient` LLM agnostic, setting us up for some upcoming improvements (like making it possible to use with `LLMSwitcher`)
- Makes `GeminiLLMAdapter` more robust, as the schema massaging that was previously only done in `MCPClient` is useful for all tools, not just for MCP-provided ones
2025-11-05 15:28:23 -05:00
Paul Kompfner
bee4165ba4
Add LLMSwitcher.register_direct_function()
2025-11-05 15:28:19 -05:00
Mark Backman
e2f6ce1078
Fix docstrings for 0.0.93 release, fix classmethod placement in RequestHandler
2025-11-05 15:27:16 -05:00
Paul Kompfner
0184493711
Update the service switcher example to illustrate registering tools on all LLMs in a switcher
2025-11-05 15:27:00 -05:00
Aleix Conchillo Flaqué
eb3c4c59fc
Merge pull request #2985 from pipecat-ai/revert-2976-aleix/interruption-task-frame-finished-event
...
Revert "fix interruption task frame context ordering"
2025-11-05 12:25:45 -08:00
Aleix Conchillo Flaqué
d844829538
Revert "fix interruption task frame context ordering"
2025-11-05 12:14:03 -08:00
Mark Backman
11b101e8a6
Merge pull request #2974 from pipecat-ai/mb/language-mapping-improvements
...
Improve language checking in STT and TTS services
2025-11-05 14:59:41 -05:00
Mark Backman
3db5ab9f23
Merge pull request #2983 from pipecat-ai/mb/bump-fastapi-0.122.0
...
Bumped the fastapi dependency to <0.122.0
2025-11-05 13:24:23 -05:00
Mark Backman
9a96e4060c
Bumped the fastapi dependency to <0.122.0
2025-11-05 13:13:47 -05:00
Aleix Conchillo Flaqué
d826279946
Merge pull request #2976 from pipecat-ai/aleix/interruption-task-frame-finished-event
...
fix interruption task frame context ordering
2025-11-05 09:53:26 -08:00
Aleix Conchillo Flaqué
e4212fb3c0
tests: add interruption strategies context ordering tests
2025-11-05 09:38:18 -08:00
Aleix Conchillo Flaqué
234aae3091
FrameProcessor: use finished_event for push_interruption_task_frame_and_wait
2025-11-05 09:38:17 -08:00
Aleix Conchillo Flaqué
c33b81bb92
PipelineTask: set finished_event InterruptionFrame is received
2025-11-05 09:37:55 -08:00
Aleix Conchillo Flaqué
a1c07039ee
frames: added finished_event to InterruptionFrame/InterruptionTaskFrame
2025-11-05 09:37:55 -08:00
Mark Backman
33be73692f
Merge pull request #2979 from thsunkid/feature/whisper-stt-probability-metrics
...
Feat: Access prob metrics for Whisper STT services using include_prob_metrics
2025-11-05 12:33:24 -05:00
Mark Backman
f6d7b6ae5f
Fix SpeechmaticsSTTService: use language checking for language and output_locale
2025-11-05 12:26:52 -05:00
Mark Backman
2ee54c985f
Improve language checking in STT and TTS services
2025-11-05 12:26:52 -05:00
Thu Nguyen
76c336644a
Merge branch 'main' into feature/whisper-stt-probability-metrics
2025-11-06 00:24:34 +07:00
Thu Nguyen
dd8711dee1
Added changelog
2025-11-06 00:23:42 +07:00
Thu Nguyen
c26c27fe21
Update util with new docs and extract_deepgram_probability
2025-11-06 00:23:20 +07:00
Mark Backman
159dbd078d
Merge pull request #2980 from pipecat-ai/mb/gemini-vertex-update
...
Refactor GoogleVertexLLMService to use GoogleLLMService as a base class
2025-11-05 11:35:50 -05:00
Mark Backman
c18ff999a5
Update GoogleVertexLLMService default model to gemini-2.5-flash
2025-11-05 11:28:41 -05:00
Mark Backman
80d127aaa4
Refactor GoogleVertexLLMService to use GoogleLLMService as a base class
2025-11-05 09:33:02 -05:00
Mark Backman
bbc7d3e2fb
Merge pull request #2977 from pipecat-ai/mb/request-handler-smallwebrtc
...
Fix: support request data in SmallWebRTC
2025-11-05 08:50:31 -05:00
Thu Nguyen
3486d63ef6
Add docs
2025-11-05 13:30:49 +07:00
Thu Nguyen
842c4a3485
Update base_stt
2025-11-05 13:26:59 +07:00
Thu Nguyen
0b779a880b
Feat: allow accessing prob metrics for Whisper STT services with include_prob_metrics param
2025-11-05 13:24:49 +07:00
Mark Backman
01f3421052
Fix: support request data in SmallWebRTC
2025-11-04 17:14:29 -05:00
Aleix Conchillo Flaqué
c20aa78648
Merge pull request #2969 from pipecat-ai/aleix/pipecat-observer-files
...
PipelineTask: load observers from PIPECAT_OBSERVER_FILES
2025-11-04 12:34:37 -08:00
Aleix Conchillo Flaqué
38f27ad991
PipelineTask: load observers from PIPECAT_OBSERVER_FILES
2025-11-04 12:10:53 -08:00
Mark Backman
0c38585034
Merge pull request #2973 from pipecat-ai/mb/cartesia-sonic-3-languages
...
Add sonic-3 languages to Cartesia TTS services
2025-11-04 14:43:06 -05:00
Mark Backman
8a09bbbf0e
Merge pull request #2972 from akash-dutta-dev/hotfix/addCustomParamForExotel
...
Add customer parameter in Call Data for Exotel
2025-11-04 14:29:58 -05:00
Vanessa Pyne
fb737ff671
Merge pull request #2967 from pipecat-ai/vp-39-bork-83
...
update example 39-mcp-stdio.py to use different mcp server
2025-11-04 09:02:29 -06:00
vipyne
b7a4d7371c
wrap tools = await mcp.register_tools(llm) in try in examples
2025-11-04 09:01:12 -06:00
vipyne
ef88d6a2ea
update example 39-mcp-stdio.py to use different mcp server
...
https://www.loom.com/share/a9f0a270261d4c6cb054ab2b4dcd6084
SO to Rijksmuseum MCP
https://github.com/r-huijts/rijksmuseum-mcp
2025-11-04 09:01:12 -06:00
kompfner
5c1bd8cda2
Merge pull request #2961 from pipecat-ai/pk/gemini-live-fix-session-resumption
...
Fix Gemini Live session resumption. The problem was that we weren't p…
2025-11-04 09:19:17 -05:00
Paul Kompfner
a82158045a
Fix Gemini Live session resumption. The problem was that we weren't properly ignoring send errors during reconnection.
2025-11-04 09:18:40 -05:00
Mark Backman
b1533ddfc4
Add sonic-3 languages to Cartesia TTS services
2025-11-04 07:57:04 -05:00
Mark Backman
0abc699f24
Merge pull request #2964 from pipecat-ai/mb/14j-nim-updates
...
Fix 14j foundational example
2025-11-04 07:24:53 -05:00
Akash Dutta
09018071e8
Add customer parameter in Call Data for Exotel
2025-11-04 16:57:28 +05:30
Mark Backman
1c53a5fd01
Fix 14j foundational example
2025-11-03 14:57:44 -05:00
kompfner
05d4753d3e
Merge pull request #2956 from pipecat-ai/pk/gemini-honor-context-provided-instructions-and-tools
...
`GeminiLiveLLMService` supports context-provided system instruction a…
2025-11-03 10:38:26 -05:00
Paul Kompfner
87131850bc
GeminiLiveLLMService supports context-provided system instruction and tools
2025-11-03 10:30:46 -05:00
Aleix Conchillo Flaqué
af83f45a49
Merge pull request #2959 from pipecat-ai/aleix/cancel-frame-reason
...
CancelFrame: add reason field to indicate why pipeline is being cancelled
2025-11-02 11:06:58 -08:00
Aleix Conchillo Flaqué
62e45f466a
EndFrame: add reason field to indicate why pipeline is being ended
2025-11-02 00:45:27 -07:00
Aleix Conchillo Flaqué
e85e93b9b1
CancelFrame: add reason field to indicate why pipeline is being cancelled
2025-11-02 00:44:47 -07:00
Mark Backman
074d3ff162
Merge pull request #2821 from shreyas-sarvam/sarvam/stt
...
Sarvam STT/STTT WS implementation
2025-10-31 13:47:27 -04:00
shreyas-sarvam
d680ec2e69
Merge branch 'main' into sarvam/stt
2025-10-31 23:09:47 +05:30
shreyas-sarvam
d905b21f72
fix: Pass input_audio_codec as an __init__ parameter
2025-10-31 23:07:48 +05:30
shreyas-sarvam
6c5d84ca4c
fix: Fixes for sample_rate being passed by PipelineParams
2025-10-31 23:03:25 +05:30
Aleix Conchillo Flaqué
334167e3d7
Merge pull request #2953 from pipecat-ai/aleix/pipecat-0.0.92
...
update CHANGELOG for 0.0.92. 🎃 "The Haunted Edition" 👻
2025-10-31 09:47:25 -07:00
Aleix Conchillo Flaqué
e3531a5f25
update CHANGELOG for 0.0.92. 🎃 "The Haunted Edition" 👻
2025-10-31 09:17:03 -07:00
Mark Backman
343e97666a
Merge pull request #2954 from pipecat-ai/mb/runner-meeting-token-properties
...
Add support for token properties in Daily util and development runner
2025-10-31 12:12:14 -04:00
Mark Backman
653e84321b
Add support for token properties in Daily util and development runner
2025-10-31 12:08:53 -04:00
Mark Backman
3585f724c4
Merge pull request #2952 from pipecat-ai/mb/add-daily-room-properties-to-runner
...
Add support for dailyRoomProperties when calling /start using the dev…
2025-10-31 12:04:42 -04:00
Mark Backman
5fe597d355
Add support for dailyRoomProperties when calling /start using the development runner
2025-10-31 12:01:03 -04:00
Aleix Conchillo Flaqué
67ab3773f6
Merge pull request #2949 from pipecat-ai/aleix/idle-timeout-observer
...
PipelineTask: add IdleFrameObserver to detect idle pipelines
2025-10-31 08:51:09 -07:00
Mark Backman
c6e12b9358
Merge pull request #2943 from pipecat-ai/mb/deepgram-http
...
Add DeepgramHttpTTSService
2025-10-31 11:51:06 -04:00
Aleix Conchillo Flaqué
0f5030bafa
tests: add unit test to check for idle timeout on swallowed frames
2025-10-31 08:45:56 -07:00
Aleix Conchillo Flaqué
ed93e29850
PipelineTask: add IdleFrameObserver to detect idle pipelines
2025-10-31 08:45:56 -07:00
Mark Backman
7eb880c5e8
Add DeepgramHttpTTSService
2025-10-31 11:39:32 -04:00
Aleix Conchillo Flaqué
4fa0de6660
Merge pull request #2947 from pipecat-ai/aleix/rename-add-to-context
...
UserImageRawFrame: rename add_to_context to append_to_context
2025-10-31 08:29:49 -07:00
kompfner
396c1bcc13
Merge pull request #2946 from pipecat-ai/pk/deprecate-expect-stripped-words
...
Deprecate `expect_stripped_words` option from `LLMAssistantAggregatorParams`…
2025-10-31 09:57:20 -04:00
shreyas-sarvam
57f6ae9e50
Merge branch 'main' into sarvam/stt
2025-10-31 17:36:52 +05:30
shreyas-sarvam
2d03e51109
fix: Remove unused imports, use sample_rate from base class
2025-10-31 17:31:59 +05:30
Mark Backman
1e7143e5f3
Merge pull request #2942 from pipecat-ai/mb/speechmatics-tts-changelog
...
Add SpeechmaticsTTSService, Soniox changes to changelog
2025-10-31 07:43:58 -04:00
Mark Backman
f820c20fa2
Add SpeechmaticsTTSService and SonioxSTTService changes to changelog
2025-10-31 07:41:17 -04:00
Mark Backman
83f395ff8f
Merge pull request #2940 from thsunkid/feature/google-tts-chirp-speaking-rate
...
Add dynamic speaking_rate control for Google TTS Chirp voices
2025-10-31 07:39:05 -04:00
shreyas-sarvam
09a7e08cbf
Merge branch 'main' into sarvam/stt
2025-10-31 15:21:09 +05:30
shreyas-sarvam
6f172bba8f
feat: Make input parameters accessible to users
2025-10-31 15:17:06 +05:30
shreyas-sarvam
1433df4de2
fix: Fix language param and include suggested way of handling STT response
2025-10-31 13:23:08 +05:30
Thu Nguyen
6ade5617fb
addressed comments
2025-10-31 09:53:47 +07:00
Aleix Conchillo Flaqué
685d440206
UserImageRawFrame: rename add_to_context to append_to_context
2025-10-30 15:18:27 -07:00
Paul Kompfner
ac5734d0ed
Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.
...
This commit does not change how it works in the older `LLMAssistantContextAggregator`.
2025-10-30 17:22:47 -04:00
Aleix Conchillo Flaqué
5e00133e64
Merge pull request #2935 from pipecat-ai/aleix/improve-image-and-vision-support
...
improve image and vision support
2025-10-30 14:09:01 -07:00
Aleix Conchillo Flaqué
42f0490414
examples(foundational): 14-* show how to tell the LLM we are capturing an image
2025-10-30 14:02:17 -07:00
Aleix Conchillo Flaqué
19f046a338
examples(foundational): add 12d-describe-image-moondream
2025-10-30 14:02:17 -07:00
Aleix Conchillo Flaqué
ec95618b94
don't tie UserImageRawFrame with function calls
2025-10-30 14:02:17 -07:00
Aleix Conchillo Flaqué
74fb6e7676
scripts(evals): improve eval prompting
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
8fa6cbac51
examples(foundational): added 14d docstrings
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
a997655eac
scripts(evals): simplify eval configuration and allow RunnerArgs body
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
3b3a215155
examples(foundational): re-add 12-* but load image from file
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
e458d3edfe
scripts(evals): update 12-* for 14-*-video
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
d7d409df60
examples(foundational): move 12-* to 14-*-video
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
5174b18176
LLMAssistantAggregator: don't mark function calls as completed when receiving user image
...
Before, when requesting a user image from a function call we had to wait for a
random time before we could indicate the function call was done. This was to
given time to the aggregator to process the image before marking the function
call as completed.
To avoid this, we now wait for the requested image to be received by the LLM
assistant agrgegator (using an asyncio event). Then, we can successfully mark
the function call as completed.
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
9c5690d670
LLMContext: added support for image messages with URLs
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
e0933e20d2
deprecated UserResponseAggregator
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
ce13155d26
vision(moondream): process VisionImageRawFrame
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
817a485d94
frames: added VisionImageRawFrame
2025-10-30 13:08:15 -07:00
Aleix Conchillo Flaqué
b094418d1e
LLMContext: add create_image_message and create_audio_message
2025-10-30 13:08:13 -07:00
Filipi da Silva Fuchter
08a1e09020
Merge pull request #2944 from pipecat-ai/filipi/flux_handlers
...
New event handlers for the DeepgramFluxSTTService.
2025-10-30 16:40:41 -03:00
Filipi Fuchter
52b33e5106
New event handlers for the DeepgramFluxSTTService.
2025-10-30 16:09:07 -03:00
Mark Backman
5db0871a20
Merge pull request #2873 from matejmarinko-soniox/main
...
Update model params for Soniox STT
2025-10-30 12:50:30 -04:00
Mark Backman
222c362fa4
Merge pull request #2937 from aaronng91/speechmatics-tts
...
Add Speechmatics TTS
2025-10-30 12:30:27 -04:00
Aaron Ng
9d509bb409
address changes
2025-10-30 16:25:10 +00:00
shreyas-sarvam
8d0e7e5e16
chore: Add changelog entry, update foundational examples
2025-10-30 19:22:14 +05:30
shreyas-sarvam
e7b8da7a83
feat: Refactor code to include language parameter, model_name and use _handle_transcription method
2025-10-30 19:01:04 +05:30
shreyas-sarvam
35c48a45cf
fix: Ruff format
2025-10-30 18:51:18 +05:30
shreyas-sarvam
14a365aa16
fix: Use message handler to handle responses
2025-10-30 17:54:32 +05:30
shreyas-sarvam
779fc0419d
Merge branch 'main' into sarvam/stt
2025-10-30 15:50:53 +05:30
Thu Nguyen
057e0c3973
Lint
2025-10-30 17:12:36 +07:00
Thu Nguyen
8a6abdd44b
feat: Add dynamic speaking_rate control for Google TTS Chirp voices
2025-10-30 17:09:41 +07:00
Mark Backman
7872fa2e88
Merge pull request #2934 from roshie548/add-cartesia-generation-config
...
feat: add generation_config support for Cartesia Sonic-3
2025-10-29 23:10:48 -04:00
Roshan
e86c546a1a
Merge branch 'main' into add-cartesia-generation-config
2025-10-29 18:31:09 -07:00
Roshan
abf34bcccf
address pr comments
2025-10-29 18:29:51 -07:00
Aleix Conchillo Flaqué
56eb633390
Merge pull request #2911 from pipecat-ai/aleix/daily-transport-improve-error-handling
...
DailyTransport: update start_dialout/start_recording return values
2025-10-29 16:28:10 -07:00
Aleix Conchillo Flaqué
6299b9db87
DailyTransport: trigger "on_error" if transcription fails to start/stop
2025-10-29 16:25:13 -07:00
Aleix Conchillo Flaqué
bcffa590a3
DailyTransport: update start_dialout/start_recording return values
2025-10-29 16:25:13 -07:00
kompfner
8b739aa444
Merge pull request #2889 from pipecat-ai/pk/openai-realtime-universal-llmcontext-2
...
Support new `LLMContext` pattern with `OpenAIRealtimeLLMService`
2025-10-29 16:54:37 -04:00
Paul Kompfner
8f15980c67
Get rid of unnecessary new task in example file
2025-10-29 16:23:50 -04:00
Paul Kompfner
89e9acf0e1
CHANGELOG and code comment tweaks
2025-10-29 16:21:04 -04:00
Paul Kompfner
ddac24e6c9
Fix a missing space in a warning message
2025-10-29 16:17:05 -04:00
Paul Kompfner
d0f52feba3
OpenAI Realtime needs the assistant context aggregator to have expect_stripped_words=False
2025-10-29 16:15:16 -04:00
Paul Kompfner
8894db4290
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Add warning about no longer pushing `TTSTextFrame`s.
2025-10-29 15:45:06 -04:00
Paul Kompfner
1f96cdf970
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Make `LLMUserAggregator` push the `LLMSetToolsFrame`s, in case a speech-to-speech service that needs to handle the frame itself—like `OpenAIRealtimeLLMService`—is downstream. As far as I can tell, pushing `LLMSetToolsFrame` should otherwise have no unwanted side effects.
2025-10-29 15:43:51 -04:00
Paul Kompfner
0282033208
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Add `LLMContext.get_messages_for_persistent_storage()` for compatibility with `OpenAILLMContext`, to avoid tripping up users who we're unknowingly migrating to using `LLMContext`.
2025-10-29 15:43:51 -04:00
Paul Kompfner
917ea27352
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Update `AzureRealtimeLLMService` example (19a) to use new `LLMContext` pattern.
2025-10-29 15:43:51 -04:00
Paul Kompfner
8c03df1463
Update some docstring arg descriptions to be a bit more current or accurate
2025-10-29 15:43:51 -04:00
Paul Kompfner
15aa76efba
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Maintain backward compatibility with functions specified in dict format.
2025-10-29 15:43:51 -04:00
Paul Kompfner
8ac421f8fd
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Remove unused imports.
2025-10-29 15:43:51 -04:00
Paul Kompfner
75b3ea9c96
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Fix tracing.
2025-10-29 15:43:51 -04:00
Paul Kompfner
95be1510ac
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Improve `OpenAIRealtimeLLMAdapter.get_messages_for_logging()`.
2025-10-29 15:43:51 -04:00
Paul Kompfner
df19011080
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Improve warning about transcription frame direction change.
2025-10-29 15:43:51 -04:00
Paul Kompfner
e42cf78e79
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Update deprecation versions.
2025-10-29 15:43:51 -04:00
Paul Kompfner
0495de52b6
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Log warning about transcription frame direction change.
2025-10-29 15:43:51 -04:00
Paul Kompfner
9bc02afd0d
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
CHANGELOG tweak.
2025-10-29 15:43:51 -04:00
Paul Kompfner
6140fdb2c9
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
In anticipation of `messages` property being added to `LLMContext` (in another PR), remove warnings about the need to use `get_messages()` instead.
2025-10-29 15:43:51 -04:00
Paul Kompfner
b6a1886dae
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
2025-10-29 15:43:51 -04:00
Paul Kompfner
42d0a097c5
Tweaks to 20b example
2025-10-29 15:43:51 -04:00
Paul Kompfner
3761804146
Make OpenAIRealtimeLLMService's websocket send method more resilient. Previously, it was possible for a websocket send attempt to occur during a disconnect.
2025-10-29 15:43:51 -04:00
Paul Kompfner
46e97c57c2
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Update 20b example to use new `LLMContext` pattern.
2025-10-29 15:43:51 -04:00
Paul Kompfner
19770b76b4
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Add back file that was removed, when it should've just been deprecated.
Also, fix version numbers in deprecation messages to match the next expected release.
2025-10-29 15:43:51 -04:00
Paul Kompfner
b34461bf93
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
2025-10-29 15:43:47 -04:00
Paul Kompfner
bab0aaf585
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Update `create_context_aggregator()` (which we're keeping around for backward compatibility) to create a `LLMContextAggregatorPair` rather than OpenAI-Realtime-specific aggregators.
2025-10-29 15:36:58 -04:00
Paul Kompfner
61944d22ef
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Implement sending tool call results to the OpenAI server based on reading context updates. This lets us use the normal assistant context aggregator and not a special OpenAI Realtime subclass that pushes up a special frame for function call results.
2025-10-29 15:36:58 -04:00
Paul Kompfner
47756319be
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Receiving a new context (via a context frame) no longer serves as a signal to reset the conversation. That’s because we’re now receiving new contexts from the user aggregator every time new messages are added, and from the assistant aggregator when function call results come in. The code pattern we're heading towards, of “diffing” each new context with the previous on, sets us up for doing more sophisticated things in the future, like sending specific messages to OpenAI to edit its internally-tracked context.
Also, remove code that was directly modifying context.
2025-10-29 15:36:58 -04:00
Paul Kompfner
5fa56df014
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Update 19b example with new pattern.
2025-10-29 15:36:58 -04:00
Paul Kompfner
8a151235c3
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Deprecate `send_transcription_frames`—transcription frames are now always sent.
2025-10-29 15:36:57 -04:00
Paul Kompfner
ec42f8c24e
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Push `TranscriptionFrame`s upstream, to be handled by the user context aggregator. This will require at least a couple of other changes:
- Updating examples to put transcript processors upstream from `OpenAIRealtimeLLMService`
- Maybe figuring out a way to preserve backward compatibility with existing pipelines that put transcript processors downstream from `OpenAIRealtimeLLMService`
- Updating `OpenAIRealtimeLLMService` to ignore new received context frames, since the upstream user context aggregator will generate those after each newly-added user message; hopefully nobody was reliant on the old behavior of resetting the conversation upon receiving a new context!
2025-10-29 15:36:57 -04:00
Paul Kompfner
29fd17b9ff
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).
...
Avoid pushing `LLMTextFrame` when `OpenAIRealtimeLLMService` is configured to output audio. This avoids duplicate text in assistant messages in context. Conceptually, a speech-to-speech service encapsulates TTS behavior; in a "traditional" pipeline, `LLMTextFrames` are swallowed by the TTS service, so they should similarly not be pushed by a speech-to-speech service. Only. `TTSTextFrame`s should be pushed.
2025-10-29 15:36:57 -04:00
Paul Kompfner
3ea1e357f2
Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (initial part of work)
2025-10-29 15:36:57 -04:00
kompfner
351ef617ae
Merge pull request #2932 from pipecat-ai/pk/gemini-live-universal-llmcontext
...
Update `GeminiLLMService` to work with `LLMContext` and `LLMContextAg…
2025-10-29 15:35:13 -04:00
Paul Kompfner
9dafb715c4
Update some deprecation versions
2025-10-29 15:30:43 -04:00
Paul Kompfner
82d494d3d4
Fix a bug in GeminiLiveLLMService related to ending gracefully—i.e. waiting for the bot to stop responding before ending the pipeline—when the service is configured with the TEXT modality
2025-10-29 14:34:02 -04:00
Mark Backman
e893aaa620
Merge pull request #2931 from ivaaan/hume-bugfix
...
Hume: use Octave v1 if description provided
2025-10-29 13:40:21 -04:00
Paul Kompfner
65c17a698e
Whoops - fix a bug in GeminiLiveLLMService where we weren't checking if a tool call result was already handled before reporting it to the LLM
2025-10-29 12:44:00 -04:00
Paul Kompfner
615aae5b95
Fix GeminiLiveLLMService's sending of LLMFullResponseStartFrame and LLMFullResponseEndFrame so that they properly bookend responses.
...
Properly bookended responses now work with:
- AUDIO modality (validated with 26b example)
- TEXT modality (validated with 26d example)
- AUDIO modality with Vertex AI (validated with 26h example)
It doesn't seem that TEXT modality is supported with Vertex AI, hence the missing "quadrant" of validation.
2025-10-29 12:33:37 -04:00
Aaron Ng
b0acbeffb9
add sm-app param
2025-10-29 16:33:18 +00:00
Ivan A
2f1061f300
Merge branch 'main' into hume-bugfix
2025-10-29 17:06:50 +01:00
ivaaan
9307079af2
upd changelog
2025-10-29 17:05:41 +01:00
Mark Backman
efa64642a4
Merge pull request #2930 from pipecat-ai/mb/simli-constructor-update
...
Update Simli to align with Pipecat constructor norms
2025-10-29 11:50:11 -04:00
Mark Backman
ede6c32149
Update Simli to align with Pipecat constructor norms
2025-10-29 11:47:23 -04:00
Aaron Ng
4050e8b7dc
add speechmatics tts
2025-10-29 14:53:20 +00:00
Roshan
b0f5fc02c4
refactor: use Pydantic BaseModel for GenerationConfig and simplify model_dump()
...
- Change GenerationConfig from dataclass to Pydantic BaseModel for consistency
- Simplify _build_msg() to use model_dump(exclude_none=True) instead of manual field extraction
- Simplify HTTP run_tts() to use model_dump(exclude_none=True) instead of manual field extraction
This addresses feedback from code review and reduces code duplication.
2025-10-28 18:41:58 -07:00
Aleix Conchillo Flaqué
493d6bf91e
Merge pull request #2936 from pipecat-ai/aleix/daily-python-0.21.0
...
pyproject: update daily-python to 0.21.0
2025-10-28 18:25:25 -07:00
Aleix Conchillo Flaqué
aaebcae2e8
pyproject: update daily-python to 0.21.0
2025-10-28 17:23:37 -07:00
Roshan
408264a0fd
docs: update CHANGELOG.md for generation_config feature
2025-10-28 15:16:49 -07:00
Roshan
df8aa3e4b0
feat: add generation_config support for Cartesia Sonic-3
...
Add GenerationConfig dataclass with volume, speed, and emotion parameters
for Cartesia Sonic-3 TTS models. This enables fine-grained control over
speech generation including volume (0.5-2.0), speed (0.6-1.5), and
emotion (60+ options).
Changes:
- Add GenerationConfig dataclass with proper Google-style docstrings
- Update CartesiaTTSService.InputParams to include generation_config
- Update CartesiaHttpTTSService.InputParams to include generation_config
- Modify _build_msg() to include generation_config in WebSocket messages
- Modify run_tts() to include generation_config in HTTP requests
- Maintain backward compatibility with existing speed and emotion parameters
The legacy speed (literal strings) and emotion (list) parameters remain
available for non-Sonic-3 models.
2025-10-28 15:10:46 -07:00
Mark Backman
4d82a1260b
Merge pull request #2933 from pipecat-ai/mb/remove-aiohttp-session-sarvam
...
Remove aiohttp_session arg from SarvamTTSService
2025-10-28 16:54:56 -04:00
Paul Kompfner
f974c66e12
Update GeminiLLMService to work with LLMContext and LLMContextAggregatorPair
2025-10-28 15:46:28 -04:00
Mark Backman
533372ed37
Remove aiohttp_session arg from SarvamTTSService
2025-10-28 15:39:14 -04:00
ivaaan
a9118eb2cd
use Octave 1 if description provided
2025-10-28 20:36:34 +01:00
Aleix Conchillo Flaqué
84ed2468e5
Merge pull request #2924 from pipecat-ai/aleix/daily-transport-remove-join-timeout
...
DailyTransport: don't timeout prematurely on join
2025-10-28 10:43:28 -07:00
Aleix Conchillo Flaqué
d82d855c20
DailyTransport: don't timeout prematurely on leave
2025-10-28 10:41:19 -07:00
Mark Backman
412ff2a4a1
Merge pull request #2929 from pipecat-ai/mb/cartesia-sonic-3
...
Update Cartesia's default model to sonic-3
2025-10-28 13:07:28 -04:00
Mark Backman
82ccc160fb
Merge pull request #2923 from pipecat-ai/mb/runner-no-proxy-required
...
Remove development runner requirement for proxy
2025-10-28 11:59:38 -04:00
Mark Backman
9ef60bd468
Update Cartesia's default model to sonic-3
2025-10-28 11:49:54 -04:00
marcus-daily
06e86cc107
Add RTVIProcessor to foundational example 38b
2025-10-28 12:14:23 +00:00
Aleix Conchillo Flaqué
f3c4bf08dd
DailyTransport: don't timeout prematurely on join
2025-10-27 17:52:19 -07:00
Mark Backman
f2cfbee3c3
Remove development runner requirement for proxy
2025-10-27 16:18:31 -04:00
Vanessa Pyne
8b063116ab
Merge pull request #2921 from pipecat-ai/vp-azure-ex-cleanup
...
cleanup logger message
2025-10-27 12:59:08 -05:00
vipyne
8096e62b34
cleanup logger message
2025-10-27 11:27:30 -05:00
kompfner
20f4b0e8ff
Merge pull request #2914 from pipecat-ai/pk/gemini-function-calling-fixes
...
Gemini function calling fixes
2025-10-27 09:45:29 -04:00
Paul Kompfner
6feaf91789
Fix a bug in GeminiLLMAdapter's handling of Gemini-specific context messages
2025-10-27 09:42:24 -04:00
Mark Backman
91d3ae07b3
Merge pull request #2915 from Rickaym/fix--rounding-the-edges-of-observer-function-method-deprecation
...
fix: use correct property names
2025-10-24 19:42:34 -04:00
Pyae Sone Myo
71841f71ef
fix: use correct property names
2025-10-25 00:47:46 +06:30
Paul Kompfner
949b807023
Close genai client more gracefully to avoid printed warnings. We're now following the genai library guidance: https://github.com/googleapis/python-genai?tab=readme-ov-file#close-a-client
2025-10-24 11:36:25 -04:00
Paul Kompfner
4ad15f9a01
Update Gemini service to include function name when sending function responses in context
2025-10-24 11:04:52 -04:00
Paul Kompfner
99d94fc625
Update Gemini service to use "user" role for function responses, as shown in the Gemini docs
2025-10-24 10:05:14 -04:00
Mark Backman
a3d630c0d1
Merge pull request #2908 from pipecat-ai/mb/runner-daily-start-route
...
fix: add support for DAILY_SAMPLE_ROOM_URL when calling /start for Da…
2025-10-23 14:15:42 -04:00
Mark Backman
04b482c445
Merge branch 'main' into mb/runner-daily-start-route
2025-10-23 14:11:38 -04:00
Mark Backman
b2bce4916f
Merge pull request #2900 from pipecat-ai/mb/quickstart-pipecat-cli
...
Quickstart to use Pipecat CLI
2025-10-23 10:55:42 -04:00
Mark Backman
60e9817f16
fix: add support for DAILY_SAMPLE_ROOM_URL when calling /start for DailyTransport
2025-10-22 16:48:30 -04:00
kompfner
c655d0d313
Merge pull request #2907 from pipecat-ai/mb/service-switcher-updates
...
ServiceSwitcher updates
2025-10-22 11:23:48 -04:00
Paul Kompfner
ea6e146f2d
Update TestServiceSwitcher to exercise targeting system frames only to the active service
2025-10-22 11:14:27 -04:00
Mark Backman
ec890a834f
Rename to filter_system_frames
2025-10-22 11:01:33 -04:00
Mark Backman
5b921fc054
fix: FunctionFilter adds block_system_frame arg
2025-10-22 10:53:01 -04:00
Mark Backman
f1040100f4
Update ServiceSwitcher and LLMSwitcher docstrings
2025-10-22 10:51:03 -04:00
Mark Backman
54691ee781
Merge pull request #2904 from pipecat-ai/mb/bump-aws-sdk-bedrock-runtime
...
Upgrade aws_sdk_bedrock_runtime to v0.1.1
2025-10-22 08:58:48 -04:00
Mark Backman
49239a23c6
Upgrade aws_sdk_bedrock_runtime to v0.1.1
2025-10-21 23:27:38 -04:00
Aleix Conchillo Flaqué
e0c43de13f
Merge pull request #2903 from pipecat-ai/aleix/pipecat-0.0.91
...
update CHANGELOG for 0.0.91
2025-10-21 19:09:23 -07:00
Aleix Conchillo Flaqué
cc4c96d099
update CHANGELOG for 0.0.91
2025-10-21 19:00:51 -07:00
Aleix Conchillo Flaqué
788465cb04
Merge pull request #2901 from pipecat-ai/pk/llmcontext-messages
...
Add `messages` property to `LLMContext` for usage parity with `OpenAI…
2025-10-21 18:00:33 -07:00
Aleix Conchillo Flaqué
db934eade0
Merge pull request #2891 from pipecat-ai/aleix/daily-pipecat-runner-args
...
runner: allow starting a bot from Daily's /start endpoint
2025-10-21 17:59:13 -07:00
Mark Backman
0b8c966a11
Merge pull request #2892 from pipecat-ai/mb/aws-llm-claude-fix
...
fix: AWSBedrockLLMService compatibility for newer Claude models
2025-10-21 20:50:20 -04:00
Mark Backman
5849485bc6
fix: AWSBedrockLLMService compatibility for newer Claude models
2025-10-21 19:47:27 -04:00
Aleix Conchillo Flaqué
459af58540
runner: allow starting a bot from Daily's /start endpoint
2025-10-21 16:28:11 -07:00
Aleix Conchillo Flaqué
576bd67e85
runner: add body field to RunnerArguments
2025-10-21 16:27:48 -07:00
Aleix Conchillo Flaqué
1e8629bf96
runner: allow passing an api_key to configure
2025-10-21 16:27:48 -07:00
Paul Kompfner
776a3526f9
Add messages property to LLMContext for usage parity with OpenAILLMContext.
...
This wasn't really an issue before, when folks were *knowingly* migrating from `OpenAILLMContext` to `LLMContext`. But in the latest AWS Nova Sonic change, we're swapping it out from under folks, so this kind of compatibility is more important.
For context, the reason we *didn't* offer the `messages` property earlier was to aid in the development of `LLMContext`—we wanted to draw attention to all the places where messages were being read from context, so we could find the places where we might need to pass an argument to the read.
2025-10-21 17:38:39 -04:00
kompfner
2ced044418
Merge pull request #2896 from pipecat-ai/pk/add-back-types-that-were-meant-to-be-deprecated-not-removed
...
Add back types that were removed, when they should only have been dep…
2025-10-21 17:33:17 -04:00
Mark Backman
151f187837
Merge pull request #2895 from pipecat-ai/mb/update-env-example
...
Organize the env.example file
2025-10-21 17:15:22 -04:00
Mark Backman
67afa718d0
Merge pull request #2898 from pipecat-ai/mb/ellipses-changelog
...
Changelog entry for PR #2877
2025-10-21 17:02:08 -04:00
Mark Backman
52ab0eccc0
Quickstart to use Pipecat CLI
2025-10-21 15:57:45 -04:00
Vanessa Pyne
d1f1b68b71
Merge pull request #2863 from pipecat-ai/vp-custom-frame-processor-ex
...
add 08-custom-frame-processor.py to foundational examples
2025-10-21 14:15:38 -05:00
Mark Backman
a479c32665
Merge pull request #2894 from pipecat-ai/mb/cli-readme
...
Add Pipecat CLI to README's ecosystem section
2025-10-21 13:20:12 -04:00
Mark Backman
9f66b0ba41
Add Pipecat CLI to README's ecosystem section
2025-10-21 13:17:37 -04:00
vipyne
23385ca3d2
replace foundational example 08-bots-arguing.py with 08-custom-frame-processor.py
2025-10-21 11:56:35 -05:00
vipyne
8b24bae9c5
pr notes
2025-10-21 11:42:06 -05:00
Mark Backman
0502ec6c44
Changelog entry for PR #2877
2025-10-21 11:58:27 -04:00
Mark Backman
81645910e0
Merge pull request #2877 from nimobeeren/patch-1
...
Add ellipsis character to sentence ending punctuation list
2025-10-21 11:53:17 -04:00
Filipi da Silva Fuchter
d6ab4c41b0
Merge pull request #2897 from pipecat-ai/filipi/fix_proxy_route
...
Fixed an issue in the runner's proxy_request
2025-10-21 12:28:04 -03:00
Filipi Fuchter
2f92cb8781
Fixed an issue in the runner's proxy_request where a session that exists but has empty data was being treated as invalid.
2025-10-21 11:41:52 -03:00
Paul Kompfner
fbf274374c
Add back types that were removed, when they should only have been deprecated
2025-10-21 09:56:31 -04:00
Mark Backman
427efecf5b
Organize the env.example file
2025-10-21 09:43:46 -04:00
Filipi da Silva Fuchter
b3e54546ac
Merge pull request #2888 from pipecat-ai/filipi/rtvi_duplicated_frames
...
Fixed an issue where the RTVIProcessor was sending duplicate UserStartedSpeakingFrame and UserStoppedSpeakingFrame messages.
2025-10-21 08:57:32 -03:00
Filipi Fuchter
de46631bac
Fixed an issue where the RTVIProcessor was sending duplicate UserStartedSpeakingFrame and UserStoppedSpeakingFrame messages.
2025-10-20 18:39:00 -03:00
vipyne
abf0150261
add 47-custom-frame-processor.py to foundational examples
2025-10-20 12:11:40 -05:00
Aleix Conchillo Flaqué
a0c93ab6de
update CHANGELOG cosmetics
2025-10-20 09:07:50 -07:00
Aleix Conchillo Flaqué
4bec566bbf
Merge pull request #2885 from pipecat-ai/aleix/daily-python-0.20.0
...
pyproject: update daily-python to 0.20.0
2025-10-20 08:04:52 -07:00
Aleix Conchillo Flaqué
ec3cd24182
pyproject: update daily-python to 0.20.0
2025-10-20 08:04:34 -07:00
kompfner
e36e64c2e8
Merge pull request #2750 from pipecat-ai/pk/aws-nova-sonic-universal-llmcontext-1
...
Support new `LLMContext` pattern with `AWSNovaSonicLLMService`
2025-10-20 10:12:53 -04:00
Paul Kompfner
02a88022dd
Add a bit more detail to CHANGELOG related to AWSNovaSonicLLMService's support for LLMContext
2025-10-20 10:06:09 -04:00
Paul Kompfner
6cae61f2cc
Add a bit more detail to CHANGELOG entry about AWSNovaSonicLLMService's support for LLMContext
2025-10-20 09:50:23 -04:00
Paul Kompfner
3b40079120
Add a detailed warning when trying to import things from pipecat.services.aws_nova_sonic.context or pipecat.services.aws.nova_sonic.context
2025-10-20 09:49:05 -04:00
Paul Kompfner
ff0b38859b
Remove AWS Nova Sonic's context.py, which was always concerned with types for internal use only. Now those types are either gone or moved elsewhere.
2025-10-20 09:49:05 -04:00
Paul Kompfner
4d499324d1
Re-apply a change to AWSNovaSonicLLMService that was lost in a rebase
2025-10-20 09:49:05 -04:00
Paul Kompfner
f13e006db2
Bump version in deprecation message in docstring
2025-10-20 09:49:05 -04:00
Paul Kompfner
87d9e8c9cd
Re-apply a couple of recent changes to AWSNovaSonicLLMService that were lost in a rebase
2025-10-20 09:49:05 -04:00
Paul Kompfner
4820f1c059
Address some AWSNovaSonicLLMService context-recording edge cases
2025-10-20 09:49:05 -04:00
Paul Kompfner
860c39d1b1
Get rid of LLMContext.get_messages_for_persistent_storage().
...
The reason for its `system_instruction` argument was to support usage with LLMs where you might pass the system instruction as a parameter to the `LLMService` rather than specifying it in the context.
But as I thought about it more I became unconvinced that the `system_instruction` argument was really beneficial:
- If you specified your system instruction in your context in the first place, it'll still be there when you read messages for persistent storage
- If you didn't specify your system instruction in the context and instead passed it in as an `LLMService` parameter, you most likely *don't* want it to be in the context when you read messages for persistent storage
- ...and if you really really do need to inject it at the start of the context, it's quite easy to do anyway
And if we remove the `system_instruction` argument from `get_messages_for_persistent_storage()`, then it's essentially just `get_messages()`.
2025-10-20 09:49:05 -04:00
Paul Kompfner
ae5c5ed7f6
Update AWSNovaSonicLLMService to work with LLMContext and LLMContextAggregatorPair
2025-10-20 09:49:00 -04:00
shreyas-sarvam
5052da8ce6
Merge branch 'main' into sarvam/stt
2025-10-20 13:45:24 +05:30
Aleix Conchillo Flaqué
7aa01c1ca8
Merge pull request #2882 from pipecat-ai/aleix/base-transport-output-cleanup
...
base output transport cleanup
2025-10-18 07:38:13 -07:00
Mark Backman
4d6356748f
Merge pull request #2819 from shreyas-sarvam/sarvam/tts-v3
...
feat: Add support for bulbul:v3
2025-10-18 09:36:57 -04:00
Mark Backman
5b1a182421
Merge branch 'main' into sarvam/tts-v3
2025-10-18 09:34:10 -04:00
Mark Backman
6ac0c34413
Merge pull request #2879 from sam-s10s/fix/smx-vocab
...
Fix for SpeechmaticsSTTService AdditionVocabEntry entries
2025-10-18 09:27:23 -04:00
Mark Backman
c115422dbf
Merge pull request #2857 from dan-ince-aai/main
...
feat: add keyterms_prompt to AssemblyAI service
2025-10-18 09:20:43 -04:00
Mark Backman
a2a973be27
Merge pull request #2842 from nbyers-altira/fix-riva-segmented
...
Fix NVIDIA Riva Segmented STT by adding missing is_final parameter to _handle_transcription
2025-10-18 09:11:11 -04:00
Aleix Conchillo Flaqué
0407744950
BaseOutputTransport: simplify process_frame
2025-10-17 21:55:20 -07:00
Aleix Conchillo Flaqué
7ce370ccc6
BaseOutputTransport: simplify bot speaking logic
2025-10-17 15:13:20 -07:00
nbyers-altira
a4867f61aa
be a tad more precise in changelog
2025-10-17 13:51:49 -04:00
nbyers-altira
a67a765783
add changelog, run linter
2025-10-17 13:49:52 -04:00
nbyers-altira
81221668b1
Merge remote-tracking branch 'upstream/main' into fix-riva-segmented
2025-10-17 13:45:59 -04:00
Daniel Ince
cc9c264940
Merge branch 'main' into main
2025-10-17 15:15:36 +01:00
Sam Sykes
f2c61ac9fd
Fix for AdditionVocabEntry without sounds_like items.
2025-10-17 14:34:37 +01:00
Filipi da Silva Fuchter
88f8c10f63
Merge pull request #2875 from pipecat-ai/filipi/rtvi_routes
...
Creating the WebRTC routes that mimic the ones provided by Pipecat Cloud.
2025-10-17 10:13:45 -03:00
Filipi Fuchter
855f4842dd
Creating the WebRTC routes that mimic the ones provided by Pipecat Cloud.
2025-10-17 10:10:19 -03:00
Filipi da Silva Fuchter
2bf44fe2af
Merge pull request #2853 from pipecat-ai/filipi/trickle_ice
...
Adding support for trickle ice.
2025-10-17 09:00:32 -03:00
Filipi Fuchter
3e8a7cc254
Adding support for trickle ICE to the SmallWebRTCTransport.
2025-10-17 08:57:45 -03:00
Daniel Ince
a600c05570
Merge branch 'main' into main
2025-10-17 11:43:38 +01:00
dan-ince-aai
3ba6b55659
feat: multilingual + changelog updates
2025-10-17 11:38:03 +01:00
dan-ince-aai
d5f2dcfac0
lint
2025-10-17 11:32:06 +01:00
Nimo Beeren
d1d74c571c
add ellipsis character to sentence ending punctuation list
2025-10-17 10:38:06 +02:00
shreyas-sarvam
d12134038b
chore: Update CHANGELOG
2025-10-17 10:07:58 +05:30
shreyas-sarvam
a22af3a7e0
Merge branch 'main' into sarvam/stt
2025-10-17 10:00:49 +05:30
Aleix Conchillo Flaqué
76e07c6c48
Merge pull request #2870 from pipecat-ai/aleix/openaitts-update-settings
...
OpenAITTSService: allow updating instructions and speed
2025-10-16 13:21:12 -07:00
Aleix Conchillo Flaqué
8d8503bca7
OpenAITTSService: allow updating instructions and speed
2025-10-16 13:20:49 -07:00
Aleix Conchillo Flaqué
a444097060
Merge pull request #2872 from pipecat-ai/aleix/pipeline-task-cancellation-fixes
...
PipelineTask: fix task cancellation issues
2025-10-16 13:18:13 -07:00
Aleix Conchillo Flaqué
1b9e96c016
PipelineTask: fix task cancellation issues
2025-10-16 13:16:19 -07:00
Vanessa Pyne
7967bc53c3
Merge pull request #2868 from pipecat-ai/vp-whatsapp-dep-mv
...
only import whatsapp deps if using whatsapp runner
2025-10-16 14:16:28 -05:00
vipyne
6381335346
Add --whatsapp flag to runner
2025-10-16 14:15:26 -05:00
vipyne
0fd5d26104
add WHATSAPP_APP_SECRET to required whatsapp env vars
2025-10-16 10:37:56 -05:00
vipyne
41f817bf04
only import whatsapp deps if using whatsapp runner
2025-10-16 10:37:56 -05:00
Matej Marinko
9acc36c58e
Update model params for Soniox STT
...
- remove deprecated parameters and add new ones
- add support for v3 context
2025-10-16 08:51:40 +02:00
shreyas-sarvam
27115e6565
Merge branch 'main' into sarvam/tts-v3
2025-10-16 12:09:50 +05:30
shreyas-sarvam
1ecf6e05fe
Merge branch 'main' into sarvam/stt
2025-10-16 12:08:32 +05:30
Mark Backman
3c4807d7d4
Merge pull request #2859 from pipecat-ai/mb/openai-package-upgrade
...
Bump openai, openpipe versions, add 14x foundational example
2025-10-15 15:41:32 -04:00
Mark Backman
8902f1dc94
Bump openai, openpipe versions, add 14x foundational example
2025-10-15 15:17:22 -04:00
Mark Backman
a25333ee51
Merge pull request #2856 from pipecat-ai/mb/pr-2840-cleanup
...
Fix an issue in ElevenLabsHttpTTSService where the last word is not e…
2025-10-15 15:16:43 -04:00
Mark Backman
82c7d7ad83
Merge pull request #2867 from pipecat-ai/mb/update-moondream-readme
...
Update moondream chatbot README link
2025-10-15 15:16:19 -04:00
Mark Backman
ba2ab51ef7
Merge pull request #2866 from pipecat-ai/mb/add-sentry-foundational
...
Add foundation 47-sentry-metrics.py
2025-10-15 15:15:52 -04:00
Mark Backman
22557fa668
Fix an issue in ElevenLabsHttpTTSService where the last word is not emitted
2025-10-15 15:13:54 -04:00
Vanessa Pyne
3fbf59e7c6
Merge pull request #2864 from pipecat-ai/vp-trace-log
...
WhatsApp transport debug log -> trace log
2025-10-15 13:03:58 -05:00
vipyne
129ab5ea0e
WhatsApp transport debug log -> trace log
2025-10-15 13:02:57 -05:00
Aleix Conchillo Flaqué
dc917523d0
Merge pull request #2855 from pipecat-ai/aleix/stt-tts-connected-disconnected-events
...
services: added on_connected/on_disconnected events
2025-10-15 10:41:38 -07:00
Aleix Conchillo Flaqué
5ea7cc9d32
services: added on_connected/on_disconnected events
2025-10-15 10:39:31 -07:00
Mark Backman
e11ede475b
Update moondream chatbot README link
2025-10-15 13:22:56 -04:00
Mark Backman
90d29e04af
Merge pull request #2861 from pipecat-ai/mb/11labs-http-apply-text-normalization-fix
...
fix: set apply_text_normalization as request parameter in ElevenLabsH…
2025-10-15 12:59:36 -04:00
Mark Backman
4c67136a8d
Merge pull request #2858 from pipecat-ai/mb/daily-runner-room-properties
...
Add room_properties to the Daily runner configure() method
2025-10-15 12:58:18 -04:00
Mark Backman
9d78402a33
fix: set apply_text_normalization as request parameter in ElevenLabsHttpTTSService
2025-10-15 12:56:42 -04:00
Mark Backman
73877218e9
Add room_properties to the Daily runner configure() method
2025-10-15 12:55:19 -04:00
Mark Backman
6a1be90cbb
Merge pull request #2862 from pipecat-ai/mb/11labs-http-aggregate-sentences
...
Add aggregate_sentences arg to ElevenLabsHttpTTSService
2025-10-15 12:54:23 -04:00
Aleix Conchillo Flaqué
fbac959ecb
Merge pull request #2865 from pipecat-ai/aleix/stop-audio-filter-also-on-cancel
...
BaseInputTransport: stop audio filter on cancel
2025-10-15 09:53:24 -07:00
Aleix Conchillo Flaqué
18dd85431c
Merge pull request #2854 from pipecat-ai/aleix/cartesia-stt-service-websocket
...
CartesiaSTTService to inherit from WebsocketSTTService
2025-10-15 09:51:42 -07:00
Aleix Conchillo Flaqué
abc569b3d2
examples(foundational/07): use CartesiaSTTService
2025-10-15 09:46:57 -07:00
Mark Backman
fa5d4ecf86
Add foundation 47-sentry-metrics.py
2025-10-15 12:45:07 -04:00
Aleix Conchillo Flaqué
83b0dc39f7
BaseInputTransport: stop audio filter on cancel
2025-10-15 09:22:48 -07:00
Mark Backman
0c31b5ef19
Add aggregate_sentences arg to ElevenLabsHttpTTSService
2025-10-15 11:49:26 -04:00
dan-ince-aai
d16c36c56d
feat: add keyterms_prompt to AssemblyAI service
2025-10-15 14:27:52 +01:00
Mark Backman
8fe3bcd484
Merge pull request #2840 from Rickaym/fix--excess-space-in-elevelabs-word-timestamp-joins
...
fix: handle ElevenLabs partial word concatenation across alignment chunks gracefully
2025-10-15 09:01:05 -04:00
Aleix Conchillo Flaqué
be2858bfbb
CartesiaSTTService: inherit from WebsocketSTTService
2025-10-14 14:14:57 -07:00
Pyae Sone Myo
b6b0997553
fix: add support for partial words
2025-10-14 23:06:13 +06:30
Pyae Sone Myo
3b751322d3
fix: add interruption reset for partial word states
2025-10-14 23:04:09 +06:30
Aleix Conchillo Flaqué
fce6f55ddb
Merge pull request #2844 from pipecat-ai/aleix/runner-files-path
...
runner: allow subdirectories in --folder
2025-10-14 08:38:38 -07:00
Aleix Conchillo Flaqué
d9580f72a9
runner: allow subdirectories in --folder
2025-10-13 18:29:19 -07:00
nbyers-altira
cc66ac14f1
add is_final to segmented func. sig. instead so tracing is consistent
2025-10-13 10:48:41 -04:00
nbyers-altira
9ddec0f8b4
is_final is not part of the segmented _handle_transcription function signature
2025-10-13 10:44:25 -04:00
shreyas-sarvam
5cc1d8a024
refactor: Update dependencies and improve logging
2025-10-13 10:18:15 +05:30
shreyas-sarvam
9babfe9fd9
refactor: Improve code reability and replace deprecated interruption frames
2025-10-13 08:54:29 +05:30
Pyae Sone Myo
21d8d148b8
fix: handle partial words across alignment chunks gracefully
2025-10-12 22:10:11 +06:30
Aleix Conchillo Flaqué
0588c82bbf
Merge pull request #2838 from makosst/manta_graph_readme
...
Added Manta Graph to README
2025-10-11 09:31:21 -07:00
makosst
16e9093d5a
Added Manta Graph to README
2025-10-11 09:20:17 -07:00
Aleix Conchillo Flaqué
91a5d580fd
Merge pull request #2835 from pipecat-ai/aleix/tts-http-aligned-audio-frames
...
tts: fix RimeHttpTTSService/PiperTTSService 16-bit audio frames alignment
2025-10-10 14:20:44 -07:00
Aleix Conchillo Flaqué
0473556992
tts: fix RimeHttpTTSService/PiperTTSService 16-bit audio frames alignment
2025-10-10 14:19:22 -07:00
Aleix Conchillo Flaqué
fdaa4e476e
Merge pull request #2830 from pipecat-ai/aleix/pipecat-0.0.90
...
update CHANGELOG for 0.0.90
2025-10-10 10:22:26 -07:00
Aleix Conchillo Flaqué
502e7e42a7
update CHANGELOG for 0.0.90
2025-10-10 10:20:19 -07:00
kompfner
2ab3d4fb42
Merge pull request #2834 from pipecat-ai/pk/update-vertex-disclaimer
...
Update a Google Vertex disclaimer for accuracy
2025-10-10 13:19:51 -04:00
Paul Kompfner
55014bdd77
Update a Google Vertex disclaimer for accuracy
2025-10-10 13:18:03 -04:00
kompfner
334796bd65
Merge pull request #2833 from pipecat-ai/pk/vertex-non-optional-location
...
`location` should not be optional when using Google Vertex.
2025-10-10 13:02:40 -04:00
Paul Kompfner
1c25b6fb72
location should not be optional when using Google Vertex.
...
Also, update `GoogleVertexLLMService` initialization pattern in the example file.
2025-10-10 12:58:45 -04:00
Mark Backman
91b29de7ca
Merge pull request #2832 from pipecat-ai/mb/docs-fixes-0.0.90
...
Docs fixes for 0.0.90 release
2025-10-10 12:46:40 -04:00
Mark Backman
21d610cd30
Docs fixes for 0.0.90 release
2025-10-10 12:43:31 -04:00
Mark Backman
f7fe673ad1
Merge pull request #2831 from pipecat-ai/mb/update-evals
...
Update release evals for OpenAI Realtime, Gemini Live Vertex; shorten…
2025-10-10 12:34:27 -04:00
Mark Backman
4b415721e2
Update release evals for OpenAI Realtime, Gemini Live Vertex; shorten 26 foundational names
2025-10-10 12:26:23 -04:00
kompfner
8d2a98e0e7
Merge pull request #2829 from pipecat-ai/pk/fix-gemini-live-deprecation-messages
...
Fix deprecation messages pointing users to the new import paths for G…
2025-10-10 10:42:15 -04:00
Paul Kompfner
523e890c8c
Fix deprecation messages pointing users to the new import paths for Gemini Live
2025-10-10 10:30:38 -04:00
kompfner
3c748fe772
Merge pull request #2823 from pipecat-ai/pk/vertex-init-args-fixup
...
Move `location` and `project_id` out of `InputParams` in `GoogleVerte…
2025-10-10 10:18:51 -04:00
kompfner
d293cee372
Merge pull request #2822 from pipecat-ai/pk/make-pause-processing-frames-more-robust
...
Make `pause_processing_frames()` and `pause_processing_system_frames(…
2025-10-10 10:16:27 -04:00
Paul Kompfner
8b62a96878
Improve how we're deprecating location and project_id in GoogleVertexLLMService, allowing user code to (correctly) continue referring to GoogleVertexLLMService.InputParams.
...
Also fix the slightly wrong (but so far harmless) pattern of initializing `OpenAILLMService.InputParams()` in the `GoogleVertexLLMService` if `params` wasn't provided—we should be letting the superclass decide what to do if the argument isn't specified.
2025-10-10 10:12:00 -04:00
Mark Backman
0c102ce70b
Merge pull request #2826 from pipecat-ai/mb/deprecate-livekit-frame-serializer
...
Deprecate LivekitFrameSerializer
2025-10-10 10:01:45 -04:00
Mark Backman
3894d2a4b9
Deprecate LivekitFrameSerializer
2025-10-10 09:51:57 -04:00
Aleix Conchillo Flaqué
1f6b61c0db
Merge pull request #2828 from pipecat-ai/aleix/gemini-live-gemini-to-llm
...
google: rename google.gemini_live.gemini to google.gemini_live.llm
2025-10-10 06:42:51 -07:00
Aleix Conchillo Flaqué
8ee28b37cd
google: rename google.gemini_live.vertext to google.gemini_live.llm_vertex
2025-10-10 06:41:19 -07:00
Filipi da Silva Fuchter
e85e7e4d84
Merge pull request #2773 from pipecat-ai/filipi/krisp_viva
...
Added audio filter `KrispVivaFilter` using the Krisp VIVA SDK.
2025-10-10 09:51:15 -03:00
Filipi Fuchter
1b3afb5511
Added audio filter KrispVivaFilter using the Krisp VIVA SDK
2025-10-10 09:44:47 -03:00
Aleix Conchillo Flaqué
7cec013666
google: rename google.gemini_live.gemini to google.gemini_live.llm
2025-10-09 22:20:09 -07:00
Aleix Conchillo Flaqué
86127167fb
Merge pull request #2827 from pipecat-ai/aleix/openai-realtime-move
...
move openai_realtime to openai.realtime
2025-10-09 22:18:04 -07:00
Aleix Conchillo Flaqué
9935a68018
examples(19b): fix deprecations
2025-10-09 22:14:52 -07:00
Aleix Conchillo Flaqué
5679dde70f
ai_service: use openai.realtime.events instead of openai_realtime_beta.events
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
d81b0f6368
update CHANGELOG with openai_realtime deprecation
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
9698b008da
deprecate openai_realtime
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
7b05c9283b
move openai.realtime.azure to azure.realtime.llm
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
303dd2ec35
move openai.realtime.openai to openai.realtime.llm
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
aa6e81648a
move openai_realtime to openai.realtime
2025-10-09 22:14:46 -07:00
Aleix Conchillo Flaqué
1a87870ef3
Merge pull request #2825 from pipecat-ai/aleix/aws-nova-sonic-move
...
move aws_nova_sonic to aws.nova_sonic
2025-10-09 18:37:46 -07:00
Aleix Conchillo Flaqué
aac4ce2d12
update CHANGELOG with aws_nova_sonic deprecation
2025-10-09 18:32:26 -07:00
Aleix Conchillo Flaqué
2a79b2c853
aws: deprecate aws_nova_sonic
2025-10-09 17:44:29 -07:00
Aleix Conchillo Flaqué
15bf5b1533
aws: move aws_nova_sonic to aws.nova_sonic
2025-10-09 17:35:47 -07:00
Aleix Conchillo Flaqué
cdc86db8ce
update CHANGELOG with GoogleVertexLLMService token fix
2025-10-09 16:58:22 -07:00
Aleix Conchillo Flaqué
9d2ad750b5
Merge pull request #2779 from LucasStringPay/patch-1
...
Ignore None value for 'completion_tokens' or similar for Gemini
2025-10-09 16:55:33 -07:00
Aleix Conchillo Flaqué
19ceb1a48f
Merge pull request #2817 from pipecat-ai/aleix/runner-download-folder
...
runner: add --folder argument to allow file downloads
2025-10-09 16:55:17 -07:00
Aleix Conchillo Flaqué
59217eae38
runner: add --folder argument to allow file downloads
2025-10-09 16:49:51 -07:00
Aleix Conchillo Flaqué
bea0aee835
Merge pull request #2824 from pipecat-ai/aleix/gemini-under-google
...
google: move gemini_live inside google service
2025-10-09 16:40:15 -07:00
Aleix Conchillo Flaqué
aeace9b9be
google: move gemini_live inside google service
2025-10-09 16:06:42 -07:00
Paul Kompfner
2994640f47
Move location and project_id out of InputParams in GoogleVertexLLMService, making them top-level init args instead. We do this for two reasons:
...
- Conceptually, these args comprise project-level setup, akin to credentials, whereas everything in `InputParams` is concerned with model configuration
- Providing a `project_id` when initializing `GoogleVertexLLMService` should not be optional, but prior to the change in this commit it was (erroneously) treated as optional by dint of `InputParams` being optional
This improvement was discussed [in this comment](https://github.com/pipecat-ai/pipecat/pull/2795#discussion_r2408279142 ).
2025-10-09 16:53:21 -04:00
Paul Kompfner
10069719e4
Make pause_processing_frames() and pause_processing_system_frames() more robust in FrameProcessor.
...
To understand this fix, let's look exclusively at `pause_processing_frames()` (`pause_processing_system_frames()` works the same way).
`pause_processing_frames()` works by setting a `__should_block_frames` flag, which is then read each time through the loop in the long-running `__process_frame_task_handler`. if `__should_block_frames` is `True`, it pauses processing frames until it's resumed.
Prior to this fix, the check for `__should_block_frames` was before `await self.__process_queue.get()`. The problem is that a lot of the time spent in the loop is waiting for a frame from the process queue. So if `pause_processing_frames()` is set at any time other than within `process_frame()` itself, it actually won't have an effect by the next frame, only on the frame *after* the next, which is later than intended.
Because thus far in the Pipecat codebase we've only ever called `pause_processing_frames()` and `pause_processing_system_frames()` from within `process_frame()`, this change should have no behavioral effect. But it will be helpful if we ever need to call it from anywhere else. I noticed this issue while developing a feature that did exactly that (though I later abandoned that code).
2025-10-09 15:57:31 -04:00
shreyas-sarvam
1e31fc7f9b
fix: Format errors
2025-10-09 22:09:25 +05:30
kompfner
046b76df60
Merge pull request #2820 from pipecat-ai/pk/gemini-live-vertex-support
...
Support Gemini Live + Vertex AI
2025-10-09 11:53:41 -04:00
Paul Kompfner
f2d9063984
Renames: remove "multimodal" from Gemini Live types
2025-10-09 10:58:36 -04:00
shreyas-sarvam
7c1e2793c5
feat: Add support for bulbul:v3 and bulbul:v3-beta
2025-10-09 18:26:22 +05:30
Paul Kompfner
99f008e927
Make a note in our examples that there's an issue with Gemini Live + Vertex around specifying a modality other than AUDIO
2025-10-08 21:03:07 -04:00
Paul Kompfner
2699f0c2a6
Fix tool calls when using Gemini Live + Vertex AI
2025-10-08 21:03:07 -04:00
Paul Kompfner
0b6dd98000
Make a note in our examples that there's an issue with Gemini Live + Vertex around using "google_search" alongside other tools
2025-10-08 21:03:07 -04:00
Paul Kompfner
a14fb20d15
Fix Gemini Live w/Vertex AI not being able to handle an empty list provided for "function_declarations"
2025-10-08 21:03:07 -04:00
Paul Kompfner
728361a6a7
Add GeminiVertexMultimodalLiveLLMService
2025-10-08 21:03:01 -04:00
kompfner
106db69e8e
Merge pull request #2816 from pipecat-ai/pk/gemini-live-await-ongoing-response-after-endframe
...
Implement ending `GeminiMultimodalLiveLLMService` gracefully (i.e. af…
2025-10-08 17:20:14 -04:00
Paul Kompfner
cf90071926
Format fix
2025-10-08 17:19:46 -04:00
Paul Kompfner
deaeb75a1f
Fix changelog after rebase (and add a missing item)
2025-10-08 17:16:31 -04:00
Paul Kompfner
a666327d70
Implement ending GeminiMultimodalLiveLLMService gracefully (i.e. after the bot is finished)
2025-10-08 17:13:04 -04:00
kompfner
13a0522546
Merge pull request #2804 from pipecat-ai/pk/gemini-live-session-resumption
...
Add (relatively spartan) reconnection logic to `GeminiMultimodalLiveLLMService`
2025-10-08 17:10:45 -04:00
Paul Kompfner
7da37a0d1f
Pull _connection_established_threshold and _max_consecutive_failures into file-level constants
2025-10-08 17:04:05 -04:00
Paul Kompfner
7efb22a323
Add (relatively spartan) reconnection logic to GeminiMultimodalLiveLLMService, leveraging the Gemini Live session resumption mechanism
2025-10-08 16:53:21 -04:00
kompfner
8084e2f909
Merge pull request #2776 from pipecat-ai/pk/gemini-live-gen-ai-library
...
Gemini Live service uses the `genai` library rather than WebSockets directly
2025-10-08 16:50:16 -04:00
Paul Kompfner
86127c6a6e
Add to the changelog the GeminiMultimodalLiveLLMService update to use google-genai
2025-10-08 16:46:41 -04:00
Paul Kompfner
402e019ae2
Make a bit of code clearer
2025-10-08 16:45:55 -04:00
Paul Kompfner
f09e4e238b
Fix some mishandling of enum values
2025-10-08 16:45:55 -04:00
Paul Kompfner
2921162b3b
Add deprecation warning around importing StartSensitivity and EndSensitivity from pipecat.services.gemini_multimodal_live.events
2025-10-08 16:45:55 -04:00
Paul Kompfner
ac1582c906
Let users directly use google-genai types rather than aliased re-exported types
2025-10-08 16:45:55 -04:00
Paul Kompfner
e4b01a5844
Bumping deprecation version of GeminiMultimodalLiveLLMService's base_url arg
2025-10-08 16:45:55 -04:00
Paul Kompfner
fa663abbbc
Add CHANGELOG entry for new GeminiMultimodalLiveLLMService configuration options
2025-10-08 16:45:55 -04:00
Paul Kompfner
d19e6111c3
Bumping deprecation version of GeminiMultimodalLiveLLMService's base_url arg
2025-10-08 16:45:55 -04:00
Paul Kompfner
8a6d504a7e
Add enable_affective_dialog and proactivity settings to GeminiMultimodalLiveLLMService
2025-10-08 16:45:55 -04:00
Paul Kompfner
43915937f2
Update how we import and re-export some types in GeminiMultimodalLiveLLMService
2025-10-08 16:45:55 -04:00
Paul Kompfner
48e92a22fe
Add thinking settings to GeminiMultimodalLiveLLMService
2025-10-08 16:45:55 -04:00
Paul Kompfner
566af6b0b8
Minor comment improvement
2025-10-08 16:45:55 -04:00
Paul Kompfner
12e7613d5f
Deprecate the base_url argument to GeminiMultimodalLiveLLMService.
...
It expected a WebSocket URL, but we're no longer (directly) using WebSockets to talk to Gemini. Instead of trying to (potentially erroneously) map a given custom WebSocket URL to an `HttpOptions` object (the new preferred way of customizing requests made by the Gemini API client), we're simply deprecating `base_url` and pointing users to the `http_options` argument instead.
2025-10-08 16:45:55 -04:00
Paul Kompfner
04a68f2c57
Fix tracing in GeminiMultimodalLiveLLMService
2025-10-08 16:45:55 -04:00
Paul Kompfner
9b4ca12f49
Revert to only supporting providing a single modality - looks like specifying a list of modalities results in an API error.
...
Also, fix some missing `await`s in error handling.
2025-10-08 16:45:55 -04:00
Paul Kompfner
453ce715a6
Add some error handling to GeminiMultimodalLiveLLMService
2025-10-08 16:45:55 -04:00
Paul Kompfner
d87b6189ba
Update GeminiMultimodalLiveLLMService to use the google-genai library, which is the new recommended approach for interfacing with Gemini Live.
2025-10-08 16:45:55 -04:00
Mark Backman
8293347b77
Merge pull request #2814 from pipecat-ai/mb/openai-service-tier
...
Add service_tier to BaseOpenAILLMService
2025-10-08 16:44:27 -04:00
Mark Backman
c85a3f0b94
Add service_tier to BaseOpenAILLMService
2025-10-08 16:33:36 -04:00
Aleix Conchillo Flaqué
233fb25e6c
Merge pull request #2810 from pipecat-ai/aleix/on-pipeline-error
...
PipelineTask: add on_pipeline_error event
2025-10-08 11:26:46 -07:00
Aleix Conchillo Flaqué
080978daa6
Merge pull request #2808 from pipecat-ai/aleix/readme-pipecat-tv
...
README: add Pipecat TV reference
2025-10-08 11:26:17 -07:00
Aleix Conchillo Flaqué
62b7c3d3b2
PipelineTask: add on_pipeline_error event
2025-10-07 18:36:38 -07:00
Mark Backman
4b2379cba8
Merge pull request #2798 from ivaaan/hume-rtvi
...
Hume add RTVI
2025-10-07 21:20:50 -04:00
Aleix Conchillo Flaqué
92087bdfa8
update CHANGELOG
2025-10-07 18:08:18 -07:00
Aleix Conchillo Flaqué
617919ac09
Merge pull request #2809 from pipecat-ai/aleix/revert-interruption-strategies-ordering
...
revert interruption strategies ordering
2025-10-07 18:07:07 -07:00
Aleix Conchillo Flaqué
0669daec3d
update CHANGELOG for 0.0.89
2025-10-07 17:44:10 -07:00
Aleix Conchillo Flaqué
7c15a8c800
Revert "fix context order when using interruption strategies"
...
This reverts commit de8ee96927 .
2025-10-07 17:42:35 -07:00
Aleix Conchillo Flaqué
066b77fba0
README: add Pipecat TV reference
2025-10-07 15:01:28 -07:00
Aleix Conchillo Flaqué
d9aef5f916
some last release CHANGELOG updates
2025-10-07 14:29:27 -07:00
Aleix Conchillo Flaqué
91ae3f8a9b
Merge pull request #2807 from pipecat-ai/aleix/pipecat-0.0.88
...
update CHANGELOG for 0.0.88
2025-10-07 14:16:05 -07:00
Aleix Conchillo Flaqué
36da623352
update CHANGELOG for 0.0.88
2025-10-07 14:12:12 -07:00
Filipi da Silva Fuchter
31b9087ea6
Merge pull request #2805 from pipecat-ai/filipi/allowing_update_smallwebrtc_properties
...
Allowing to update smallwebrtc and whatsapp properties.
2025-10-07 17:57:26 -03:00
Mark Backman
1851fed22e
Merge pull request #2806 from pipecat-ai/mb/deprecate-play-ht
...
Deprecate PlayHT TTS services
2025-10-07 16:44:53 -04:00
Mark Backman
eddce460da
Deprecate PlayHT TTS services
2025-10-07 16:40:01 -04:00
Filipi Fuchter
da4f30cb6d
Allowing to update smallwebrtc and whatsapp properties.
2025-10-07 17:28:14 -03:00
Mark Backman
250cf2d8f1
Merge pull request #2803 from pipecat-ai/mb/fix-11labs-stt-deprecation
...
Remove deprecation warning for ElevenLabsSTTService
2025-10-07 13:04:12 -04:00
Mark Backman
7bbdb4f991
Remove deprecation warning for ElevenLabsSTTService
2025-10-07 12:32:32 -04:00
Mark Backman
051c4782fb
Merge pull request #2802 from pipecat-ai/mb/fix-aws-nova-sonic
...
Fix AWS Nova Sonic authentication
2025-10-07 10:46:03 -04:00
Mark Backman
b1ccec74b2
Fix AWS Nova Sonic authentication
2025-10-07 09:48:18 -04:00
Filipi da Silva Fuchter
92bf0d9eda
Merge pull request #2794 from pipecat-ai/filipi/verifying_whatsapp_signature
...
Verifying WhatsApp signature to ensure the webhook request is from WhatsApp.
2025-10-07 08:57:47 -03:00
Aleix Conchillo Flaqué
f985550441
Merge pull request #2796 from pipecat-ai/aleix/fix-interruption-strategies-context-order
...
fix context order when using interruption strategies
2025-10-06 22:46:31 -07:00
Aleix Conchillo Flaqué
de8ee96927
fix context order when using interruption strategies
2025-10-06 22:43:01 -07:00
Aleix Conchillo Flaqué
2576d0f340
Merge pull request #2792 from pipecat-ai/aleix/google-nano-banana
...
GoogleLLMService: added support for image generation
2025-10-06 22:42:14 -07:00
ivaaan
f38f4711ac
wip
2025-10-06 20:24:27 -07:00
ivaaan
c2f3ddd329
add RTVI to Hume
2025-10-06 19:41:31 -07:00
ivaaan
73ffe96228
add RTVI to Hume
2025-10-06 19:37:05 -07:00
Aleix Conchillo Flaqué
bd13a80da7
pyproject: update google dependencies
2025-10-06 17:38:08 -07:00
Aleix Conchillo Flaqué
312959f97e
GoogleLLMService: update default model to gemini-2.5-flash
2025-10-06 17:38:08 -07:00
Aleix Conchillo Flaqué
fe168e3c68
GoogleLLMService: added support for image generation
2025-10-06 17:38:08 -07:00
Filipi Fuchter
28929a47f7
Verifying WhatsApp signature to ensure the webhook request is from WhatsApp.
2025-10-06 16:16:59 -03:00
Mark Backman
03f5defbc3
Merge pull request #2793 from pipecat-ai/mb/fix-flux-deprecation
...
Fix: Resolve Flux deprecation warning
2025-10-06 12:07:27 -04:00
Mark Backman
b216648315
Fix: Resolve Flux deprecation warning
2025-10-06 09:55:02 -04:00
Mark Backman
084b133a01
Merge pull request #2790 from pipecat-ai/add-security-md
...
Add SECURITY.md
2025-10-06 09:45:02 -04:00
Mark Backman
e589876176
Merge pull request #2786 from pipecat-ai/mb/nltk-download-error
...
Catch PermissionError when NLTK data can't be downloaded
2025-10-06 09:27:22 -04:00
Vanessa Pyne
a826313bf9
Add SECURITY.md
2025-10-05 13:24:47 -05:00
Mark Backman
49f44aa7c8
Catch PermissionError when NLTK data can't be downloaded
2025-10-04 08:41:32 -04:00
Mark Backman
64ceef9cf0
Merge pull request #2783 from pipecat-ai/mb/community-integrations-submission
...
Update to Community Integrations submission process
2025-10-03 12:41:13 -04:00
Mark Backman
cd6567c1f1
Update to Community Integrations submission process
2025-10-03 12:15:48 -04:00
Mark Backman
ac67ca1555
Merge pull request #2778 from pipecat-ai/mb/hume-cleanup
...
Tidying up the Hume example and service
2025-10-03 11:09:18 -04:00
mattie ruth backman
8d38994756
Transports now send InputTransportMessageFrames (not Urgent Frames)
2025-10-03 09:47:44 -04:00
LucasStringPay
607e3040d4
Ignore None 'completion_tokens' or similar
...
Similar as 144ea36c81 , reported in https://github.com/pipecat-ai/pipecat/issues/2207
2025-10-02 15:16:11 -07:00
Mark Backman
60604a9449
Tidying up the Hume example and service
2025-10-02 17:34:40 -04:00
Aleix Conchillo Flaqué
4abe4a6253
Merge pull request #2777 from pipecat-ai/aleix/readme-mention-tail
...
README: add tail terminal dashboard
2025-10-02 14:31:26 -07:00
Aleix Conchillo Flaqué
4c054af17b
README: remove setup editor instructions
2025-10-02 14:30:31 -07:00
Aleix Conchillo Flaqué
dcba940d42
README: add tail terminal dashboard
2025-10-02 14:27:55 -07:00
Mark Backman
ad2adb0c58
Merge pull request #2518 from zgreathouse/hume-tts-service
...
Hume tts service
2025-10-02 17:26:39 -04:00
ivaaan
76923010b5
upd Hume version to 2
2025-10-02 13:57:07 -07:00
ivaaan
1b511557b2
upd evals
2025-10-02 13:48:30 -07:00
ivaaan
fdadb12933
upd Changelog
2025-10-02 13:46:22 -07:00
ivaaan
f1bbb7ba22
Regenerate uv.lock after resolving merge conflicts
2025-10-02 13:44:07 -07:00
ivaaan
c1492c5275
fixes based on markbackman review
2025-10-02 13:38:36 -07:00
ivaaan
4ffdabcfde
add Hume example, small fixes
2025-10-02 13:38:36 -07:00
zach
b489de2fc3
adds hume tts service
2025-10-02 13:38:05 -07:00
zach
d9656cbb1a
add hume sdk for hume tts service
2025-10-02 13:38:05 -07:00
zach
05fb223985
Add hume to .env.example
2025-10-02 13:34:37 -07:00
Mark Backman
62a5f07ad2
Merge pull request #2701 from pipecat-ai/mb/third-party-integrations
...
Add a third-party integrations guide
2025-10-02 15:59:38 -04:00
Mark Backman
b669e3a481
Update name to Community Integrations and streamline guide
2025-10-02 15:54:04 -04:00
Mark Backman
99f1041a47
More review fixes
2025-10-02 14:48:12 -04:00
Mark Backman
37b1345bfa
Changes from review feedback
2025-10-02 14:48:12 -04:00
Mark Backman
8994ac17eb
Add a third-party integrations guide
2025-10-02 14:48:12 -04:00
Mark Backman
63bc825008
Merge pull request #2771 from pipecat-ai/mb/update-publish-workflows
...
Updates to publish workflows
2025-10-02 12:35:43 -04:00
Mark Backman
e7ffde1c4c
Merge pull request #2774 from pipecat-ai/mb/docs-fixes-0.0.87
...
Fix: Resolve docstring build issues before 0.0.87 release
2025-10-02 12:34:27 -04:00
Mark Backman
1c88565725
Merge pull request #2772 from pipecat-ai/mb/fix-openai-realtime-import
...
Fix: Change import for OpenAIRealtimeLLMContext in OpenAIRealtimeLLMS…
2025-10-02 12:34:16 -04:00
Aleix Conchillo Flaqué
07a6c2fb0e
Merge pull request #2775 from pipecat-ai/aleix/pipecat-0.0.87
...
update CHANGELOG for 0.0.87
2025-10-02 09:12:41 -07:00
Aleix Conchillo Flaqué
e99f3bf75a
update CHANGELOG for 0.0.87
2025-10-02 09:11:30 -07:00
Mark Backman
f09d780413
Fix: Resolve docstring build issues before 0.0.87 release
2025-10-02 10:09:25 -04:00
Mark Backman
e370d23374
Fix: Change import for OpenAIRealtimeLLMContext in OpenAIRealtimeLLMService
2025-10-02 09:39:44 -04:00
Mark Backman
b68ec14146
Updates to publish workflows
2025-10-02 08:25:35 -04:00
Filipi da Silva Fuchter
c567fd71b1
Merge pull request #2747 from pipecat-ai/filipi/whatsapp_runner
...
Creating the whatsapp routes inside the runner.
2025-10-01 21:21:34 -03:00
Filipi da Silva Fuchter
2ca1b2d6f8
Merge pull request #2612 from pipecat-ai/filipi/deepgram_flux
...
Integrating the new Deepgram model (Flux) with Pipecat
2025-10-01 21:20:47 -03:00
Mark Backman
04041a9a9a
Merge pull request #2757 from pipecat-ai/hush/retryTimeout
...
Fix AWS Bedrock timeout exception handling
2025-10-01 19:08:09 -04:00
Aleix Conchillo Flaqué
6c498dc70f
Merge pull request #2745 from pipecat-ai/aleix/transport-message-frames-deprecations
...
transport message frames deprecations
2025-10-01 16:05:55 -07:00
James Hush
32b07c1720
Fix AWS Bedrock timeout exception handling
...
- Use ReadTimeoutError and asyncio.TimeoutError which are the actual exceptions thrown by boto3
2025-10-01 19:04:35 -04:00
Aleix Conchillo Flaqué
ad507ce23d
FrameLogger: it's fine to print transport messages
2025-10-01 16:00:42 -07:00
Aleix Conchillo Flaqué
be562cedfc
DailyTransport: deprecate DailyTransportMessage(Urgent)Frame
2025-10-01 16:00:42 -07:00
Aleix Conchillo Flaqué
089e703e1f
LiveKitTransport: deprecate LiveKitTransportMessage(Urgent)Frame
2025-10-01 16:00:42 -07:00
Aleix Conchillo Flaqué
4dc1e15a99
frames: use OutputTransportMessage(Urgent)Frame instead of TransportMessage(Urgent)Frame
2025-10-01 16:00:42 -07:00
Aleix Conchillo Flaqué
c7dc2e886f
frames: use InputTransportMessageFrame instead of InputTransportMessageUrgentFrame
...
By default, input frames are already urgent.
2025-10-01 15:30:45 -07:00
Filipi Fuchter
11bc4ea854
Adding deepgram flux to release evals.
2025-10-01 19:24:58 -03:00
Mark Backman
029d76033d
Merge pull request #2765 from pipecat-ai/mb/remove-daily-logging-04a
...
Remove DailyLogLevel from 04a example
2025-10-01 17:52:33 -04:00
Aleix Conchillo Flaqué
924d7dea9a
Merge pull request #2766 from pipecat-ai/aleix/rtvi-properly-deprecate-errors-enabled
...
RTVIParams: properly deprecate errors_enabled
2025-10-01 14:49:12 -07:00
Aleix Conchillo Flaqué
244e94f3ce
RTVIParams: properly deprecate errors_enabled
2025-10-01 14:30:41 -07:00
Mark Backman
af1f51d49e
Remove DailyLogLevel from 04a example
2025-10-01 17:06:35 -04:00
Filipi da Silva Fuchter
9ba3c168b8
Merge pull request #2756 from pipecat-ai/filipi/esp32
...
SDP munging fixes.
2025-10-01 16:05:47 -03:00
Filipi Fuchter
e6ee8f7a16
New example using DeepgramFluxSTTService.
2025-10-01 15:43:25 -03:00
Filipi Fuchter
2ea2bd99e0
Deepgram Flux speech-to-text service implementation.
2025-10-01 15:43:09 -03:00
Filipi Fuchter
0c2ced7c52
Created WebsocketSTTService base class.
2025-10-01 15:42:56 -03:00
Filipi Fuchter
fb160646b8
Fixing the SDP munging to keep it working on Chrome.
2025-10-01 14:18:39 -03:00
Filipi da Silva Fuchter
89fed57af2
Merge pull request #2748 from pipecat-ai/filipi/remove_smallwebrtc_queue
...
Removing the message queue inside the SmallWebRTCConnection.
2025-10-01 08:07:47 -03:00
Aleix Conchillo Flaqué
feae3b6d2d
Merge pull request #2742 from pipecat-ai/aleix/deprecate-daily-update-remote-participants-frame
...
DailyTransport: deprecated DailyUpdateRemoteParticipantsFrame
2025-09-30 16:27:34 -07:00
Aleix Conchillo Flaqué
92d3be8975
DailyTransport: deprecated DailyUpdateRemoteParticipantsFrame
2025-09-30 16:26:48 -07:00
Aleix Conchillo Flaqué
0f53e1db2c
Merge pull request #2759 from pipecat-ai/aleix/dont-cancel-if-finished
...
PipelineTask: avoid cancellation if application is finished
2025-09-30 16:21:16 -07:00
Aleix Conchillo Flaqué
d398e8cc10
Merge pull request #2761 from pipecat-ai/aleix/rtvi-tail-updates
...
RTVI updates: audio levels and system logs
2025-09-30 13:55:17 -07:00
Aleix Conchillo Flaqué
e5f263d380
update CHANGELOG
2025-09-30 13:51:35 -07:00
Aleix Conchillo Flaqué
3a4c303c54
RTVIParams: add errors_enabled deprecation warnings
2025-09-30 13:49:51 -07:00
Mark Backman
54a1ef47d0
Merge pull request #2758 from pipecat-ai/mb/claude-sonnet-4.5
...
Update AnthropicLLMService to use claude-sonnet-4-5-20250929
2025-09-30 16:42:47 -04:00
Aleix Conchillo Flaqué
149ffa4f3c
RTVIObserver: add support system logs
2025-09-30 13:42:40 -07:00
Aleix Conchillo Flaqué
e5465034d9
RTVIObserver: add support for user/bot audio levels
2025-09-30 13:41:26 -07:00
Aleix Conchillo Flaqué
568c7c782d
rtvi: allow None RTVIProcessor and rename to send_rtvi_message()
2025-09-30 13:35:27 -07:00
Aleix Conchillo Flaqué
9851334221
rtvi: deprecate errors_enabled and always send errors
2025-09-30 13:31:30 -07:00
Aleix Conchillo Flaqué
e79c4fc99d
PipelineTask: avoid cancellation if application is finished
2025-09-30 13:18:25 -07:00
Aleix Conchillo Flaqué
55c321f4ff
Merge pull request #2751 from pipecat-ai/aleix/nova-sonic-disconnect-fix
...
AWSNovaSonicLLMService: add missing await
2025-09-30 13:12:22 -07:00
kompfner
a14a53a005
Merge pull request #2735 from pipecat-ai/pk/remove-openaillmcontext-usage
...
Remove remaining usage of `OpenAILLMContext` throughout the codebase …
2025-09-30 10:09:25 -04:00
Mark Backman
a71f937e8f
Update AnthropicLLMService to use claude-sonnet-4-5-20250929
2025-09-30 08:49:30 -04:00
Filipi Fuchter
032032df65
Only remove ESP32 ICE candidates if host is defined.
2025-09-29 15:42:23 -03:00
Mark Backman
d0178edad0
Merge pull request #2753 from pipecat-ai/mb/quickstart-0.0.86
...
Quickstart: Update to 0.0.86, removing pytorch requirements
2025-09-29 09:43:33 -04:00
Mark Backman
795c5e55d9
Quickstart: Update to 0.0.86, removing pytorch requirements
2025-09-27 08:30:37 -04:00
Aleix Conchillo Flaqué
8f8d8ae0d8
AWSNovaSonicLLMService: add missing await
2025-09-26 15:58:05 -07:00
Vanessa Pyne
741f192d04
Merge pull request #2096 from pipecat-ai/vp-mcp-ex-nit
...
mcp examples: check for env vars needed for examples
2025-09-26 10:21:22 -05:00
Filipi Fuchter
a5595b82ea
removing the message queue inside the SmallWebRTCConnection.
2025-09-26 11:02:17 -03:00
Filipi Fuchter
4d1915eb41
Fixing ruff format.
2025-09-26 10:49:52 -03:00
Filipi Fuchter
b3a84fc772
Refactoring how we are handling the lifespan inside the runner.
2025-09-26 10:47:04 -03:00
Filipi Fuchter
403d22e62c
Creating the whatsapp routes inside the runner.
2025-09-26 10:28:19 -03:00
Aleix Conchillo Flaqué
ee00ee5c57
Merge pull request #2744 from pipecat-ai/aleix/vad-analyzer-thread-executor
...
BaseInputTransport: create VAD thread in VADAnalyzer
2025-09-25 13:43:34 -07:00
Aleix Conchillo Flaqué
f53fd880dc
BaseInputTransport: create VAD thread in VADAnalyzer
...
We move the thread creation to the VADAnalyzer instead of the input
transport. This can potentially be useful if we need to analyze multiple audio
streams.
2025-09-25 13:41:20 -07:00
Aleix Conchillo Flaqué
de3461e4cc
Merge pull request #2743 from pipecat-ai/aleix/turn-analyzer-fixes
...
turn analyzer fixes
2025-09-25 13:40:43 -07:00
Aleix Conchillo Flaqué
7bafc3a1bb
BaseSmartTurn: process speech in a separate thread
2025-09-25 13:37:28 -07:00
Aleix Conchillo Flaqué
22ef61fe8d
BaseTurnAnalyzer: add BaseTurnParams base class for parameters
2025-09-25 13:37:09 -07:00
Aleix Conchillo Flaqué
7078fb53bd
Merge pull request #2738 from pipecat-ai/aleix/openai-cached-tokens-metrics
...
BaseOpenAILLMService: include cached tokens to metrics frame
2025-09-25 13:36:03 -07:00
Aleix Conchillo Flaqué
33447ad6f2
BaseOpenAILLMService: include cached tokens to metrics frame
2025-09-24 19:32:16 -07:00
Paul Kompfner
6faa50ae5b
Remove remaining usage of OpenAILLMContext throughout the codebase in favor of LLMContext, except for:
...
- Usage in classes that are already deprecated
- Usage related to realtime LLMs, which don't yet support `LLMContext`
- Usage in (soon-to-be-deprecated) code paths related to `OpenAILLMContext` itself and associated machinery
2025-09-24 16:35:03 -04:00
vipyne
889dc19a27
mcp examples: check for env vars needed for examples
2025-09-19 12:09:50 -05:00