James Hush
fd6379cb6a
Remove extra
2025-05-02 11:22:12 +08:00
James Hush
7618d7511a
Revert line
2025-05-02 11:21:01 +08:00
James Hush
9ca7bad978
Remove extra
2025-05-02 11:20:19 +08:00
James Hush
10289c1f1c
Remove extra code
2025-05-02 11:18:44 +08:00
James Hush
213d5d6abc
THIS WORKS.
2025-05-02 11:14:56 +08:00
James Hush
01f37f769d
Comment
2025-05-02 10:02:04 +08:00
James Hush
52b393537a
RTVI simple chat example
2025-05-02 10:02:01 +08:00
Mark Backman
a6a4d3d71f
Merge pull request #1706 from rimelabs/matt/rime/update_url
...
[Rime] - Update url for Websockets API
2025-04-30 19:14:04 -04:00
Mark Backman
c52de0f5de
Merge pull request #1696 from pipecat-ai/mb/fix-gemini-live-context
...
Fix: GeminiMultimodalLiveLLMService was appending tokens to the context
2025-04-30 19:12:06 -04:00
Mark Backman
a1e1255f16
Strip newlines from generated user transcript
2025-04-30 18:27:46 -04:00
Mark Backman
c4f758725e
Ignore TranscriptionFrames too
2025-04-30 18:22:43 -04:00
Aleix Conchillo Flaqué
7bc9a78ce6
udpate CHANGELOG with RTVIObserverParams
2025-04-30 15:13:14 -07:00
Aleix Conchillo Flaqué
f8be71b32c
Merge pull request #1688 from pipecat-ai/aleix/add-rtvi-observer-params
...
RTVIObserver: add RTVIObserverParams to configure what to send
2025-04-30 15:11:18 -07:00
Aleix Conchillo Flaqué
957fa5546d
RTVIObserver: add RTVIObserverParams to configure what to send
2025-04-30 15:09:02 -07:00
Aleix Conchillo Flaqué
039cb8fcae
Merge pull request #1690 from pipecat-ai/aleix/rtvi-function-call-single-param
...
RTVIProcessor: use single FunctionCallParams
2025-04-30 15:04:05 -07:00
Mark Backman
8e05f2f1a1
Merge pull request #1702 from pipecat-ai/mb/stt-mute-transcription-frames
...
Add InterimTranscriptionFrame and TranscriptionFrame to STTMuteFilter…
2025-04-30 17:54:24 -04:00
Matt Kim
8467aa1ed3
[Rime] - Update url for Websockets API
...
Rime has migrated their Websockets api to the base url `user.rime.ai` along with all other tts endpoints.
See the [docs](https://docs.rime.ai/api-reference/endpoint/websockets )
`users-ws.rime.ai` is deprecated and will not reflect upgrades to the rime ws api.
2025-04-30 14:20:13 -07:00
Mark Backman
9c5878af3d
OpenAI Realtime and Gemini Live push LLMTextFrame again, overwrite the assitant context aggregator for LLMTextFrame
2025-04-30 17:18:20 -04:00
Mark Backman
ef29800fe9
Update the changelog
2025-04-30 16:28:17 -04:00
Mark Backman
7e09933070
OpenAI Realtime should push TTSTextFrame only
2025-04-30 16:28:17 -04:00
Mark Backman
82a9d7f992
Gemini Mulitmodal Live to push TTSTextFrame only
2025-04-30 16:28:17 -04:00
Mark Backman
facbebb15f
Transcribe user audio in 26b
2025-04-30 16:28:16 -04:00
Mark Backman
2ba60fc41f
Update TranscriptProcessor to handle GeminiMultimodalLiveLLMService changes
2025-04-30 16:28:16 -04:00
Mark Backman
685f951ae2
Fix: GeminiMultimodalLiveLLMService was appending tokens to the context
2025-04-30 16:28:16 -04:00
Mark Backman
27d4c927a8
Merge pull request #1701 from pipecat-ai/mb/gemini-extend-session
...
Add context_window_compression support to GeminiMultimodalLiveLLMService
2025-04-30 14:35:50 -04:00
Mark Backman
20a59e8c56
Add InterimTranscriptionFrame and TranscriptionFrame to STTMuteFilter frame processing
2025-04-30 10:50:56 -04:00
Mark Backman
d9a0a93667
Add context_window_compression support to GeminiMultimodalLiveLLMService
2025-04-30 09:55:34 -04:00
Mark Backman
154d5d1859
Merge pull request #1699 from pipecat-ai/mb/more-docs-mocks
...
Additional import mocks to fix docs failure
2025-04-30 08:36:57 -04:00
Mark Backman
a192217256
Additional import mocks to fix docs failure
2025-04-29 21:45:27 -04:00
Mark Backman
10cdc47e05
Merge pull request #1689 from pipecat-ai/mb/handle-http-smart-turn-errors
...
Handle case where Fal Smart Turn returns a 500 error
2025-04-29 14:21:45 -04:00
Mark Backman
2b4d41a548
Merge pull request #1693 from flixoflax/bump-modal-example-dependencies
...
Bump modal deployment example dependencies
2025-04-29 14:21:31 -04:00
Filipi da Silva Fuchter
962f8062a5
Merge pull request #1581 from pipecat-ai/voice_agent_ice_servers
...
Configuring the voice-agent example to wait for all the ice candidates.
2025-04-29 13:30:12 -03:00
Filipi Fuchter
d80d385b2f
Adding a section explaining about ice servers.
2025-04-29 12:19:59 -03:00
Filipi Fuchter
b347ca472f
Checking the state again to avoid any eventual race condition
2025-04-29 12:04:20 -03:00
Filipi Fuchter
c3c4952abf
Reducing the timeout to 2 seconds for gathering the ice candidates.
2025-04-29 11:34:58 -03:00
Filipi Fuchter
f369ab4c1a
Printing each new ice candidate.
2025-04-29 11:23:33 -03:00
flixoflax
62b41c6789
removed aiohttp from deps, and version constraint from pipecat-ai
2025-04-29 16:14:51 +02:00
Filipi da Silva Fuchter
2872bc7902
Merge pull request #1587 from pipecat-ai/improving_ice_servers
...
Updated `SmallWebRTCConnection` to support `ice_servers` with credentials.
2025-04-29 11:07:32 -03:00
Filipi Fuchter
9658b75a10
Configuring the voice-agent example to use ice-servers and wait for all the ice candidates.
2025-04-29 10:52:07 -03:00
flixoflax
63de9039e6
Bumb modal deployment example deps
2025-04-29 15:50:56 +02:00
Filipi Fuchter
9352396d7e
Mentioning the new feature in the changelog.
2025-04-29 10:40:09 -03:00
Filipi Fuchter
d1ab1d38b7
Fixing the examples to use the new IceServer structure.
2025-04-29 10:33:19 -03:00
Filipi Fuchter
080f70d91c
Allowing to define the username and credential for the ice servers.
2025-04-29 10:32:42 -03:00
Mark Backman
ebed1fc6ea
Restructure _send_raw_request to raise errors early, then process the successful response
2025-04-29 09:05:14 -04:00
Aleix Conchillo Flaqué
6821b1cdab
RTVIProcessor: use single FunctionCallParams
2025-04-29 05:56:23 -07:00
Mark Backman
144ae9b611
Handle case where Fal Smart Turn returns a 500 error
2025-04-29 08:53:02 -04:00
Aleix Conchillo Flaqué
a2e7331ce2
Merge pull request #1680 from pipecat-ai/aleix/local-input-select-stt-update
...
examples: update local-input-select-stt
2025-04-28 12:01:12 -07:00
Aleix Conchillo Flaqué
8accd3e387
Merge pull request #1681 from pipecat-ai/aleix/tts-service-llm-full-response-end-fix
...
TTSService: do not push LLMFullResponseEndFrame if not needed
2025-04-28 12:00:55 -07:00
Mark Backman
3d05a74dc0
Merge pull request #1678 from Dev-Khant/integrate-mem0-oss
...
Integrate with Mem0 OSS
2025-04-28 14:15:40 -04:00
Aleix Conchillo Flaqué
e3c965f4d5
TTSService: do not push LLMFullResponseEndFrame if not needed
2025-04-28 11:03:22 -07:00
Dev-Khant
5354e5d891
formatting
2025-04-28 22:57:18 +05:30
Aleix Conchillo Flaqué
5784e91cff
update .gitignore
2025-04-28 09:34:45 -07:00
Aleix Conchillo Flaqué
bc5f098aaa
examples: update local-input-select-stt
2025-04-28 09:28:26 -07:00
Aleix Conchillo Flaqué
93534b4692
Merge pull request #1679 from pipecat-ai/aleix/update-package-lock-apr-28
...
examples: update package-lock.json
2025-04-28 09:27:01 -07:00
Aleix Conchillo Flaqué
b23d54c609
examples: update package-lock.json
2025-04-28 09:15:14 -07:00
Dev-Khant
aa23a7b1e6
update changelog
2025-04-28 20:03:20 +05:30
Dev-Khant
c0c41789ab
Integrate with Mem0 OSS
2025-04-28 17:15:53 +05:30
Aleix Conchillo Flaqué
029ef4f8c2
Merge pull request #1667 from pipecat-ai/aleix/function-call-single-parameter
...
function call single parameter
2025-04-25 13:53:10 -07:00
Aleix Conchillo Flaqué
9cad6dfce9
RTVIProcessor: simplify handle_function_call() and depreacted handle_function_call_start()
2025-04-25 13:34:05 -07:00
Aleix Conchillo Flaqué
4df6444832
examples: update with single FunctionCallParams parameter
2025-04-25 13:34:05 -07:00
Aleix Conchillo Flaqué
944bc23135
LLMService: use a single FunctionCallParams parameter for function calls
2025-04-25 13:34:03 -07:00
Aleix Conchillo Flaqué
1d863ee7de
Merge pull request #1669 from pipecat-ai/aleix/short-utterances-fixes
...
short utterances fixes
2025-04-25 13:25:15 -07:00
Vanessa Pyne
9ca775d1ab
Merge pull request #1668 from pipecat-ai/vp-update-transport-params-in-ex
...
Update examples with new transport param names
2025-04-25 15:24:28 -05:00
Aleix Conchillo Flaqué
03002ad685
LLMUserContextAggregator: reduce aggregation_timeout to 0.5
2025-04-25 13:21:00 -07:00
Aleix Conchillo Flaqué
99a4154cbc
LLMUserContextAggregator: ignore short uterrances while bot speaking
2025-04-25 13:21:00 -07:00
vipyne
0f68cc182d
Update examples with new transport param names
2025-04-25 15:18:56 -05:00
Aleix Conchillo Flaqué
3ac50b9902
Merge pull request #1664 from CerebriumAI/kyle/fix-ultravox-performance
...
Improved: Ultravox performance
2025-04-25 12:59:25 -07:00
Michael Louis
9557705b53
Changed warmup to internal function
2025-04-25 14:40:19 -04:00
Mark Backman
a7718926e9
Merge pull request #1666 from pipecat-ai/mb/add-vad-start-stop-events
...
Add VADUserStartedSpeakingFrame and VADUserStoppedSpeakingFrame
2025-04-25 10:49:17 -04:00
Mark Backman
dfa10af6ed
Simplify VAD events to be detected and emitted from BaseInputTransport
2025-04-25 10:28:44 -04:00
Mark Backman
8485ea6c5e
Merge pull request #1650 from pipecat-ai/mb/fal-smart-turn-readme
...
Add hosted demo link to Fal smart turn README
2025-04-25 09:39:54 -04:00
Mark Backman
b298376766
Add VADUserStartedSpeakingFrame and VADUserStoppedSpeakingFrame
2025-04-25 09:06:54 -04:00
Mark Backman
7cfefe4f84
Merge pull request #1593 from WebinarGeek/wg/gladia-translations
...
Push gladia translations as a TranscriptionFrame
2025-04-25 08:35:36 -04:00
Mark Backman
71c7373987
Update CHANGELOG
2025-04-25 08:26:40 -04:00
Mark Backman
d1086914fe
Add TranslationFrame and use in GladiaSTTService; add 13c-gladia-translation.py
2025-04-25 08:25:39 -04:00
Dan Berg
2fb85941d3
Add hint installing test requirements
2025-04-25 08:24:51 -04:00
Dan Berg
be8788e4da
Push gladia translations as a TranscriptionFrame
2025-04-25 08:24:51 -04:00
Mark Backman
acb6abd761
Merge pull request #1644 from pipecat-ai/mb/add-transport-examples
...
Add transports examples to foundational examples
2025-04-25 08:22:26 -04:00
Mark Backman
a528aad957
Merge pull request #1642 from pipecat-ai/mb/foundational-requirements
...
Add deepgram and cartesia to foundational example requirements to mak…
2025-04-25 08:22:08 -04:00
Mark Backman
5c13252801
Merge pull request #1658 from pipecat-ai/mb/update-fal-smart-turn-version
...
Update daily-transport version in fal-smart-turn demo
2025-04-25 08:21:50 -04:00
Mark Backman
a4422ac6c2
Add transports examples to foundational examples
2025-04-25 08:20:04 -04:00
Mark Backman
985a031353
Merge pull request #1657 from pipecat-ai/mb/word-wrangler-example
...
Add Word Wrangler demos
2025-04-25 08:16:02 -04:00
Kyle Gani
5c4079b286
Cleaned up: layout
2025-04-25 13:18:03 +02:00
Kyle Gani
5489ac5a73
Merge branch 'main' into kyle/fix-ultravox-performance
2025-04-25 13:16:48 +02:00
Kyle Gani
49fbcc86ac
Improved: Ultravox performance
2025-04-25 13:12:08 +02:00
Aleix Conchillo Flaqué
d20c3307b9
Merge pull request #1648 from pipecat-ai/aleix/always-push-audio
...
input transports now always push audio frames
2025-04-24 18:59:09 -07:00
Aleix Conchillo Flaqué
9fd76923fd
STTService: passthrough audio frames by default
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
a753a623d4
examples: allow setting custom program arguments
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
4ee6c4b59e
BaseOutputTransport: reword camera with video
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
e79a002e5a
examples: update camera_* with video_*
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
420912dd4b
transports: deprecate TransportParams.camera_* in favor of video_*
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
de7185e8db
examples: remove vad_enabled=True
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
8bfcfe8b1d
transports: deprecate TransportParams.vad_enabled
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
26d2ce5926
examples: remove vad_audio_passthrough=True
2025-04-24 17:14:18 -07:00
Aleix Conchillo Flaqué
9ee56bff9e
transports: push audio by default. s/vad_audio_passthrough/audio_in_passthrough/
2025-04-24 17:14:16 -07:00
Vanessa Pyne
2e0d77e4f0
Merge pull request #1627 from pipecat-ai/vp-mcp-take-3
...
MCP Service
2025-04-24 18:16:19 -05:00
vipyne
4cc8a4312c
Revert "update 39* examples as per #1648 "
...
This reverts commit b29ffeef29 .
2025-04-24 18:11:35 -05:00
vipyne
cb7cb381aa
Add MCPClient changelog line
2025-04-24 18:05:20 -05:00
vipyne
b29ffeef29
update 39* examples as per #1648
2025-04-24 17:19:50 -05:00
vipyne
b7b2a5b7a1
mcp service fix and add multiple mcp example
2025-04-24 17:13:40 -05:00
vipyne
3384598e07
mcp_service: pr notes
2025-04-24 17:13:40 -05:00
vipyne
c420dbe57f
MCP Service
...
WIP getting mcp.run to work
add mcp[cli] to toml and lint
mcp stdio example
mcp sse example
mcp_run POC
ruff formatting
2025-04-24 17:13:40 -05:00
Mark Backman
fa8aafc7a5
Merge pull request #1660 from pipecat-ai/mb/foundational-examples-readme
...
Update foundational README with ToC
2025-04-24 18:12:27 -04:00
Mark Backman
4b364dda29
Update foundational README with ToC
2025-04-24 18:04:36 -04:00
Mark Backman
6bb765e40f
Update daily-transport version in fal-smart-turn demo
2025-04-24 15:06:08 -04:00
Mark Backman
c80d09f66c
Add Word Wrangler demos
2025-04-24 14:31:48 -04:00
Mark Backman
f8ff10c5d5
Add hosted demo link to Fal smart turn README
2025-04-23 21:27:53 -04:00
Mark Backman
09ff836ef6
Merge pull request #1640 from pipecat-ai/mb/fal-smart-turn-example
...
Fal Smart Turn example
2025-04-23 17:27:27 -04:00
Mark Backman
e446ecac14
Merge pull request #1649 from pipecat-ai/mb/fix-smart-turn-metrics
...
fix: SmartTurnMetricsData was reporting 0 for inference and processin…
2025-04-23 17:23:16 -04:00
Mark Backman
8c0c8a6153
Code review fixes
2025-04-23 17:22:19 -04:00
Mark Backman
70033ae00b
fix: SmartTurnMetricsData was reporting 0 for inference and processing time
2025-04-23 17:17:46 -04:00
Mark Backman
ac9dce63ae
README updates
2025-04-23 16:55:35 -04:00
Mark Backman
8b2df48fab
Fix DailyTransport bot name
2025-04-23 16:43:54 -04:00
Mark Backman
156a5690fc
Merge pull request #1647 from mattmatters/fix-typo
...
Fix wating -> waiting typo
2025-04-23 16:27:00 -04:00
Matt Lewis
d42c618398
Fix wating -> waiting typo
2025-04-23 15:55:14 -04:00
Mark Backman
b23ca5a4a8
Merge pull request #1641 from pipecat-ai/mb/11labs-input-params
...
ElevenLabs: InputParams can be set individually
2025-04-23 14:31:37 -04:00
Mark Backman
63a6697a90
ElevenLabs: InputParams can be set individually
2025-04-23 14:28:38 -04:00
Aleix Conchillo Flaqué
f1e45d0f02
Merge pull request #1646 from pipecat-ai/aleix/pipecat-0.0.65
...
update CHANGELOG for 0.0.65
2025-04-23 11:27:25 -07:00
Mark Backman
4ad227ca2d
Merge pull request #1643 from pipecat-ai/mb/gladia-keepalive
...
Add a keepalive task to GladiaSTTService
2025-04-23 14:27:15 -04:00
Aleix Conchillo Flaqué
66cc18194b
update CHANGELOG for 0.0.65
2025-04-23 11:25:32 -07:00
Mark Backman
7d65132c93
Add a keepalive task to GladiaSTTService
2025-04-23 14:21:44 -04:00
Mark Backman
db7d7a4204
Merge pull request #1645 from pipecat-ai/mb/telnyx-auto-hang-up
...
Add auto_hang_up to Telnyx serializer
2025-04-23 14:20:28 -04:00
Mark Backman
7bbac11084
Add docstrings to TelnyxFrameSerializer
2025-04-23 14:13:20 -04:00
Mark Backman
76c8322b57
Make call_sid optional in TwilioFrameSerializer
2025-04-23 14:08:50 -04:00
Mark Backman
7b1cd3523d
Twilio: send only one hangup command
2025-04-23 13:41:36 -04:00
Mark Backman
6bd821ac9a
Add auto_hang_up to Telnyx serializer
2025-04-23 13:29:54 -04:00
Mark Backman
a6d51c343e
Add deepgram and cartesia to foundational example requirements to make quickstart smoother
2025-04-23 08:47:47 -04:00
Mark Backman
1a5cf7a521
Add local run and deployment steps to README
2025-04-22 21:37:35 -04:00
Mark Backman
69491417ec
Fal Smart Turn example
2025-04-22 21:16:41 -04:00
Aleix Conchillo Flaqué
b91780ced2
Merge pull request #1638 from pipecat-ai/aleix/pipecat-0.0.64
...
update CHANGELOG for 0.0.64
2025-04-22 17:35:25 -07:00
Aleix Conchillo Flaqué
8ded666958
update CHANGELOG for 0.0.64
2025-04-22 17:32:06 -07:00
Filipi da Silva Fuchter
2490c804a5
Merge pull request #1631 from pipecat-ai/smart_turn_timeout
...
Returning the turn as complete if the request don’t return a result within SmartTurnParams stop_secs
2025-04-22 19:51:10 -03:00
Filipi Fuchter
dd8856a673
Merge branch 'main' into smart_turn_timeout
...
# Conflicts:
# dot-env.template
2025-04-22 19:49:32 -03:00
Aleix Conchillo Flaqué
e7da08dab1
move smart turn files to audio.turn.smart_turn package
2025-04-22 15:29:31 -07:00
Aleix Conchillo Flaqué
ae60d42016
s/SmartTurnAnalyzer/HttpSmartTurnAnalyzer/ and add FalSmartTurnAnalyzer
2025-04-22 15:13:12 -07:00
Aleix Conchillo Flaqué
50e8d82ece
SmartTurn: some linting cleanup
2025-04-22 14:39:02 -07:00
Mark Backman
cc9901a82f
Replace httpx with aiohttp
2025-04-22 17:14:19 -04:00
Aleix Conchillo Flaqué
1fd43e8a3f
Merge pull request #1636 from pipecat-ai/aleix/examples-logging
...
examples: always use loguru for logging
2025-04-22 13:06:40 -07:00
Aleix Conchillo Flaqué
fdc508a1a5
examples: always use loguru for logging
2025-04-22 11:51:49 -07:00
Mark Backman
37269db247
Merge pull request #1634 from pipecat-ai/mb/twilio-end-call
...
Automatically hangup Twilio calls
2025-04-22 14:05:10 -04:00
Mark Backman
51269aabbd
Added cancel method to WebsocketServerOutputTransport
2025-04-22 13:58:39 -04:00
Mark Backman
74ecc19e09
Code review feedback
2025-04-22 13:54:12 -04:00
Mark Backman
c6d48c16df
Add twilio to pyproject.toml, update demo to use twilio option
2025-04-22 13:01:56 -04:00
Mark Backman
873d84aa09
Twilio serializer to return None
2025-04-22 12:50:11 -04:00
Mark Backman
7360866c97
Add docstrings
2025-04-22 12:49:17 -04:00
Mark Backman
81f4768661
Automatically hangup Twilio calls
2025-04-22 12:45:34 -04:00
Vanessa Pyne
972d65f61b
Merge pull request #1628 from pipecat-ai/vp-typo-fixes
...
typo fixes in phone-chatbot example
2025-04-22 10:05:56 -05:00
Mark Backman
1da9d398e3
Merge pull request #1619 from pipecat-ai/mb/grok-3-beta
...
GrokLLMService uses grok-3-beta as default model
2025-04-22 10:33:32 -04:00
Filipi Fuchter
7358bc6428
Returning the turn as complete if the request don’t return a result within SmartTurnParams stop_secs
2025-04-22 10:35:14 -03:00
vipyne
a6af499f84
typo fixes in phone-chatbot example
2025-04-21 23:49:13 -05:00
Aleix Conchillo Flaqué
f9d1a53e28
Merge pull request #1609 from pipecat-ai/aleix/pyproject-py-typed
...
pyproject: fix license fields
2025-04-21 16:14:22 -07:00
Mark Backman
3f3010af79
Add a SmartTurnMetricsData class, emitted by Metrics Frame in response to smart turn responses
2025-04-21 18:56:14 -04:00
Aleix Conchillo Flaqué
a02d47ddbd
Merge pull request #1625 from 0xPatryk/patch-1
...
Fixed AttributeError: object has no attribute '_sample_rate"
2025-04-21 15:40:54 -07:00
Patryk
a649aff3e7
Fixed AttributeError: 'OpenAITTSService' object has no attribute '_sample_rate'
2025-04-21 11:03:45 +02:00
Mark Backman
a9b551d73e
GrokLLMService uses grok-3-beta as default model
2025-04-19 08:05:59 -04:00
Mark Backman
747a821943
Merge pull request #1614 from pipecat-ai/mb/changelog-for-1525
...
Add CHANGELOG entry for PR 1525
2025-04-19 07:10:13 -04:00
Aleix Conchillo Flaqué
010db3ccd5
README: minor update
2025-04-18 20:57:05 -07:00
Aleix Conchillo Flaqué
db773b8b93
Merge pull request #1616 from pipecat-ai/aleix/new-readme
...
make README more fun
2025-04-18 18:15:35 -07:00
Mark Backman
16b7bf71b4
Additional README changes
2025-04-18 21:00:57 -04:00
Aleix Conchillo Flaqué
82d19508a4
make README more fun
2025-04-18 14:37:28 -07:00
Mark Backman
dc3646f0e7
Merge pull request #1615 from pipecat-ai/mb/issue-template
...
Add issue templates and move the pull request template to .github
2025-04-18 14:58:09 -04:00
Mark Backman
62e659cd3a
Update to .yml templates so that types are used
2025-04-18 13:21:01 -04:00
Mark Backman
b2945f44fd
Add issue templates and move the pull request template to .github
2025-04-18 12:17:46 -04:00
Mark Backman
618fbef81c
Add CHANGELOG entry for PR 1525
2025-04-18 11:32:34 -04:00
Mark Backman
70c42dfa6e
Merge pull request #1525 from shaiyon/google-default-creds
...
Enable usage of Application Default Credentials in Google services
2025-04-18 11:31:08 -04:00
Mark Backman
9ab374dd1f
Merge pull request #1612 from pipecat-ai/mb/07g-stt-model
...
examples: Fix 07g by changing STT model
2025-04-18 08:04:20 -04:00
Mark Backman
cc6d284417
examples: Fix 07g by changing STT model
2025-04-18 07:13:34 -04:00
Filipi da Silva Fuchter
f77d8f0b6f
Merge pull request #1611 from pipecat-ai/smart_turn_changelog
...
Mentioning the Smart Turn Detection into the changelog.
2025-04-17 23:02:57 -03:00
Varun Singh
9c0beb05cf
Merge pull request #1597 from pipecat-ai/vr000m-opus-added
...
Changing default codec to OPUS for telephony
2025-04-17 18:42:12 -07:00
Aleix Conchillo Flaqué
858981c404
Merge pull request #1610 from pipecat-ai/aleix/add-base-turn-analyzer
...
audio: add BaseTurnAnalyzer class
2025-04-17 18:38:08 -07:00
Aleix Conchillo Flaqué
9eed225aa2
audio: add BaseTurnAnalyzer class
2025-04-17 18:37:52 -07:00
Filipi Fuchter
9f7371e485
Mentioning the Smart Turn Detection into the changelog.
2025-04-17 22:31:40 -03:00
Aleix Conchillo Flaqué
d77c37ff14
pyproject: add py.typed (PEP 561)
2025-04-17 17:29:04 -07:00
Aleix Conchillo Flaqué
b4916f9dae
pyproject: fix license fields
2025-04-17 17:28:14 -07:00
Aleix Conchillo Flaqué
004a920920
Merge pull request #1563 from Bnowako/packaging-type-information
...
Add marker file for static type checkers
2025-04-17 17:26:15 -07:00
Filipi da Silva Fuchter
203c5a3a60
Merge pull request #1592 from pipecat-ai/smart_turn
...
Smart turn
2025-04-17 18:21:47 -03:00
Filipi Fuchter
7f6fb1754b
Merge remote-tracking branch 'origin/smart_turn' into smart_turn
2025-04-17 17:53:53 -03:00
Filipi Fuchter
a390ce13a4
Removing the UserEndOfTurnFrame
2025-04-17 17:53:31 -03:00
Filipi da Silva Fuchter
61d31d1c40
Restoring stop_secs to default value.
...
Co-authored-by: Mark Backman <mark@daily.co >
2025-04-17 17:44:47 -03:00
Filipi da Silva Fuchter
e872ff943a
Using the default model for OpenAi.
...
Co-authored-by: Mark Backman <mark@daily.co >
2025-04-17 17:43:39 -03:00
Filipi da Silva Fuchter
c71005e249
Using the default model for OpenAi.
...
Co-authored-by: Mark Backman <mark@daily.co >
2025-04-17 17:43:23 -03:00
Filipi Fuchter
6e06bf97c0
Preventing emitting the UserStartedSpeaking event multiple times.
2025-04-17 17:21:29 -03:00
Filipi Fuchter
a80dc94e91
Fixing ruff format.
2025-04-17 16:47:17 -03:00
Filipi Fuchter
3ea9cfd251
Keeping the _speech_triggered as true if the state is incomplete.
2025-04-17 16:46:15 -03:00
Filipi Fuchter
a80f82cdb6
Moving the environment variables to inside the demo.
2025-04-17 16:28:50 -03:00
Aleix Conchillo Flaqué
d24bab354f
Merge pull request #1607 from pipecat-ai/aleix/fix-websocket-disconnects
...
services: fix TTS websocket services disconnections
2025-04-17 12:27:52 -07:00
Filipi Fuchter
53ee3fb64c
Changing the log levels used in smart_turn
2025-04-17 16:14:13 -03:00
Filipi Fuchter
3599761e4e
Changing the default behavior to only use the last vad segment, and increasing the default stop_secs to 3
2025-04-17 16:07:03 -03:00
Aleix Conchillo Flaqué
c0b3fe3985
services: only read from TTS websocket if websocket connection established
2025-04-17 11:54:07 -07:00
Aleix Conchillo Flaqué
497d48b6c8
services: fix TTS websocket services disconnections
...
Fixes #1467
2025-04-17 11:29:49 -07:00
Filipi Fuchter
e179916c9c
Creating a new param use_only_last_vad_segment
2025-04-17 11:49:51 -03:00
Filipi Fuchter
b0b38beb19
Returning the max duration back to 8 seconds.
2025-04-17 11:39:48 -03:00
Filipi Fuchter
8577139d21
Fixing to keep the last max samples.
2025-04-17 11:39:06 -03:00
Filipi Fuchter
e2fbbb4b40
Renaming the smart turn classes.
2025-04-17 10:43:21 -03:00
Filipi Fuchter
88ce117e84
Changing the max duration default value to 16 seconds.
2025-04-17 10:35:13 -03:00
Filipi Fuchter
266537c3f4
Fixing to respect the stop_secs.
2025-04-17 10:07:08 -03:00
Filipi Fuchter
230d2f80fa
Merge branch 'main' into smart_turn
2025-04-17 09:36:30 -03:00
Filipi Fuchter
3f0688aefa
Testing smart turn using stop_secs as 5 seconds
2025-04-17 09:36:03 -03:00
Filipi da Silva Fuchter
5be3e6979e
Merge pull request #1533 from pipecat-ai/daily_small_webrtc
...
Example interoping between SmallWebRTC and Daily
2025-04-17 09:19:23 -03:00
Mark Backman
9c19cff818
Merge pull request #1585 from ArmanJR/main
...
Troubleshooting SSL error
2025-04-16 22:46:45 -04:00
Mark Backman
95f3537bde
Merge pull request #1598 from pipecat-ai/mb/11labs-http-timestamps
...
Added word/timestamp pairs to ElevenLabsHttpTTSService
2025-04-16 22:38:26 -04:00
Mark Backman
7ff748defd
Merge pull request #1600 from pipecat-ai/mb/11labs-previous-text
...
Add previous_text context to ElevenLabsHttpTTSService
2025-04-16 22:33:38 -04:00
Mark Backman
2dafbee2aa
Code review fixes
2025-04-16 22:29:33 -04:00
Mark Backman
1e0a9d7b06
Add previous_text context to ElevenLabsHttpTTSService
2025-04-16 22:22:08 -04:00
Mark Backman
4a23e138b1
Added word/timestamp pairs to ElevenLabsHttpTTSService
2025-04-16 22:20:51 -04:00
Mark Backman
384f80983f
Added word/timestamp pairs to ElevenLabsHttpTTSService
2025-04-16 21:55:00 -04:00
Aleix Conchillo Flaqué
f6f01ea7e4
Merge pull request #1588 from pipecat-ai/aleix/llm-aggregator-params
...
LLM aggregator params
2025-04-16 15:25:21 -07:00
Aleix Conchillo Flaqué
f385cc0460
pyproject: add websockets as google dependency
2025-04-16 15:19:25 -07:00
Aleix Conchillo Flaqué
e97de43de2
add LLMUserAggregatorParams and LLMAssistantAggregatorParams
2025-04-16 15:19:19 -07:00
Aleix Conchillo Flaqué
8299c96ad4
Merge pull request #1603 from pipecat-ai/aleix/deepgram-tavus-fixes
...
deepgram/tavus fixes
2025-04-16 14:55:45 -07:00
Aleix Conchillo Flaqué
e9af585edd
DeepgramTTSService: re-add base_url to constructor
2025-04-16 14:54:02 -07:00
Aleix Conchillo Flaqué
31f7082d12
DeepgramTTSService: use Deepgram's asyncrest instead of asyncio.to_thread
2025-04-16 14:40:59 -07:00
Aleix Conchillo Flaqué
6cea71270e
tts: use smaller audio chunk sizes
2025-04-16 14:40:59 -07:00
Aleix Conchillo Flaqué
d05b2d0e8d
TavusVideoService: fix rate limiting and max size
2025-04-16 14:40:59 -07:00
Filipi Fuchter
a458c1e92b
Improving the README and fixing the env.example
2025-04-16 18:38:48 -03:00
Filipi Fuchter
5bbf1d0209
Example interoping between SmallWebRTC and Daily.
2025-04-16 17:14:12 -03:00
Mark Backman
235cd9cecc
Merge pull request #1586 from rahultayal22/rah_google_vertex_issue
...
Fixed params issue in Google Vertex ai
2025-04-16 14:56:46 -04:00
Mark Backman
829f3ed2db
Merge pull request #1601 from pipecat-ai/mb/eject-at-exp-token
...
Add eject_at_token_exp to Daily REST helpers, modify default values
2025-04-16 14:54:41 -04:00
Rahul Tayal
ac64f0ba91
Run ruff on code
2025-04-16 23:19:09 +05:30
Rahul Tayal
ce41a7585b
Resolved comment to update change log
2025-04-16 22:24:25 +05:30
Mark Backman
ce92dfb5ec
Add eject_at_token_exp to Daily REST helpers, modify default values
2025-04-16 12:26:33 -04:00
Mark Backman
ee132a2188
Merge pull request #1596 from pipecat-ai/mb/gpt-4.1
...
Update services and examples to use gpt-4.1 by default
2025-04-16 08:37:48 -04:00
Mark Backman
5f3bbf9828
Rely on default OpenAI model for examples and tests
2025-04-16 08:33:34 -04:00
Mark Backman
55d1d81430
Merge pull request #1595 from pipecat-ai/mb/rtvi-start-convo
...
Update client/server demos to kick off conversation in on_client_read…
2025-04-16 08:23:16 -04:00
Filipi Fuchter
8e36bdbed7
Adding some comments to the code.
2025-04-16 09:11:27 -03:00
Filipi Fuchter
cd8bd7f487
Adding some comments to the code.
2025-04-16 08:58:40 -03:00
Filipi Fuchter
5fa47b7a5c
Adding the dependencies for the remote smart turn
2025-04-16 08:45:01 -03:00
Filipi Fuchter
616961b487
Stop removing segments from the end
2025-04-16 08:04:38 -03:00
Filipi Fuchter
650d4d9ee2
Changing the start speech time and adding logs.
2025-04-16 07:55:20 -03:00
Filipi Fuchter
2627cb6bf2
Allowing to define SmartTurnParams
2025-04-16 07:13:13 -03:00
Filipi Fuchter
0e4115049b
Refactoring to use keep alive sessions.
2025-04-16 06:44:57 -03:00
Filipi Fuchter
3ebef9346f
Adding support for RemoteSmartTurn
2025-04-16 06:33:42 -03:00
Filipi Fuchter
3e2d21779f
Refactoring the BaseEndOfTurnAnalyzer to include most of the logic
2025-04-16 06:11:56 -03:00
Filipi Fuchter
cfefcac35f
Resetting the silence frames when the user speaks.
2025-04-15 20:51:36 -03:00
Filipi Fuchter
57b39c084f
Triggering to check if the turn is complete based on the maximum timeout
2025-04-15 20:42:41 -03:00
Filipi Fuchter
11b6de0900
Triggering to check if the turn is complete each time the user stops speaking based on the vad
2025-04-15 17:28:00 -03:00
Varun Singh
824bc9bf16
Update dial.js
2025-04-15 12:48:33 -07:00
Varun Singh
d0ddef6c12
Update server.py
2025-04-15 12:37:33 -07:00
Mark Backman
ad40a0f076
Update OpenAILLMService and OpenPipeLLMService to use gpt-4.1 by default
2025-04-15 15:11:05 -04:00
Filipi Fuchter
e6325a8229
Integrating with the smart turn model to predict
2025-04-15 16:01:09 -03:00
Mark Backman
6d10732889
Update OpenAILLMService examples to use gpt-4.1
2025-04-15 14:59:55 -04:00
Mark Backman
fdb46a0fa9
Update client/server demos to kick off conversation in on_client_ready handler
2025-04-15 14:50:38 -04:00
Filipi Fuchter
3588b06718
Adding missing torch dependency.
2025-04-15 12:28:36 -03:00
Filipi Fuchter
73874f6ec0
Loading the smart turn model.
2025-04-15 12:11:06 -03:00
Filipi Fuchter
6ab9a8ad7f
Starting to create a local smart turn
2025-04-15 11:24:39 -03:00
Filipi Fuchter
821e303249
Bringing Aleix initial implementation for the smart turn.
2025-04-15 10:21:40 -03:00
chadbailey59
efae26a5a8
Client connect/disconnect events for DailyTransport ( #1544 )
...
* added multi transport example
* added working example
* restructured example and added readme
* removed image
* cleanup
* changed data type of callback signature
* removed pipecat example
* added changelog
2025-04-14 15:56:41 -05:00
Aleix Conchillo Flaqué
d16ace22ac
Merge pull request #1583 from pipecat-ai/aleix/soundfilemixer-constructor-updates
...
SoundfileMixer: add mixing argument and require keywords
2025-04-14 10:59:30 -07:00
Rahul Tayal
001c26b79c
Fixed params issue in Google Vertex ai
2025-04-14 23:29:16 +05:30
Arman
8dc4f1cda0
Troubleshooting SSL error
2025-04-14 13:39:53 -04:00
Aleix Conchillo Flaqué
ab6be11a0e
SoundfileMixer: add mixing argument and require keywords
2025-04-14 08:30:56 -07:00
Filipi da Silva Fuchter
054158b0ff
Merge pull request #1579 from pipecat-ai/fixing_smallwebrtc_issue
...
Fixed an issue in `SmallWebRTCTransport`
2025-04-14 10:44:22 -03:00
Filipi da Silva Fuchter
174cf13abd
Merge pull request #1580 from pipecat-ai/fixing_voice_agent_example
...
Fixing the voice agent example to always create the video transceiver.
2025-04-14 10:44:07 -03:00
Filipi Fuchter
099d2c02e1
Fixing the voice agent example to always create the video transceiver.
2025-04-14 10:41:39 -03:00
Filipi Fuchter
e1108466f6
Fixed an issue in SmallWebRTCTransport where an error was thrown if the client did not create a video transceiver.
2025-04-14 10:36:25 -03:00
Mark Backman
edd53d425e
Merge pull request #1577 from pipecat-ai/hush/trackStoppedSimpleChatbot
...
docs: Fix TrackStopped typo in SimpleChatbot
2025-04-14 08:32:58 -04:00
James Hush
b160cf34e9
Remove formatting
2025-04-14 15:13:45 +08:00
James Hush
dae3b927e1
docs: Fix TrackStopped typo in SimpleChatbot
2025-04-14 15:12:17 +08:00
Mark Backman
bd3d30111a
Merge pull request #1569 from pipecat-ai/pipecat-0.0.63
...
Update CHANGELOG for 0.0.63
2025-04-11 20:09:58 -04:00
Mark Backman
8c7e16e717
Update CHANGELOG for 0.0.63
2025-04-11 20:04:50 -04:00
Mark Backman
f6accbd510
Updating foundation examples to use SmallWebRTCTransport and pipecat-ai-small-webrtc-prebuilt ( #1534 )
...
Co-authored-by: Filipi Fuchter <filipi@daily.co >
2025-04-11 19:44:16 -04:00
Mark Backman
8186219879
Merge pull request #1513 from pipecat-ai/mb/gemini-context-formatting
...
Fix: GeminiMultimodalLiveLLMService, add spaces between words in assi…
2025-04-11 15:30:51 -04:00
Mark Backman
b9a2ed5b58
Fix: GeminiMultimodalLiveLLMService, add spaces between words in assistant context messages
2025-04-11 15:14:52 -04:00
Mark Backman
7ac12ffc85
Merge pull request #1550 from pipecat-ai/mb/cartesia-spelling-timestamps
...
Fix: Cartesia's spelling feature adds whole word to context
2025-04-11 15:13:55 -04:00
Filipi da Silva Fuchter
f623cf96f7
Merge pull request #1560 from pipecat-ai/bot_left_signalling
...
Bot left signalling message
2025-04-11 16:08:01 -03:00
Mark Backman
06be20eb16
Fix: Cartesia's spelling feature adds whole word to context
2025-04-11 15:04:58 -04:00
Filipi Fuchter
816b3a9545
Fixing ruff format
2025-04-11 15:37:16 -03:00
Filipi Fuchter
255666925b
Sending a new signalling message peerLeft.
2025-04-11 15:35:50 -03:00
Mark Backman
0df065fda4
Merge pull request #1566 from pipecat-ai/mb/gemini-live-beta
...
Add Gemini Live support for languages, native model transcriptions, media resolution, and VAD settings
2025-04-11 12:40:04 -04:00
Mark Backman
241a947b8b
Add CHANGELOG entries
2025-04-11 11:46:48 -04:00
Mark Backman
e28c199dd1
Add GeminiMultimodalLiveLLMService support for VAD Params
2025-04-11 11:46:48 -04:00
Filipi da Silva Fuchter
6220ee4efb
Merge pull request #1565 from pipecat-ai/fixing_video_transform_demo
...
Fixing the video transform demo to use 20ms audio.
2025-04-11 11:45:29 -03:00
Filipi Fuchter
b650d043bf
Fixing the video transform demo to use 20ms audio.
2025-04-11 11:22:41 -03:00
Mark Backman
121e6d2157
Add media resolution support to GeminiMultimodalLiveLLMService
2025-04-11 10:18:29 -04:00
Mark Backman
dbd7869de7
Add model transcription support
2025-04-11 10:02:52 -04:00
Mark Backman
b7d56d5ff0
Add language support for Gemini Live
2025-04-11 09:21:14 -04:00
Bnowako
61cba0136f
Add marker file for static type checkers
2025-04-11 11:00:57 +02:00
Filipi da Silva Fuchter
ed743b55d4
Merge pull request #1561 from pipecat-ai/fixing_voice_agent
...
Fixing voice agent example
2025-04-10 23:33:35 -03:00
Filipi Fuchter
fb074895f5
Fixing ruff format.
2025-04-10 23:19:31 -03:00
Filipi Fuchter
d916865ccc
Fixing voice agent example to work with the last released version of pipecat.
2025-04-10 23:10:50 -03:00
Filipi Fuchter
6378a8ccd3
Starting to implement a signalling message to when the bot has left
2025-04-10 23:02:27 -03:00
Aleix Conchillo Flaqué
5dbb5f176b
Merge pull request #1551 from pipecat-ai/aleix/daily-python-0.17.0
...
pyproject: update daily-python to 0.17.0
2025-04-10 09:06:55 -07:00
Filipi da Silva Fuchter
b89f2611f7
Merge pull request #1539 from pipecat-ai/small_wbertc_mute_state
...
SmallWebRTC mute state
2025-04-10 11:26:53 -03:00
Filipi Fuchter
db0f783c55
Updating the video-transform demo to use the latest version of the SmallWebRTCTransport.
2025-04-10 11:23:28 -03:00
Filipi Fuchter
20ec323647
Refactoring the video-transform demo to be able to enable or disable the cam.
2025-04-10 11:23:05 -03:00
Filipi Fuchter
f71c09a4fd
Added support in SmallWebRTCTransport to detect when remote tracks are muted.
2025-04-10 11:22:37 -03:00
Mark Backman
cba4ebfcf9
Merge pull request #1555 from pipecat-ai/mb/gemini-beta-base
2025-04-10 09:01:16 -04:00
Mark Backman
3b9a8946f9
Update GeminiMultimodalLiveLLMService base_url
2025-04-10 08:17:52 -04:00
Mark Backman
db3620c4be
Merge pull request #1553 from balaji-atoa/main
...
feat: change default model name on live api
2025-04-10 08:10:35 -04:00
Mark Backman
11338ea92d
Merge pull request #1552 from pipecat-ai/mb/p2p-capture-image
...
Add image capture to SmallWebRTCTransport
2025-04-10 07:52:13 -04:00
Filipi da Silva Fuchter
90563a4091
Merge pull request #1542 from pipecat-ai/small_webrtc_prebuilt_ui
...
Using the small-webrtc-prebuilt-ui
2025-04-10 07:39:26 -03:00
Filipi da Silva Fuchter
937f5f7cb7
Update examples/p2p-webrtc/video-transform/server/requirements.txt
...
Co-authored-by: Mark Backman <mark@daily.co >
2025-04-10 07:37:23 -03:00
Filipi da Silva Fuchter
4f221b817a
Update examples/p2p-webrtc/video-transform/README.md
...
Co-authored-by: Mark Backman <mark@daily.co >
2025-04-10 07:37:07 -03:00
balaji-atoa
c79c1f65fc
feat: change default model name on live api
2025-04-10 11:59:11 +05:30
Mark Backman
8ad2ad0e59
Add image capture to SmallWebRTCTransport
2025-04-09 23:01:06 -04:00
Aleix Conchillo Flaqué
499b258bf9
pyproject: update daily-python to 0.17.0
2025-04-09 18:59:10 -07:00
Filipi Fuchter
05b6a5ae4b
Improving the video-transform readme
2025-04-09 15:55:13 -03:00
Filipi Fuchter
65fcea28ce
Using the small-webrtc-prebuilt-ui
2025-04-09 15:45:30 -03:00
Kwindla Hultman Kramer
005c0b55b6
Merge pull request #1545 from pipecat-ai/khk/gem-live-0408
...
Gemini Multimodal Live API base_url format fix
2025-04-08 21:46:30 -07:00
Kwindla Hultman Kramer
1828127f41
small fix to wss base_url
2025-04-08 20:22:26 -07:00
Filipi da Silva Fuchter
77ab841cab
Merge pull request #1532 from pipecat-ai/p2p_ios_demo
...
iOS demo for the p2p-webrtc video-transform example
2025-04-07 16:58:06 -03:00
Filipi Fuchter
3bbc75110a
Mentioning the iOS client inside the changelog and fixing the readme.
2025-04-07 16:54:26 -03:00
Filipi Fuchter
b2ce1d9378
Merge branch 'main' into p2p_ios_demo
2025-04-07 16:50:58 -03:00
Filipi Fuchter
58714865df
Using the public version of pipecat-client-ios-small-webrtc
2025-04-07 16:48:18 -03:00
Mark Backman
03b3635b0a
Merge pull request #1521 from pipecat-ai/mb/increase-bot-vad-stop-secs
...
Increase BOT_VAD_STOP_SECS for services with slower speech patterns
2025-04-07 14:44:31 -04:00
Mark Backman
aaa7b5e626
Merge pull request #1524 from pipecat-ai/mb/tts-generate-with-text
...
TTS: Skip generation when there is no text
2025-04-07 14:44:18 -04:00
Varun Singh
0b8486ce39
Merge pull request #1418 from pipecat-ai/vr000m-pcc-dialin-webhook-server
...
Pipecat Cloud: Companion server to handle webhooks for pinless dial-in
2025-04-07 09:00:38 -07:00
Mark Backman
d4ae091ddd
Update port in FastAPI README, add run steps to nextjs README
2025-04-07 11:09:43 -04:00
Mark Backman
9e0a57a6de
Rename directories
2025-04-07 10:44:41 -04:00
Mark Backman
fc4c1e4110
README updates
2025-04-07 10:33:18 -04:00
Mark Backman
9b740d9e72
Merge pull request #1537 from pipecat-ai/mb/azure-tts-lang
...
Fix: Set language for Azure TTS services
2025-04-07 09:46:08 -04:00
Mark Backman
b03563765f
Fix: Set language for Azure TTS services
2025-04-07 09:24:31 -04:00
Filipi Fuchter
a1578bd67a
iOS demo for the p2p-webrtc video-transform example
2025-04-04 16:40:52 -03:00
Filipi da Silva Fuchter
6466573b84
Merge pull request #1498 from pipecat-ai/aiortc_example_ios
...
Improvements for the SmallWebRTCTransport
2025-04-04 16:39:06 -03:00
Filipi Fuchter
b42dc83696
Improvements for the SmallWebRTCTransport:
...
- Wait until the pipeline is ready before triggering the `connected` event.
- Queue messages if the data channel is not ready.
- Update the aiortc dependency to fix an issue where the 'video/rtx' MIME type
was incorrectly handled as a codec retransmission.
- Avoid initial video delays.
2025-04-04 16:33:57 -03:00
Filipi Fuchter
fe5931b884
Updating aiortc to fix an issue where 'video/rtx' MIMEType retransmission incorrectly handled as a codec
2025-04-04 16:28:54 -03:00
Filipi Fuchter
4b438ff7d7
Allowing ngrok connections to the video-transform demo
2025-04-04 16:28:37 -03:00
Filipi da Silva Fuchter
89a8c16676
Merge pull request #1531 from pipecat-ai/fix_chunk_default_value
...
Fixed SmallWebRTCTransport to support dynamic chunk values.
2025-04-04 16:04:05 -03:00
Filipi Fuchter
c4c92585f9
Fixed SmallWebRTCTransport to support dynamic chunk values.
2025-04-04 15:38:12 -03:00
Shaiyon Hariri
af23200511
Use default google creds as fallback when not provided in llm_vertex,stt, and tts
2025-04-03 16:42:58 -04:00
Mark Backman
63146d6f85
TTS: Skip generation when there is no text
2025-04-03 16:15:58 -04:00
Mattie Ruth
ec00edc893
Update client examples to use latest versions ( #1523 )
2025-04-03 15:47:03 -04:00
Mark Backman
a21be058e2
Increase BOT_VAD_STOP_SECS for services with slower speech patterns
2025-04-03 15:25:48 -04:00
Mark Backman
c226c20e12
Merge pull request #1519 from pipecat-ai/mb/ref-docs-toc
...
Docs: Update ToC With Adapters and Observers
2025-04-03 15:19:35 -04:00
Aleix Conchillo Flaqué
78e6669105
Merge pull request #1514 from pipecat-ai/aleix/producer-consumer-processors
...
processors: add ProducerProcessor and ConsumerProcessor
2025-04-03 12:18:49 -07:00
Aleix Conchillo Flaqué
79f29e14dd
processors: add ProducerProcessor and ConsumerProcessor
2025-04-03 09:44:56 -07:00
Mark Backman
d4a00fd080
Merge pull request #1517 from pipecat-ai/mb/update-simple-chatbot-packages
...
Update client packages for simple-chatbot JS and React
2025-04-03 10:07:40 -04:00
Mark Backman
d4186fa115
Merge pull request #1518 from pipecat-ai/mb/openai-verse
...
Add verse voice and bump the OpenAI version
2025-04-03 09:48:09 -04:00
Mark Backman
3536cbcd13
Add docstrings to FunctionSchema, update CONTRIBUTING.md with docstrings guidance, ignore __init__ docstrings if a class is sufficiently documented
2025-04-03 09:21:26 -04:00
Mark Backman
e3bcb70b13
Update ToC With Adapters and Observers
2025-04-03 09:02:09 -04:00
Mark Backman
19a82f9522
Add verse voice and bump the OpenAI version
2025-04-03 08:23:59 -04:00
Mark Backman
8c0a847449
Update client packages for simple-chatbot JS and React
2025-04-03 07:43:25 -04:00
Dominic Stewart
e3704cd1a1
Updated imports to work with pipecat 0.62 ( #1515 )
2025-04-03 15:07:02 +08:00
Dominic Stewart
1ba037865b
Call Transfer demo ( #1348 )
...
* Updated code to dial out to an operator, keep track of operator conversation while escalated and then return to conversation when finished
* Removed unnecessary imports
* Updated bot runner code, added call routing file and then updated the call transfer and voicemail detection examples
* Updated the bot files
* Made prompt one level higher in the body and an array
* Updated call transfer examples to work correctly
* Updated gemini voicemail detection example to work
* Added twilio bot support back to the bot_runner
* Moved some state management, participant management and other logic to the helper file.
* Updated comments
* Updated env and requirements file
* Ran the examples and made sure code works. Still need to work on the prompts a bit
* Fixed format issue
* Add support to disable summary in call transfer
* Added support for operator transfer mode
* Updated readme file
* Updated readme based on feedback, and handling of various properties in the json to be more flexible for future examples
* Updated number of endpoints
* Updated readme to remove fly deployment text and replaced with Pipecat Cloud
* Starting to tweak function calls and prompts
* Updated examples to more consistently call the functions and say what they need to say
* Updated examples
* Updated examples
* Updated examples to work correctly
* Add simple bot versions of dialin and dialout
* Refactored the bot runner file to make adding future examples easier
* Based on feedback, removed examples for multiple LLMs and also adjusted voicemail detection code to be simpler
* Made sure to only capture the users transcription once
* Updated readme with latest changes
* Forgot to update the order of examples in one place
* Fixed formatting issue
* Adjusted based on james feedback
* Changed default_mode to default_calltransfer_mode
2025-04-03 09:03:23 +09:00
Aleix Conchillo Flaqué
909520f76e
Merge pull request #1508 from pipecat-ai/mb/gemini-push-stop-speaking-frame
...
LLMAssistantContextAggregator should push BotStoppedSpeakingFrames
2025-04-02 16:25:08 -07:00
Mark Backman
d06cfcd597
Merge pull request #1512 from pipecat-ai/mb/fix-gemini-examples
...
Examples: Fix context_aggregator.assistant() pipeline position
2025-04-02 19:07:09 -04:00
Mark Backman
2579d0cf57
Examples: Fix context_aggregator.assistant() pipeline position
2025-04-02 16:11:03 -04:00
Mark Backman
1ec20b2e74
Merge pull request #1509 from pipecat-ai/mb/openia-voices
...
Add new voices to OpenAITTSService
2025-04-02 15:50:39 -04:00
Mark Backman
55a6e5aa4c
Add new voices to OpenAITTSService
2025-04-02 12:09:36 -04:00
Varun Singh
2229730169
moving to appropriate directory
2025-04-01 23:45:09 -07:00
Varun Singh
24b54c66ee
fixes review comments
2025-04-01 23:39:21 -07:00
Varun Singh
a14205415f
replaced dailyAPIKey with pccApiKey, also allow handling of messages when hmac is missing
2025-04-01 23:34:24 -07:00
Varun Singh
18b56d4a10
Fix README.md
2025-04-01 23:32:50 -07:00
Mark Backman
b85bd91d08
LLMAssistantContextAggregator should push BotStoppedSpeakingFrames
2025-04-01 23:35:09 -04:00
Aleix Conchillo Flaqué
23f3285a7d
Merge pull request #1507 from pipecat-ai/aleix/pipecat-0.0.62
...
update CHANGELOG for 0.0.62
2025-04-01 19:00:06 -07:00
Aleix Conchillo Flaqué
94f6436619
update CHANGELOG for 0.0.62
2025-04-01 18:55:04 -07:00
Aleix Conchillo Flaqué
480692971c
Merge pull request #1506 from pipecat-ai/aleix/websockets-mixer-loop-fixes
...
transports(websocket): close connection from last transport
2025-04-01 18:52:47 -07:00
Aleix Conchillo Flaqué
5df5f6ae4c
transports(websocket): close connection from last transport
2025-04-01 18:32:03 -07:00
Aleix Conchillo Flaqué
6940112ab9
Merge pull request #1504 from pipecat-ai/aleix/base-output-transport-audio-10ms-chunk-update
...
TransportParams: set audio_out_10ms_chunks to 4
2025-04-01 15:15:24 -07:00
Aleix Conchillo Flaqué
80584e9138
TransportParams: set audio_out_10ms_chunks to 4
2025-04-01 15:13:28 -07:00
Aleix Conchillo Flaqué
1fd01e715d
Merge pull request #1503 from pipecat-ai/aleix/function-call-result-system-frame
...
frames: make FunctionCallResultFrame a SystemFrame
2025-04-01 15:08:26 -07:00
Aleix Conchillo Flaqué
a7a1cd0cde
Merge pull request #1502 from pipecat-ai/aleix/test-user-idle-py310
...
tests: fix test_user_idle_processor for python 3.10
2025-04-01 15:08:10 -07:00
Aleix Conchillo Flaqué
e5a6b9d2b4
Merge pull request #1500 from pipecat-ai/aleix/base-output-transport-optimize-bot-speaking
...
BaseOutputTransport: optimize BotSpeakingFrames
2025-04-01 14:59:25 -07:00
Aleix Conchillo Flaqué
169b50af61
frames: make FunctionCallResultFrame a SystemFrame
2025-04-01 14:42:22 -07:00
Aleix Conchillo Flaqué
31311d8ac5
tests: fix test_user_idle_processor for python 3.10
2025-04-01 13:54:59 -07:00
Aleix Conchillo Flaqué
bfd06b321d
BaseOutputTransport: optimize BotSpeakingFrames
2025-04-01 11:11:49 -07:00
Aleix Conchillo Flaqué
3efbcab39c
Merge pull request #1499 from pipecat-ai/aleix/base-output-transport-set-chunks-size
...
BaseOutputTransport: allow setting 10ms output audio chunks
2025-04-01 11:10:34 -07:00
Aleix Conchillo Flaqué
b40ca391f5
BaseOutputTransport: allow setting 10ms output audio chunks
2025-04-01 10:48:36 -07:00
Aleix Conchillo Flaqué
43008c8c5b
Merge pull request #1501 from pipecat-ai/aleix/transcription-processor-interruption
...
TranscriptProcessor: send TranscriptionUpdateFrame after interruption
2025-04-01 10:46:16 -07:00
Aleix Conchillo Flaqué
3a37b11e56
TranscriptProcessor: send TranscriptionUpdateFrame after interruption
2025-04-01 10:21:21 -07:00
Mark Backman
9ea81bc982
Merge pull request #1497 from pipecat-ai/mb/gladia-languages
...
Align languages with Gladia's supported languages, remove audio_enhancer option
2025-04-01 11:54:24 -04:00
Mark Backman
98b499e2e9
Remove audio_enhancer option
2025-04-01 10:26:28 -04:00
Mark Backman
72c8f6c8c3
Update GladiaSTTService language list
2025-04-01 10:17:42 -04:00
Mark Backman
ea61256ddc
Merge pull request #1496 from pipecat-ai/mb/gladia-model
...
Update GladiaSTTService default model
2025-04-01 08:52:13 -04:00
Mark Backman
babafadbe4
Merge pull request #1494 from pipecat-ai/mb/p2p-examples-gitignore
...
Add .gitignore to p2p video-transform example
2025-04-01 07:39:35 -04:00
Mark Backman
a5660f6dc7
Add .gitignore to p2p video-transform example
2025-04-01 07:20:39 -04:00
Aleix Conchillo Flaqué
64ad916c5f
Merge pull request #1492 from pipecat-ai/aleix/downgrade-to-aiohttp-3.11.12
...
pyproject: downgrade to aiohttp 3.11.12
2025-03-31 19:01:04 -07:00
Aleix Conchillo Flaqué
13d0563298
pyproject: downgrade to aiohttp 3.11.12
...
See https://pypi.org/project/aiohttp/#history
2025-03-31 18:59:41 -07:00
Mark Backman
20a1dd066d
Update GladiaSTTService default model
2025-03-31 19:02:28 -04:00
Mark Backman
56f6e3ceb4
Merge pull request #1490 from pipecat-ai/fix_ruff_format
...
Fixing ruff format.
2025-03-31 18:37:19 -04:00
Mark Backman
3afab63870
Merge pull request #1488 from pipecat-ai/mb/stt-mute-filter-logline
...
Clarify the mute/unmute log line in STTMuteFilter
2025-03-31 18:35:47 -04:00
Filipi Fuchter
d3b9a0aab0
Fixing ruff format.
2025-03-31 19:17:40 -03:00
Filipi da Silva Fuchter
6b21081a7d
Merge pull request #1487 from pipecat-ai/smallwebrtc_ios_support
...
SmallWebRTCTransport: Improvements to work with mobile
2025-03-31 19:10:03 -03:00
Aleix Conchillo Flaqué
648bdea64c
fix formatting
2025-03-31 15:04:45 -07:00
milo157
ed387e876a
Merge pull request #1486 from CerebriumAI/feature/ultravox
...
Feature/ultravox - bug fixes
2025-03-31 15:03:26 -07:00
Aleix Conchillo Flaqué
2fb9aa4d76
Merge pull request #1489 from pipecat-ai/aleix/base-ai-services-restructure
...
services: restructure base AI services into modules
2025-03-31 15:00:13 -07:00
Aleix Conchillo Flaqué
9eba8f1637
services: restructure base AI services into modules
2025-03-31 13:53:36 -07:00
Mark Backman
43c255f58a
Clarify the mute/unmute log line in STTMuteFilter
2025-03-31 16:45:02 -04:00
Filipi Fuchter
121e70a029
Improvements on the video transform example to work on mobile.
2025-03-31 17:11:38 -03:00
Filipi Fuchter
70e28a0547
Adding support to yuvj420p which is the format that we receive from mobile iOS.
2025-03-31 13:12:20 -03:00
Mark Backman
c9a93f2504
Merge pull request #1469 from pipecat-ai/mb/update-gladia
...
Refactor GladiaSTTService to support addition params
2025-03-31 11:18:32 -04:00
Mark Backman
8a12470efd
Reorganize into a directory
2025-03-30 20:01:40 -04:00
Mark Backman
05d53bc66f
Refactor GladiaSTTService; add support for additional params
2025-03-30 19:54:55 -04:00
Aleix Conchillo Flaqué
e763cd7bee
Merge pull request #1471 from pipecat-ai/aleix/services-restructure
...
services: restructure services into folders
2025-03-30 16:23:26 -07:00
Aleix Conchillo Flaqué
94ec5118e6
track already reported deprecated modules (mark's update)
2025-03-30 16:21:00 -07:00
Aleix Conchillo Flaqué
7203ef6885
examples: use new services packages
2025-03-30 16:21:00 -07:00
Aleix Conchillo Flaqué
3074a62bb1
services: restructure services into folders
2025-03-30 16:21:00 -07:00
Mark Backman
31712b84ac
Merge pull request #1479 from pipecat-ai/mb/qwen-pyproject-entry
2025-03-29 22:36:40 -04:00
Mark Backman
c99ec0b0b7
Add placeholder entry for qwen to pyproject.toml
2025-03-29 20:20:48 -04:00
Mark Backman
cd7abd2962
Merge pull request #1478 from pipecat-ai/mb/alibaba-cloud-offerings
...
Add QwenLLMService
2025-03-29 20:13:21 -04:00
Mark Backman
c7544954cf
Merge pull request #1476 from pipecat-ai/mb/ref-docs-mem0-mlx-whisper
...
Update reference docs generation for mem0 and mlx-whisper
2025-03-29 20:12:58 -04:00
Mark Backman
4f390b15a3
Merge pull request #1477 from pipecat-ai/mb/fix-mem0-example-number
...
Renumber mem0 example, small changelog updates
2025-03-29 20:04:45 -04:00
Mark Backman
f2a05b065d
Add QwenLLMService
2025-03-29 19:43:37 -04:00
Mark Backman
5d5041eb2b
Renumber mem0 example, small changelog updates
2025-03-29 18:45:39 -04:00
Mark Backman
f4dc66cb13
Update reference docs generation for mem0 and mlx-whisper
2025-03-29 18:42:08 -04:00
Mark Backman
b88744b18d
Merge pull request #1475 from pipecat-ai/khk/whisper-mlx-example
...
Example and CHANGELOG for WhisperSTTServiceMLX service
2025-03-29 18:09:17 -04:00
Kwindla Hultman Kramer
209de2638d
WhisperSTTServiceMLX example and CHANGELOG
2025-03-29 18:04:07 -04:00
Mark Backman
5d829fb6a9
Merge pull request #1474 from pipecat-ai/khk/mem0-changelog
...
Changelog entry for mem0 service
2025-03-29 18:02:32 -04:00
Mark Backman
a978a5cd4a
Fix Whisper formatting
2025-03-29 17:57:50 -04:00
Mark Backman
b9ea3f0fd9
Update README, organize pyproject.toml
2025-03-29 17:56:17 -04:00
Kwindla Hultman Kramer
d2f5ee2915
Changelog entry for mem0 service
2025-03-29 17:55:26 -04:00
Mark Backman
acddddc508
Merge pull request #1472 from pipecat-ai/mb/small-webrtc-readme
...
Add README link for SmallWebRTCTransport
2025-03-29 17:38:15 -04:00
Kwindla Hultman Kramer
0c2c6fa771
Merge pull request #1383 from zboyles/add-mlx-whisper
...
Added Support for MLX Whisper models on Apple M-Series
2025-03-29 14:25:37 -07:00
Mark Backman
80088c6138
Merge pull request #1473 from pipecat-ai/mb/ref-docs-updates
...
Update packages for auto-generating docs
2025-03-29 17:20:46 -04:00
Kwindla Hultman Kramer
766639a9a4
Merge pull request #1388 from deshraj/user/dyadav/mem0-integration
...
Added mem0 service.
2025-03-29 13:12:58 -07:00
Mark Backman
675e2b1498
Update packages for auto-generating docs
2025-03-29 08:21:58 -04:00
Mark Backman
af6c23f7b1
Add README link for SmallWebRTCTransport
2025-03-28 21:29:24 -04:00
Aleix Conchillo Flaqué
d212e88030
Merge pull request #1468 from pipecat-ai/aleix/smallwebrtc-updates
...
transports(webrtc): some SmallWebRTC updates
2025-03-28 14:41:45 -07:00
Aleix Conchillo Flaqué
d6758bf2ad
transports(webrtc): rename appMessage to app-message
2025-03-28 14:35:11 -07:00
Filipi Fuchter
5abfb15300
Registering the event handlers and fixing the examples.
2025-03-28 17:30:06 -03:00
Aleix Conchillo Flaqué
f576254d61
transports(webrtc): some SmallWebRTC updates
2025-03-28 13:19:23 -07:00
Aleix Conchillo Flaqué
a90807a3d2
Merge pull request #1465 from roey-priel/main
...
Tavus / Deepgram TTS compatibility
2025-03-28 08:43:09 -07:00
roey
a06fc4ce50
yield outside of the loop
2025-03-28 08:41:36 -07:00
roey
80cb4497f0
Merge pull request #1 from roey-priel/deepgram-tts-tavus-compatibility
...
Update deepgram.py
2025-03-27 17:06:33 -07:00
roey
8aa878c5e9
Update deepgram.py
2025-03-27 17:05:29 -07:00
Filipi da Silva Fuchter
e982b3d919
Merge pull request #1290 from pipecat-ai/aiortc_example
...
P2P WebRTC transport option to Pipecat
2025-03-27 18:29:44 -03:00
Filipi Fuchter
8945fd1fc6
Starting the server by default as localhost.
2025-03-27 18:27:56 -03:00
Filipi Fuchter
16b97d151b
Adding the SmallWebRTCTransport to the changelog.
2025-03-27 17:56:12 -03:00
Filipi Fuchter
f7ac142ad2
Merge branch 'main' into aiortc_example
2025-03-27 17:50:46 -03:00
Filipi da Silva Fuchter
2355067f61
Merge pull request #1441 from pipecat-ai/aiortc_example_small_webrtc_transport
...
P2P WebRTC transport - example improvements.
2025-03-27 17:49:12 -03:00
Filipi Fuchter
76f9626d35
Using the @pipecat-ai/small-webrtc-transport from npm.
2025-03-27 17:48:32 -03:00
Filipi da Silva Fuchter
f82c2566e8
Merge pull request #1270 from pipecat-ai/improve_protobuf_serializer
...
Added support to `ProtobufFrameSerializer` to send the transport messages
2025-03-27 17:28:37 -03:00
Filipi Fuchter
b6007bb3d6
Added support to ProtobufFrameSerializer to send the transport messages
2025-03-27 17:26:03 -03:00
Filipi Fuchter
311a5360ad
Renaming the example to p2p-webrtc
2025-03-27 16:46:00 -03:00
Filipi Fuchter
62cb0376f2
Changing the file types.
2025-03-27 16:34:40 -03:00
Filipi Fuchter
91a69b7029
Improving the readmes for the webrtc examples.
2025-03-27 16:32:46 -03:00
Mark Backman
1d4d7f28a1
Merge pull request #1463 from pipecat-ai/mb/add-piper-readme
...
Add Piper to README
2025-03-27 08:52:31 -04:00
Mark Backman
a55a7bbb96
Add Piper to README
2025-03-27 08:03:16 -04:00
Mark Backman
a394b35e85
Merge pull request #1459 from pipecat-ai/mb/issue-1454
...
Fix: GoogleTTSService was emitting two TTSStoppedFrames
2025-03-27 08:00:16 -04:00
Mark Backman
aa85df4fd6
Fix: GoogleTTSService was emitting two TTSStoppedFrames
2025-03-27 07:55:19 -04:00
Filipi da Silva Fuchter
3bb1f5f7a8
Merge pull request #1130 from pedro-a-n-moreira/piper-tts
...
Add support for Piper TTS
2025-03-27 08:08:05 -03:00
Filipi Fuchter
7c115f9d59
Merge branch 'main' into piper-tts
...
# Conflicts:
# CHANGELOG.md
2025-03-27 08:01:38 -03:00
Filipi Fuchter
a82b847971
Fixing ruff format.
2025-03-27 07:58:53 -03:00
Filipi Fuchter
50515aa842
Adding PiperTTSService to the changelog.
2025-03-27 07:50:47 -03:00
Filipi Fuchter
b348fde32b
Refactoring PiperTTSService to match the others TTS services provided by Pipecat and fixing noise issue due to wav header.
2025-03-27 07:46:38 -03:00
Filipi Fuchter
45787520b2
Refactoring the piper test to use run_test provided by Pipecat
2025-03-27 07:45:28 -03:00
Filipi Fuchter
053bf72da2
Adding pytest-aiohttp to the dev requirements.
2025-03-27 07:44:46 -03:00
Filipi Fuchter
ca4893397a
Creating a foundational example which uses the piper service.
2025-03-27 07:44:26 -03:00
Filipi Fuchter
c1f6a4e079
Adding PIPER_BASE_URL to the env template.
2025-03-27 07:44:05 -03:00
Aleix Conchillo Flaqué
135ed811f1
Merge pull request #1460 from pipecat-ai/aleix/segmented-tts-ignore-emulated-frames
...
segmented tts ignore emulated frames
2025-03-26 16:03:59 -07:00
Aleix Conchillo Flaqué
055a3f1c53
LLMAssistantContextAggregator: stop emulations if the user starts speaking
2025-03-26 14:39:12 -07:00
Aleix Conchillo Flaqué
750bb88586
SegmentedSTTService: ignore emulated frames
2025-03-26 14:38:48 -07:00
Aleix Conchillo Flaqué
c4f9171fe1
frames: indicate if UserStartedSpeakingFrame/UserStoppedSpeakingFrame are emulated
2025-03-26 14:37:36 -07:00
Filipi Fuchter
d223201c3f
Merge branch 'main' into piper-tts
...
# Conflicts:
# test-requirements.txt
2025-03-26 16:47:45 -03:00
Mark Backman
86701fd3c7
Merge pull request #1457 from pipecat-ai/mb/fix-rtvi-observer-gemini
...
Fix: Resolve an issue where Google LLM context messages were causing …
2025-03-26 14:18:37 -04:00
Mark Backman
b414077a07
Fix: Resolve an issue where Google LLM context messages were causing a TypeError
2025-03-26 13:55:42 -04:00
kompfner
15f23929e9
Merge pull request #1455 from pipecat-ai/prepare-0.0.61
...
Update CHANGELOG for 0.0.61
2025-03-26 13:50:29 -04:00
Mark Backman
cc9e4047d0
Merge pull request #1447 from nicougou/feat/support_tts_instruct
...
feature/support instructions in OpenAITTSService
2025-03-26 13:35:41 -04:00
Paul Kompfner
4ef4dcefce
Update CHANGELOG for 0.0.61
2025-03-26 13:06:31 -04:00
kompfner
f3caa8cf7a
Merge pull request #1452 from pipecat-ai/daily-python-0.16.1
...
Bump daily-python dependency to 0.16.1 to pick up a bugfix
2025-03-26 13:01:38 -04:00
Mark Backman
e5470fec7a
Merge pull request #1453 from pipecat-ai/khk/groq
...
New GroqTTSService
2025-03-26 12:49:18 -04:00
Mark Backman
887c197bce
Add sample_rate to the constructor
2025-03-26 12:29:40 -04:00
Kwindla Hultman Kramer
f5d49fea81
try/catch import of groq SDK
2025-03-26 12:29:40 -04:00
Kwindla Hultman Kramer
e087f6ec5d
GroqTTSService added to CHANGELOG.md
2025-03-26 12:29:39 -04:00
Kwindla Hultman Kramer
406f5a395b
fix class heirarchy and audio chunking
2025-03-26 12:29:18 -04:00
Kwindla Hultman Kramer
060bb4c26b
wip
2025-03-26 12:29:18 -04:00
Nico
499e69846d
review: add changelog entries
2025-03-26 17:13:30 +01:00
Paul Kompfner
e6e339a02e
Bump daily-python dependency to 0.16.1 to pick up a bugfix
2025-03-26 11:22:23 -04:00
Nico
dc2ee2bf0a
review: remove websocket_base_url
2025-03-26 15:41:42 +01:00
Nico
d982fc35d8
fix: formatter
2025-03-26 15:41:42 +01:00
Nico
72d373e565
feature/support instructions in OpenAITTSService
2025-03-26 15:41:42 +01:00
Aleix Conchillo Flaqué
59fdfe697d
Merge pull request #1449 from pipecat-ai/aleix/google-assistant-aggregator-function-call-result
...
GoogleAssistantContextAggregator: allow any value as function call result
2025-03-26 07:25:34 -07:00
Filipi da Silva Fuchter
97c9e0676e
Merge pull request #1451 from pipecat-ai/set-tool-choice-from-context-aggregator
...
Set tool choice from context aggregator
2025-03-26 09:12:26 -03:00
Filipi Fuchter
aeac40312e
Added the feature to change dynamically the tool choice to the changelog.
2025-03-26 09:06:29 -03:00
Filipi Fuchter
ce9f75a851
Fixing the tool choice extra type to be a dict instead of string.
2025-03-26 08:17:50 -03:00
Filipi Fuchter
c45d852f6b
Merge branch 'main' into set-tool-choice-from-context-aggregator
...
# Conflicts:
# src/pipecat/processors/aggregators/llm_response.py
2025-03-26 07:14:57 -03:00
Deshraj Yadav
55cc1fe9f6
Fix import lines
2025-03-25 23:35:47 -07:00
Deshraj Yadav
1ba7e2d6fa
Format imports properly
2025-03-25 23:30:01 -07:00
Deshraj Yadav
1b8d326b49
Run ruff
2025-03-25 23:15:35 -07:00
Aleix Conchillo Flaqué
077952b658
GoogleAssistantContextAggregator: allow any value as function call result
2025-03-25 19:11:27 -07:00
Deshraj Yadav
e694971423
Merge pull request #2 from pipecat-ai/khk/mem0
...
small changes to make 35-mem0.py
2025-03-25 18:10:36 -07:00
Kwindla Hultman Kramer
d00ae492e5
small changes to make 35-mem0.py like the other foundational single-file examples.
2025-03-25 15:51:38 -07:00
Aleix Conchillo Flaqué
9450b07ec5
Merge pull request #1442 from pipecat-ai/aleix/on-context-updated-as-task
...
LLMAssistantContextAggregator: create a task to run on_context_updated
2025-03-25 15:39:36 -07:00
Aleix Conchillo Flaqué
19b464ba23
tests: add assistant aggregator function call frame handling
2025-03-25 15:37:06 -07:00
Aleix Conchillo Flaqué
8aebf00c2d
GoogleAssistantContextAggregator: function call result should be a JSON object
2025-03-25 15:37:06 -07:00
Aleix Conchillo Flaqué
01458895c2
LLMAssistantContextAggregator: create a task to run on_context_updated
2025-03-25 14:37:11 -07:00
kompfner
2082d023ef
Merge pull request #1448 from pipecat-ai/daily-python-0.16.0
...
Bump daily-python dependency to 0.16.0 to pick up support in `DailyTr…
2025-03-25 17:32:38 -04:00
Paul Kompfner
c99436b80e
Bump daily-python dependency to 0.16.0 to pick up support in DailyTransport for updating remote participants' canReceive permission via the update_remote_participants() method
2025-03-25 17:29:48 -04:00
Filipi Fuchter
f884c93826
Refactoring the video-transform example to use pipecat client.
2025-03-25 17:32:25 -03:00
Deshraj Yadav
2780c6eed6
Incorporate suggestions
2025-03-25 10:45:08 -07:00
Deshraj Yadav
7ad36eeaf4
Add mem0 as a service integration
2025-03-25 10:44:12 -07:00
Filipi Fuchter
67a93d09c2
Merge branch 'main' into aiortc_example
2025-03-25 10:31:53 -03:00
Aleix Conchillo Flaqué
f3b50bc3c4
Revert "LLMAssistantContextAggregator: create a task to run on_context_updated"
...
This reverts commit 397bae29f7 .
2025-03-24 15:40:26 -07:00
Aleix Conchillo Flaqué
397bae29f7
LLMAssistantContextAggregator: create a task to run on_context_updated
2025-03-24 15:39:35 -07:00
Mark Backman
3b3fdd0da1
Merge pull request #1439 from pipecat-ai/mb/fix-rtvi-bot-speaking-events
...
Fix: RTVIObserver now outputs a single bot started and stopped speaki…
2025-03-24 11:44:31 -04:00
Mark Backman
a9b1298f3b
Fix: RTVIObserver now outputs a single bot started and stopped speaking event per turn
2025-03-24 10:25:31 -04:00
Filipi Fuchter
2fcf4e6d70
Fixing ruff format
2025-03-24 11:23:55 -03:00
Filipi Fuchter
fcb8b9a5b3
Refactoring how we are creating the answer so we don't need to wait for the client gathering all ice candidates.
2025-03-24 11:12:41 -03:00
Filipi Fuchter
fee0409f63
Logging if the remote peer supports trickle ice.
2025-03-24 08:59:21 -03:00
Filipi Fuchter
3be6973e2c
Adding support to define ice servers.
2025-03-24 08:57:24 -03:00
Filipi Fuchter
5184d178ef
Merge branch 'main' into aiortc_example
2025-03-24 08:37:08 -03:00
Thomas B.
48e8d3968a
fix: recognition language correctly set for Azure STT ( #1436 )
2025-03-23 19:29:52 -07:00
Aleix Conchillo Flaqué
59644a939a
Merge pull request #1434 from pipecat-ai/aleix/examples-07-interruptible-local
...
examples: add foundational 07x-interruptible-local.py
2025-03-23 05:44:40 -07:00
Aleix Conchillo Flaqué
3311afc581
examples: add foundational 07x-interruptible-local.py
2025-03-22 21:58:55 -07:00
Filipi da Silva Fuchter
a3ccbf91f7
Merge pull request #1429 from pipecat-ai/fixing_set_tool_issue
...
Only checking the length if tools is a list.
2025-03-21 13:56:45 -03:00
Filipi Fuchter
3ed764a769
Only checking the length if tools is a list.
2025-03-21 12:56:05 -03:00
Mark Backman
be8d5a31f5
Merge pull request #1425 from Allenmylath/patch-25
...
Update env.example
2025-03-21 08:39:03 -04:00
Mark Backman
480bcc1ab1
Merge pull request #1424 from Allenmylath/patch-24
...
Update requirements.txt
2025-03-21 08:38:54 -04:00
allenmylath
dd81048ddb
Update env.example
...
EXAMPLE USES CARTESI NOT ELEVNE LABS
2025-03-21 10:11:28 +05:30
allenmylath
04d462ff02
Update requirements.txt
...
example uses cartesia not elevenlabs
2025-03-21 10:09:09 +05:30
Aleix Conchillo Flaqué
7e7aaeddd9
Merge pull request #1423 from pipecat-ai/aleix/elevenlabs-pcm-8000
...
ElevenLabs: add support for a sample rate of 8000
2025-03-20 19:34:16 -07:00
Aleix Conchillo Flaqué
e77f7c8456
update ruff and pyright versions
2025-03-20 19:19:08 -07:00
Aleix Conchillo Flaqué
442f18d47b
ultravox: fix formatting
2025-03-20 19:19:08 -07:00
Aleix Conchillo Flaqué
fc78e6fc5a
ElevenLabs: add support for a sample rate of 8000
2025-03-20 19:13:23 -07:00
Aleix Conchillo Flaqué
d71b520153
update CHANGELOG.md and fix formatting
2025-03-20 18:58:06 -07:00
milo157
3b4d91e1c1
Fixed ultravox service bugs ( #1420 )
2025-03-20 18:55:43 -07:00
Aleix Conchillo Flaqué
09c62d939a
Merge pull request #1422 from pipecat-ai/aleix/pipecat-0.0.60
...
update CHANGELOG for 0.0.60
2025-03-20 16:25:52 -07:00
Aleix Conchillo Flaqué
f2b9789acf
update CHANGELOG for 0.0.60
2025-03-20 16:17:34 -07:00
Aleix Conchillo Flaqué
1592703e77
Merge pull request #1421 from pipecat-ai/aleix/rollback-deepgram-to-3.8.0
...
pyproject: rollback deepgram-sdk to 3.8.0
2025-03-20 16:16:08 -07:00
Aleix Conchillo Flaqué
66e42ae410
pyproject: rollback deepgram-sdk to 3.8.0
2025-03-20 16:15:43 -07:00
Mark Backman
8d6dbbe293
Merge pull request #1417 from pipecat-ai/mb/update-realtime-transcription
...
Update InputAudioTranscription to use gpt-4o-transcribe model, update…
2025-03-20 18:49:06 -04:00
Mark Backman
2ac8f2ec2d
Fix linting
2025-03-20 18:40:16 -04:00
Paul Kompfner
41688205be
Provide new settings in OpenAI Realtime example
2025-03-20 18:23:25 -04:00
Mark Backman
541a4b6063
Update InputAudioTranscription to use gpt-4o-transcribe model, update 19 examples to use FunctionSchema
2025-03-20 18:23:24 -04:00
Aleix Conchillo Flaqué
8f6d92ce7d
update CHANGELOG with BaseOpenAILLMService default_headers
2025-03-20 13:47:15 -07:00
Aleix Conchillo Flaqué
96fa6c19a8
Merge pull request #1398 from nicougou/feature/openai_custom_headers
...
feature: add custom headers to AsyncOpenAI
2025-03-20 13:45:57 -07:00
Varun Singh
c9f7882728
initial commit
2025-03-20 12:31:08 -07:00
Aleix Conchillo Flaqué
0fdd577ae7
Merge pull request #1416 from pipecat-ai/aleix/pipecat-0.0.59
...
update CHANGELOG for 0.0.59
2025-03-20 11:48:14 -07:00
Aleix Conchillo Flaqué
2133152e5b
update CHANGELOG for 0.0.59
2025-03-20 11:42:54 -07:00
Aleix Conchillo Flaqué
c3f3f4603d
Merge pull request #1413 from pipecat-ai/aleix/llm-user-aggregator-emulate-fixes
...
LLMUserContextAggregator: fix emulated user started/stopped speaking issues
2025-03-20 11:41:26 -07:00
Aleix Conchillo Flaqué
b20ce7d655
examples: move 07u-interruptible-neuphonic to 07v
2025-03-20 11:38:29 -07:00
Aleix Conchillo Flaqué
66ba1116a4
pyproject: rollback azure to 1.42.0
2025-03-20 11:23:40 -07:00
Aleix Conchillo Flaqué
08956e914a
livekit: remove unnecessary transport cleanup() function
2025-03-20 11:23:40 -07:00
Aleix Conchillo Flaqué
5a39f146f6
LLMUserContextAggregator: fix emulated user started/stopped speaking issues
2025-03-20 11:23:40 -07:00
kompfner
de8a831ee1
Merge pull request #1414 from pipecat-ai/march-main
...
March OpenAI updates
2025-03-20 14:22:09 -04:00
Aleix Conchillo Flaqué
efa5f133d7
openai_realtime: fix and update function calling
2025-03-20 11:14:59 -07:00
Paul Kompfner
44380bc8c0
Remove duplicate changelog entry due to rebase mistake
2025-03-20 13:51:16 -04:00
Paul Kompfner
721ee75887
Comment tweak
2025-03-20 13:43:00 -04:00
Paul Kompfner
ada68f0699
More robust handling of conversation item retrieval errors in OpenAIRealtimeBetaLLMService
2025-03-20 13:43:00 -04:00
Mark Backman
70dbf0d6fc
Updated default models for OpenAISTTService and OpenAITTSService to gpt-4o based models
2025-03-20 13:42:56 -04:00
Paul Kompfner
f0774268cc
Rename gpt-4o-transcribe-latest to gpt-4o-transcribe in OpenAIRealtimeBetaLLMService
2025-03-20 13:39:40 -04:00
Chad Bailey
2ae5bdd8a9
lets talk about dogs
2025-03-20 13:39:40 -04:00
Chad Bailey
0d74bcacb7
updated models in the 07g example
2025-03-20 13:39:40 -04:00
Paul Kompfner
f94a099111
Revert the default model to be "gpt-4o-realtime-preview-2024-12-17" In OpenAIRealtimeBetaLLMService
2025-03-20 13:39:36 -04:00
Paul Kompfner
3dd4ef7230
Tweak changelog entries describing slate of recent updates to OpenAIRealtimeBetaLLMService
2025-03-20 13:36:22 -04:00
Paul Kompfner
e707efbffa
Update changelog with slate of recent updates to OpenAIRealtimeBetaLLMService
2025-03-20 13:35:12 -04:00
Paul Kompfner
7b594093dd
Handle the possibility of multiple concurrent calls to retrieve_conversation_item() in the OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
Paul Kompfner
31317ce77d
Add error handling to the retrieve_conversation_item() method of the OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
Paul Kompfner
f693a3c70f
Add retrieve_conversation_item() method to OpenAIRealtimeBetaLLMService, using the new conversation.item.retrieve introspection message.
2025-03-20 13:31:28 -04:00
Paul Kompfner
39ca607bbb
Add on_conversation_item_created and on_conversation_item_updated events to OpenAIRealtimeBetaLLMService.
...
The hope is that this will expose to the user conversation item ids at relevant times for them to use with the new `conversation.item.retrieve` introspection message.
2025-03-20 13:31:28 -04:00
Paul Kompfner
9840abd85b
Make it so you specifying model=None when creating a InputAudioTranscription results in a validation error
2025-03-20 13:31:28 -04:00
Paul Kompfner
1075c25055
Add new semantic turn detection option to OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
Paul Kompfner
e91610c69e
linter fix
2025-03-20 13:31:28 -04:00
Paul Kompfner
1a20d9bed7
Add new input_audio_noise_reduction option to OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
Paul Kompfner
d009b80438
Add new GPT-4o transcription option to OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
kompfner
fe5fc30211
Revert "Add new GPT-4o transcription option to OpenAIRealtimeBetaLLMService"
2025-03-20 13:31:28 -04:00
Paul Kompfner
be2cf6d556
formatting fix
2025-03-20 13:31:28 -04:00
Paul Kompfner
e80bfe22de
Add new GPT-4o transcription option to OpenAIRealtimeBetaLLMService
2025-03-20 13:31:28 -04:00
Paul Kompfner
214c8f79eb
linter fix
2025-03-20 13:31:28 -04:00
Paul Kompfner
16accafa6d
formatting fix
2025-03-20 13:31:28 -04:00
Kwindla Hultman Kramer
4449e9a25b
add response.done status=failed error
2025-03-20 13:31:28 -04:00
Kwindla Hultman Kramer
bfdf52bd69
change examples/foundational/19-openai-realtime-beta.py to use the new preview model
2025-03-20 13:31:28 -04:00
Kwindla Hultman Kramer
2b4debec11
add support for conversation.item.input_audio_transcription.delta
2025-03-20 13:31:28 -04:00
Mark Backman
f4626287cd
Merge pull request #1411 from pipecat-ai/mb/add-fal-wizper
...
Add FalSTTService
2025-03-20 13:08:08 -04:00
Mark Backman
e4bb4aacb4
Example: Rename 07 ultravox example
2025-03-20 12:46:00 -04:00
Mark Backman
f298febacf
Add FalSTTService
2025-03-20 12:45:16 -04:00
Aleix Conchillo Flaqué
c51291190b
Merge pull request #1394 from pipecat-ai/aleix/function-calls-as-tasks
...
function calls as tasks
2025-03-20 09:34:37 -07:00
Aleix Conchillo Flaqué
e0c3f6ad83
services: mark function calls as completed even the result is None
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
b1d506c137
GoogleAssistantContextAggregator: properly update function response
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
1f6ed01ba6
LLMAssistantContextAggregator: remove tool call id with image requests
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
3e9678db84
user image requests can now be related to function calls
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
d455fd070e
update CHANGELOG
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
d1550d5a85
tests: remove TestFrameProcessor, reimplement with run_test()
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
c15286b148
examples: deprecate start_callback from LLMService.register_function()
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
a98000fd1d
function calling now run in tasks
2025-03-20 08:51:25 -07:00
Aleix Conchillo Flaqué
fc06306efd
Merge pull request #1406 from pipecat-ai/aleix/pipeline-task-idle-timeouts
...
PipelineTask: automatically cancel tasks if pipeline is idle
2025-03-20 08:37:39 -07:00
Mark Backman
039fa59165
Merge pull request #1409 from pipecat-ai/aleix/segmented-stt-service-vad-events
...
SegmentedSTTService: use VAD events to detect valid audio
2025-03-20 09:11:08 -04:00
Aleix Conchillo Flaqué
0e14cec139
pyproject: update multiple libraries
2025-03-20 01:22:33 -07:00
Aleix Conchillo Flaqué
2417ec4f92
LLMUserContextAggregator: increase bot_interruption_timeout to 5 seconds
2025-03-20 01:20:34 -07:00
Aleix Conchillo Flaqué
7cdcd1c3d1
OpenAITTSService: allow specifying any model name
2025-03-20 01:20:34 -07:00
Aleix Conchillo Flaqué
b6be25ab84
SegmentedSTTService: use VAD events to detect valid audio
2025-03-20 00:31:49 -07:00
Aleix Conchillo Flaqué
e18d9f6a11
PipelineTask: automatically cancel tasks if pipeline is idle
2025-03-19 23:30:46 -07:00
Mark Backman
3a73346a41
Merge pull request #1408 from pipecat-ai/mb/claude-models-example
...
Update to Claude 3.7 Sonnet latest in examples
2025-03-20 01:44:59 -04:00
Aleix Conchillo Flaqué
8d58d1c8bb
Merge pull request #1404 from pipecat-ai/aleix/gemini-push-frame-fixes
...
GeminiMultimodalLiveLLMService: fix duplicated messages in context
2025-03-19 21:51:39 -07:00
Mark Backman
07a77e066f
Update to Claude 3.7 Sonnet latest in examples
2025-03-19 23:18:30 -04:00
Aleix Conchillo Flaqué
3024896d3d
Merge pull request #1405 from pipecat-ai/aleix/tts-services-fallback
...
WebsocketTTSService: add `on_connection_error` and `reconnect_on_error`
2025-03-19 19:39:51 -07:00
Aleix Conchillo Flaqué
a3b5e4413a
WebsocketTTSService: add on_connection_error and reconnect_on_error
2025-03-19 19:38:08 -07:00
Aleix Conchillo Flaqué
f31e77c4f6
pyproject: added empty tavus dependencies
2025-03-19 18:43:07 -07:00
Aleix Conchillo Flaqué
8942c2e053
GeminiMultimodalLiveLLMService: fix duplicated messages in context
...
Fixes #1384
2025-03-19 15:33:54 -07:00
Aleix Conchillo Flaqué
afb26be0ad
Merge pull request #1396 from pipecat-ai/aleix/stt-service-audio-passthrough
...
SegmentedSTTService: allow audio to pass-through downstream
2025-03-19 11:16:40 -07:00
Aleix Conchillo Flaqué
48d73a2636
SegmentedSTTService: allow audio to pass-through downstream
2025-03-19 11:06:12 -07:00
Aleix Conchillo Flaqué
da531dabfd
Merge pull request #1304 from pipecat-ai/aleix/handle-emails-user-email-gathering
...
add skip tags aggregator to support TTS service spelling out tags
2025-03-19 11:05:10 -07:00
Aleix Conchillo Flaqué
336e2f1579
TTSServices: for now just specify a single text aggregator
2025-03-19 11:02:29 -07:00
Aleix Conchillo Flaqué
fc0f404d26
examples: add new 36-user-email-gathering.py
2025-03-19 10:57:29 -07:00
Aleix Conchillo Flaqué
54620133d4
services: add spelling out support to CartesiaTTSService and RimeTTSService
2025-03-19 10:57:29 -07:00
Aleix Conchillo Flaqué
e7224473f2
utils(text): add new SkipTagsAggregator
2025-03-19 10:57:29 -07:00
Aleix Conchillo Flaqué
1a3a268c9d
utils(string): add new function parse_start_end_tags()
2025-03-19 10:57:29 -07:00
Aleix Conchillo Flaqué
11984b89b7
utils(string): add support for floating point numbers
2025-03-19 10:57:29 -07:00
Aleix Conchillo Flaqué
1dbad2326a
utils(string): support email addresses in end of sentence matching
2025-03-19 10:57:27 -07:00
Mark Backman
2e0c6c2bd1
Merge pull request #1397 from pipecat-ai/mb/disconnect-bot
...
Fix: RTVI message disconnect-bot now pushes EndTaskFrame
2025-03-19 10:45:24 -04:00
Nico
5f28834588
feature: add custom headers to AsyncOpenAI
2025-03-19 14:49:51 +01:00
Mark Backman
7f1ccab445
Fix: RTVI message disconnect-bot now pushes EndTaskFrame
2025-03-19 07:07:45 -04:00
Aleix Conchillo Flaqué
7ddac4eb88
Merge pull request #1395 from pipecat-ai/aleix/multiple-text-filters-and-aggregators
...
TTSService: allow passing multiple text filters and aggregators
2025-03-18 21:25:29 -07:00
Aleix Conchillo Flaqué
514ecda755
TTSService: allow passing multiple text filters and aggregators
2025-03-18 17:31:01 -07:00
balalo
48b6850df4
allow other function names
2025-03-18 20:45:31 +01:00
Aleix Conchillo Flaqué
71a38a120e
Merge pull request #1376 from pipecat-ai/aleix/event-handlers-as-tasks
...
event handlers are now executed in separate tasks
2025-03-18 12:10:34 -07:00
Mark Backman
79616de7a4
Merge pull request #1392 from pipecat-ai/mb/fix-google-stt-timeout
...
Fix an issue where GoogleSTTService would timeout due to stream inact…
2025-03-18 14:17:44 -04:00
Mark Backman
6368fbe0dd
Merge pull request #1318 from Vaibhav159/vl_google_vertex_llm
...
adding vertex google llm
2025-03-18 14:17:21 -04:00
Mark Backman
5dc8b48fbe
Fix an issue where GoogleSTTService would timeout due to stream inactivity
2025-03-18 14:06:32 -04:00
Aleix Conchillo Flaqué
9112ff114f
Merge pull request #1359 from lucasrothman/tavus-output-sample-rate
...
Tavus support for custom output rate
2025-03-18 10:16:34 -07:00
Aleix Conchillo Flaqué
32609b1132
event handlers are now executed in separate tasks
2025-03-18 09:25:39 -07:00
Vaibhav159
4303ed4991
rename service
2025-03-18 20:58:21 +05:30
Mark Backman
4677c34663
Merge pull request #1387 from pipecat-ai/mb/pattern-aggregator
...
Add PatternPairAggregator
2025-03-18 08:46:42 -04:00
Mark Backman
b28276446d
Code review feedback
2025-03-18 07:49:54 -04:00
Mark Backman
2dee882710
Add unit tests
2025-03-18 07:30:37 -04:00
Mark Backman
6ec4052f29
Add CHANGELOG entries
2025-03-18 07:30:36 -04:00
Mark Backman
ddcc1fbb2f
Add foundational example 35
2025-03-18 07:30:11 -04:00
Mark Backman
e731a0d41f
Add PairPatternAggregator
2025-03-18 07:30:11 -04:00
Mark Backman
4918eab4e8
Merge pull request #1371 from pipecat-ai/mb/openai-realtime-transcription
...
Add TranscriptProcessor support for OpenAIRealtimeBetaLLMService
2025-03-18 07:28:07 -04:00
Mark Backman
11987765d8
Merge pull request #1381 from pipecat-ai/mb/recording-example-stt
...
Update the 34-audio-recording.py example to include an STT processor
2025-03-18 07:20:42 -04:00
Mark Backman
6f09ee25b8
Merge pull request #1385 from pipecat-ai/mb/add-neuphonic-readme
...
Add Google Imagen and Neuphonic TTS to README
2025-03-18 07:20:15 -04:00
Mark Backman
83dda8a759
Merge pull request #1390 from adnansiddiquei/add-neuphonic-languages
...
Added 5 new languages for Neuphonic: FR, PT, RU, ZH, HI.
2025-03-18 07:18:27 -04:00
Adnan Siddiquei
188677e601
Added 4 new languages: FR, PT, RU, ZH, HI.
2025-03-18 10:35:22 +00:00
balalo
dc5067407d
Fix ruff check
2025-03-18 11:12:51 +01:00
balalo
1c19777d5e
Fix format
2025-03-18 11:09:40 +01:00
balalo
2e1a18503b
Set tool choice from context aggregator
2025-03-18 10:41:43 +01:00
Lucas Rothman
c57fa93a70
Renamed to sample_rate
2025-03-17 16:22:36 -07:00
Mark Backman
6885d07e88
Simplify the TranscriptProcessor _emit_aggregated_text logic
2025-03-17 16:36:03 -04:00
Mark Backman
acd0660f66
Update GeminiMultimodalLiveLLMService to work with the TranscriptProcessor
2025-03-17 16:36:03 -04:00
Mark Backman
3f002f8ffb
Remove unnecessary TranscriptProcessor examples
2025-03-17 16:36:02 -04:00
Mark Backman
d5776c27f4
Update 19-openai-realtime-beta
2025-03-17 16:35:35 -04:00
Mark Backman
6e6905405b
Update CHANGELOG
2025-03-17 16:35:35 -04:00
Mark Backman
571c10403f
tests: Add additional coverage to test_transcript_processor
2025-03-17 16:35:35 -04:00
Mark Backman
5b6b700214
OpenAIRealtimeBetaLLMService outputs a TTSTextFrame
2025-03-17 16:35:35 -04:00
Mark Backman
1ad8e28025
Update TranscriptProcessor to more robustly handle different TTSTextFrame outputs
2025-03-17 16:35:35 -04:00
Mark Backman
3458f1b6de
Add Google Imagen to README
2025-03-17 11:43:40 -04:00
Mark Backman
02dbef8f5a
Add Neuphonic TTS to README
2025-03-17 11:28:51 -04:00
Zac
1baa52a17e
Enhanced whisper.py with MLX Whisper model support and added optional mlx-whisper to pyproject.toml. Added error handling for missing modules and created a new WhisperSTTServiceMLX class for MLX Whisper integration.
2025-03-16 02:18:54 -04:00
Mark Backman
c1382b0691
Update the 34-audio-recording.py example to include an STT processor
2025-03-15 20:30:35 -04:00
Vaibhav159
5f000efc61
adding example
2025-03-15 10:36:26 +05:30
Vaibhav159
fa7da8f5f6
adding vertex llm
2025-03-15 10:21:40 +05:30
Mark Backman
8b86f6991d
Merge pull request #1343 from pipecat-ai/mb/pipecat-cloud-example
...
Add a Pipecat Cloud deployment example
2025-03-14 20:49:45 -04:00
Mark Backman
d3cd1a6c59
Update with latest starter
2025-03-14 20:40:33 -04:00
Mark Backman
24220f38f0
Add a Pipecat Cloud deployment example
2025-03-14 20:40:29 -04:00
Aleix Conchillo Flaqué
1f8752ab03
Merge pull request #1378 from pipecat-ai/aleix/remove-deprecations
...
removed most deprecations
2025-03-14 14:42:34 -07:00
Aleix Conchillo Flaqué
16d7df1c9f
removed most deprecations
2025-03-14 14:37:08 -07:00
Aleix Conchillo Flaqué
2474211291
Merge pull request #1379 from pipecat-ai/aleix/introduce-text-aggregators
...
introduce text aggregators
2025-03-14 13:03:49 -07:00
Aleix Conchillo Flaqué
b632d71465
TTSService: flush_audio() should be in the base class
2025-03-14 10:48:25 -07:00
Aleix Conchillo Flaqué
f8610a69a5
introduce text aggregators
2025-03-14 10:48:25 -07:00
Aleix Conchillo Flaqué
624a454f8b
Merge pull request #1366 from adnansiddiquei/neuphonic-tts-plugin
...
Add integration for Neuphonic TTS
2025-03-14 10:27:24 -07:00
Aleix Conchillo Flaqué
11ba08b7ba
Merge pull request #1377 from pipecat-ai/aleix/task-upstream-downstream-filters
...
PipelineTask: only call event handlers if a filter is matched
2025-03-14 08:49:24 -07:00
Adnan Siddiquei
11b13d053b
Fixed a bug from previous commit. Removed the concept of model from Neuphonic.
2025-03-14 11:17:22 +00:00
Adnan Siddiquei
7dec8431e1
Review comments by aconchillo.
2025-03-14 10:52:13 +00:00
Aleix Conchillo Flaqué
ce3f3b2edb
Merge pull request #1372 from pipecat-ai/khk-fix-multimodal-live-example
...
fix for 26-gemini-multimodal-live.py
2025-03-13 20:22:07 -07:00
Aleix Conchillo Flaqué
1b3b4ee04a
PipelineTask: only call event handlers if a filter is matched
2025-03-13 18:44:30 -07:00
Mark Backman
676c5d9ba7
Merge pull request #1374 from pipecat-ai/mb/add-riva-to-readme
2025-03-13 20:41:05 -04:00
Mark Backman
6eb3a8409f
README: Add Parakeet and FastPitch
2025-03-13 18:42:19 -04:00
Filipi Fuchter
526f9c2e06
Fixing the voice agent feedback when disconnected.
2025-03-13 18:41:40 -03:00
Kwindla Hultman Kramer
c9a31ea513
fix for 26-gemini-multimodal-live.py
2025-03-13 14:35:47 -07:00
Filipi Fuchter
2770d64a25
Fixing ruff format.
2025-03-13 17:38:11 -03:00
Filipi Fuchter
8a7e305619
Closing the old peer connection
2025-03-13 17:35:47 -03:00
Filipi Fuchter
8f2dadf5a0
Improving the reconnection logic to be able to recreate the peer connection in some cases.
2025-03-13 17:07:32 -03:00
Aleix Conchillo Flaqué
c0c7c5d600
Merge pull request #1370 from pipecat-ai/aleix/minor-ultravox-updates
...
services(ultravox): CHANGELOG, formatting and minor changes
2025-03-13 12:05:13 -07:00
Aleix Conchillo Flaqué
87004937be
services(ultravox): CHANGELOG, formatting and minor changes
2025-03-13 11:49:18 -07:00
Aleix Conchillo Flaqué
b426be3067
Merge pull request #1331 from CerebriumAI/feature/ultravox
...
Added ultravox service
2025-03-13 10:40:00 -07:00
Aleix Conchillo Flaqué
b71e2b97ff
Merge pull request #1368 from pipecat-ai/aleix/pipelinetask-frame-event-handlers
...
PipelineTask: add on_frame_reached_upstream and on_frame_reached_downstream
2025-03-13 10:31:33 -07:00
Aleix Conchillo Flaqué
25dcf7def6
PipelineTask: add on_frame_reached_upstream/on_frame_reached_downstream
2025-03-13 10:26:11 -07:00
Filipi Fuchter
30432639b4
Creating a keep alive connection
2025-03-13 14:20:25 -03:00
Adnan Siddiquei
1bf964a667
Added two examples on how to use Neuphonic as a TTS (07u).
2025-03-13 14:42:42 +00:00
Adnan Siddiquei
08fb931ef6
Swapped NEUPHONIC_API_TOKEN for NEUPHONIC_API_KEY.
2025-03-13 12:10:03 +00:00
Aleix Conchillo Flaqué
c5aa931096
Merge pull request #1358 from pipecat-ai/aleix/abstractmethod-fixes
...
ai_services: fix abstractmethod issues
2025-03-12 17:26:48 -07:00
Filipi Fuchter
d33a4b3a11
Implementing reconnection logic.
2025-03-12 18:23:12 -03:00
Filipi Fuchter
9cad8bfcc6
Increasing the time that we are waiting for the frame.
2025-03-12 15:36:40 -03:00
Mark Backman
b084a3e9e7
Merge pull request #1367 from MaCaki/macaki/rime/send_msg_in_flush_audio
...
[rime client] Sending over trailing space to help indicate end of utt…
2025-03-12 14:25:18 -04:00
macaki
5c9e33bc7a
formatting
2025-03-12 12:20:18 -06:00
Filipi Fuchter
93d8ddf4f2
Only showing the timout warning to receive frame if the client is connected.
2025-03-12 15:13:59 -03:00
Adnan Siddiquei
0b9c4b2255
Fixed a couple of small bugs.
2025-03-12 18:04:48 +00:00
macaki
effb5f6cd8
added changelog
2025-03-12 11:57:25 -06:00
Adnan Siddiquei
ead555eb4b
Corrected versions on pyproject.toml.
2025-03-12 17:39:04 +00:00
macaki
f843482968
[rime client] Sending over trailing space to help indicate end of utterance after a punctuation.
2025-03-12 11:26:43 -06:00
Adnan Siddiquei
23a4933af9
Initial implementation of Neuphonic service. A TTS provider.
2025-03-12 17:15:31 +00:00
Filipi Fuchter
0d05312071
Supporting renegotiation inside the voice agent server.
2025-03-12 11:55:38 -03:00
Filipi Fuchter
f8e33d8b7b
Improving the video transform feedback when we are connecting, and cleaning the pc_id when disconnected.
2025-03-12 11:49:33 -03:00
Filipi Fuchter
f24c5b0aa7
Adding support for renegotiation.
2025-03-12 11:31:18 -03:00
Michael Louis
d9ef19233a
Added foundational example for ultravox
2025-03-12 10:30:23 -04:00
Mark Backman
357334e3c9
Merge pull request #1341 from pipecat-ai/mb/fix-google-typo
...
Add a set_language convenience method for GoogleSTTService
2025-03-12 09:05:52 -04:00
Filipi Fuchter
da25e0c008
Configuring the bot to receive the video live.
2025-03-12 10:00:33 -03:00
Filipi Fuchter
c99d02d8bb
Adding support for interruptions when using SmallWebRTCTransport.
2025-03-12 09:09:11 -03:00
Mark Backman
59ea94af86
Merge pull request #1360 from pipecat-ai/mb/update-cartesia-voice
...
Update Cartesia voice for demos
2025-03-12 08:02:26 -04:00
Mark Backman
4a363bebf0
Add a set_language convenience method for GoogleSTTService
2025-03-12 07:58:29 -04:00
Mark Backman
c196fb5f98
Merge pull request #1342 from pipecat-ai/mb/lmnt-flush-audio
2025-03-11 22:22:38 -04:00
Mark Backman
5f97f6ff94
Add flush_audio() to LmntTTSService
2025-03-11 21:57:54 -04:00
Mark Backman
5860fe5319
Merge pull request #1340 from pipecat-ai/mb/fish-flush
...
Add flush_audio to FishTTSService
2025-03-11 21:56:44 -04:00
Mark Backman
3522bbb533
tmp
2025-03-11 21:55:18 -04:00
Mark Backman
cfca7269f4
Update the Cartesia voice in all demos with one built for sonic-2
2025-03-11 21:53:03 -04:00
Mark Backman
e6f269a903
Add flush_audio to FishTTSService
2025-03-11 21:48:41 -04:00
Mark Backman
468e936a5f
Merge pull request #1356 from pipecat-ai/mb/add-chirp-tts-support
...
Add support for Chirp voices in GoogleTTSService
2025-03-11 20:12:52 -04:00
Lucas Rothman
ecc4411128
Tavus support for custom output rate
2025-03-11 16:02:33 -07:00
Aleix Conchillo Flaqué
740ba4e759
ai_services: fix abstractmethod issues
2025-03-11 14:29:03 -07:00
Filipi Fuchter
e56c8f881c
Full video transformation example using SmallWebRTCTransport.
2025-03-11 11:36:47 -03:00
Filipi Fuchter
a747f08017
Simple voice agent example using SmallWebRTCTransport.
2025-03-11 11:36:23 -03:00
Filipi Fuchter
c6c0b73345
P2P WebRTC transport option to Pipecat: SmallWebRTCTransport.
2025-03-11 11:35:39 -03:00
Filipi Fuchter
fde90ee01d
Creating an EventEmitter util class
2025-03-11 11:33:47 -03:00
Filipi Fuchter
689a844aaf
Created a new transport param to inform if the camera input should be enabled.
2025-03-11 11:33:24 -03:00
Filipi Fuchter
aab98b61a0
Fixed issue where sending too many images per second caused Gemini to ignore them.
2025-03-11 11:32:38 -03:00
Mark Backman
a62741df94
Add support for Chirp voices in GoogleTTSService
2025-03-11 07:56:27 -04:00
Mark Backman
5bd359ada9
Merge pull request #1354 from pipecat-ai/mb/cartesia-changelog
...
Changelog entry for Cartesia model update
2025-03-11 07:20:04 -04:00
Mark Backman
40562402a2
Changelog entry for Cartesia model update
2025-03-10 21:10:11 -04:00
Mark Backman
98e5089fbe
Merge pull request #1353 from kunal-cai/main
...
[Cartesia] Update the default alias for Cartesia TTS Service
2025-03-10 21:07:19 -04:00
Kunal Shah
e1c8a09b60
[Cartesia] Update the default alias for Cartesia TTS Service
2025-03-10 14:43:58 -07:00
Filipi da Silva Fuchter
154fe65011
Merge pull request #1336 from pipecat-ai/fixing_function_calling_examples
...
Pipecat small fixes and refactored function calling examples
2025-03-07 16:10:27 -03:00
Mark Backman
61f534ca34
Merge pull request #1334 from pipecat-ai/aleix/user-and-bot-turn-audio
...
add support for user and bot turn audio
2025-03-06 18:35:56 -05:00
Mark Backman
a91c26785f
Store recording in a folder
2025-03-06 18:31:48 -05:00
Aleix Conchillo Flaqué
d7e93551d2
examples(chatbot-audio-recording): add support for user/bot turn audio
2025-03-06 11:49:01 -08:00
Aleix Conchillo Flaqué
06c742a2ad
AudioBufferProcessor: add on_user_turn_audio_data and on_bot_turn_audio_data
2025-03-06 11:49:01 -08:00
Filipi Fuchter
55b0797fd5
Removing the extra examples inside the unified-format-function-calling folder
2025-03-06 12:00:22 -03:00
Filipi Fuchter
21443b9a08
Refactored gemini multimodal example to use the unified format for function calling.
2025-03-06 11:59:08 -03:00
Filipi Fuchter
4b167a3c3d
Fixing the ruff format.
2025-03-06 10:38:45 -03:00
Filipi Fuchter
2df77430aa
Refactoring the 14 series examples to use the unified format for function calling.
2025-03-06 10:35:26 -03:00
Filipi Fuchter
2d114b15f9
Adding missing flush_audio method to AzureTTSService.
2025-03-06 10:34:25 -03:00
Filipi Fuchter
26000b616d
Fixing the base_whisper services to implement set_language.
2025-03-06 10:15:04 -03:00
Aleix Conchillo Flaqué
710eebab09
Merge pull request #1332 from pipecat-ai/aleix/base-object-and-event-handlers
...
introduce BaseObject class
2025-03-05 13:41:27 -08:00
Dominic Stewart
532423eb4c
Updated example to switch pipelines per the original request ( #1320 )
2025-03-05 13:40:36 -08:00
Aleix Conchillo Flaqué
bb29e50adb
introduce BaseObject class
2025-03-05 13:38:53 -08:00
Filipi da Silva Fuchter
4048d6782b
Merge pull request #1211 from pipecat-ai/function_calling_unified_format
...
Unified format for function calling
2025-03-05 18:30:22 -03:00
Filipi Fuchter
76d36a312b
Adding the unified format function calling to the changelog.
2025-03-05 14:18:37 -03:00
Filipi Fuchter
2a75373c04
Created examples for unified format function calling.
2025-03-05 14:12:30 -03:00
Filipi Fuchter
a840b0e815
Prevents pytest from collecting TestFrameProcessor.
2025-03-05 14:11:52 -03:00
Filipi Fuchter
ebcde719a6
Integration test for function calling.
2025-03-05 14:11:16 -03:00
Filipi Fuchter
5c912927bb
Unit tests for function calling adapters.
2025-03-05 14:11:02 -03:00
Filipi Fuchter
0e55db054e
Created script to fix ruff format issues.
2025-03-05 14:10:47 -03:00
Filipi Fuchter
5967ac0d4f
Implementing unified format for function calling.
2025-03-05 14:10:32 -03:00
Aleix Conchillo Flaqué
1451483cf7
Merge pull request #1330 from pipecat-ai/aleix/playht-update-0.1.12
...
pyproject: update pyht to 0.1.12
2025-03-04 18:35:03 -08:00
Michael Louis
3fe7c1d730
Added ultravox service
2025-03-04 13:59:03 -05:00
Aleix Conchillo Flaqué
c14b85c12b
pyproject: update pyht to 0.1.12
...
Fixes #1309
2025-03-04 10:26:11 -08:00
kompfner
9f3c0219d7
Merge pull request #1329 from pipecat-ai/add-permissions-to-daily-meeting-token-properties
...
Add the `permissions` property to `DailyMeetingTokenProperties`
2025-03-03 14:44:10 -05:00
Aleix Conchillo Flaqué
ec36fef26e
updated CHANGELOG and fix GladiaSTTService formatting
2025-03-03 09:53:03 -08:00
allenmylath
5f1848d24b
Update gladia.py ( #1317 )
...
* Update gladia.py
According to gladia docs
https://docs.gladia.io/api-reference/v2/live/init
speech threshould value close to 1 enables gladia to better isolate speeech from noise.
2025-03-03 09:51:11 -08:00
Aleix Conchillo Flaqué
d6867bd12f
Merge pull request #1321 from pipecat-ai/aleix/allow-setting-context-aggregator-parameters
...
LLMService: add user/assistant args to create_context_aggregator()
2025-03-03 09:48:31 -08:00
Aleix Conchillo Flaqué
17a1f30572
LLMService: add user/assistant args to create_context_aggregator()
2025-03-03 09:46:37 -08:00
Paul Kompfner
8e0dc1f256
Add the permissions property to DailyMeetingTokenProperties
2025-03-03 10:13:25 -05:00
Kwindla Hultman Kramer
b9100beee3
Merge pull request #1327 from pipecat-ai/azure-realtime-changelog
...
CHANGELOG.md entry for AzureRealtimeBetaLLMService
2025-03-02 20:30:40 -08:00
Mark Backman
b8bc3d2565
Merge pull request #1326 from pipecat-ai/mb/11labs-speed
...
Add speed as InputParam to ElevenLabs TTS services
2025-03-02 15:20:01 -05:00
Kwindla Hultman Kramer
3213e85b7d
CHANGELOG.md entry for AzureRealtimeBetaLLMService
2025-03-02 12:16:50 -08:00
Kwindla Hultman Kramer
de3bcd64c4
Merge pull request #1324 from pipecat-ai/azure-realtime
...
Support for Azure OpenAI Realtime API
2025-03-02 12:13:29 -08:00
Mark Backman
ad7f1eec12
Create a function to build voice_settings dictionary
2025-03-02 08:27:29 -05:00
Mark Backman
29310b4e92
Add speed as InputParam to ElevenLabs TTS services
2025-03-02 08:19:44 -05:00
Kwindla Hultman Kramer
2f4d36a146
docstring fixup
2025-03-01 15:44:10 -08:00
Kwindla Hultman Kramer
6c9bb782b1
add __init__.py
2025-03-01 15:42:20 -08:00
Kwindla Hultman Kramer
010d9103d4
support for Azure OpenAI Realtime API
2025-03-01 15:39:19 -08:00
Aleix Conchillo Flaqué
12131eb7c5
Merge pull request #1313 from Vaibhav159/vl_add_automated_formatting
...
using ruff automated formatting to avoid action failures.
2025-02-28 13:12:31 -08:00
Aleix Conchillo Flaqué
80b830322a
Merge pull request #1311 from pipecat-ai/aleix/llm-full-response-aggregator
...
add new LLMFullResponseAggregator
2025-02-28 13:08:06 -08:00
Aleix Conchillo Flaqué
8db9d16174
add new LLMFullResponseAggregator
2025-02-28 13:05:21 -08:00
Aleix Conchillo Flaqué
1c92fab1fb
Merge pull request #1308 from Vaibhav159/vl_google_openai_format
...
adding GoogleLLMOpenAIBetaService
2025-02-28 12:04:37 -08:00
Vaibhav159
974717d1b9
sync with main
2025-03-01 01:16:21 +05:30
Vaibhav159
59fb631390
fixing function calling and adding example
2025-03-01 01:14:37 +05:30
Vaibhav159
4824220260
adding GoogleLLMOpenAIBetaService
2025-03-01 01:14:26 +05:30
Mark Backman
55a338614d
Merge pull request #1312 from pipecat-ai/mb/move-server-message-frame
...
Rename ServerMessageFrame to RTVIServerMessageFrame and move to rtvi.py
2025-02-28 13:59:31 -05:00
Vaibhav159
f033046963
using ruff automated formatting to avoid repeated failures
2025-02-28 08:25:15 +05:30
Mark Backman
6018fc068c
Rename ServerMessageFrame to RTVIServerMessageFrame and move to rtvi.py
2025-02-27 20:07:07 -05:00
Aleix Conchillo Flaqué
d5b634301f
Merge pull request #1302 from pipecat-ai/aleix/cleanup-llm-tts-logging
...
services: minor LLM and TTS logging improvements
2025-02-27 13:51:04 -08:00
Aleix Conchillo Flaqué
a37eb1049d
Merge pull request #1310 from Canonical-AI-Inc/without-audio
...
Optional Recording
2025-02-27 13:37:39 -08:00
Adrian Cowham
803ea9d8bc
update the canonical client so that the audio recording is optional as long as there is a transcript
2025-02-27 12:31:02 -08:00
Mark Backman
499bc25217
Merge pull request #1303 from pipecat-ai/mb/add-server-to-client-msg
...
Add a new generic server to client message and frame type
2025-02-27 12:56:57 -05:00
Mark Backman
53d403af4b
Remove the RTVIServerMessage logic from the RTVIProcessor
2025-02-27 12:50:43 -05:00
Aleix Conchillo Flaqué
a0a8ea1641
Merge pull request #1301 from pipecat-ai/aleix/example-22d-fix-llm-aggregator
2025-02-26 22:39:48 -08:00
Mark Backman
26c68ccd7c
Add a new generic server to client message and frame type
2025-02-26 18:59:06 -05:00
Aleix Conchillo Flaqué
fa010c8644
services: minor LLM and TTS logging improvements
2025-02-26 15:36:25 -08:00
Aleix Conchillo Flaqué
d58f398bc4
examples: fix for 22d-natural-conversation-gemini-audio.py
2025-02-26 13:15:07 -08:00
Aleix Conchillo Flaqué
11383a86a1
Merge pull request #1300 from pipecat-ai/aleix/prepare-0.0.58
...
update CHANGELOG for 0.0.58
2025-02-26 11:31:24 -08:00
Aleix Conchillo Flaqué
daa52ff8df
update CHANGELOG for 0.0.58
2025-02-26 11:29:04 -08:00
Mark Backman
a5f41e22f7
Merge pull request #1299 from pipecat-ai/mb/add-track-level-recording
...
Added on_track_audio_data callback to AudioBufferProcessor for track level recording
2025-02-26 13:49:36 -05:00
Mark Backman
530bb5233d
example: Added a foundational example (34) for audio recording
2025-02-26 13:44:32 -05:00
Aleix Conchillo Flaqué
4a64e09f6c
Merge pull request #1297 from pipecat-ai/aleix/daily-python-0.15.0
...
pyproject: update daily-python, aiohttp and pydantic
2025-02-26 10:26:59 -08:00
Aleix Conchillo Flaqué
74582bb8d5
pyproject: update daily-python, aiohttp and pydantic
2025-02-26 10:22:34 -08:00
Mark Backman
1ca2101e3a
Added on_track_audio_data callback to AudioBufferProcessor for track level recording
2025-02-26 10:48:56 -05:00
Aleix Conchillo Flaqué
e80311c323
Merge pull request #1296 from pipecat-ai/aleix/google-always-send-text-with-audio
...
GoogleLLMService: always send text with audio
2025-02-26 07:47:56 -08:00
Aleix Conchillo Flaqué
2f24c422b6
Merge pull request #1289 from pipecat-ai/aleix/tts-http-improvements
...
small TTS http improvements
2025-02-26 07:47:26 -08:00
Mark Backman
0d0b9fddef
Merge pull request #1291 from pipecat-ai/mb/playht-http-protocol
...
PlayHTHttpTTSService now takes a separate protocol input
2025-02-26 08:09:49 -05:00
Mark Backman
1753cc99f4
PlayHTHttpTTSService now takes a separate protocol input
2025-02-26 08:01:54 -05:00
Aleix Conchillo Flaqué
4f8b036abe
pyproject: remote httpx old dependency and upgrade anthropic/google-genai
2025-02-25 22:28:21 -08:00
Aleix Conchillo Flaqué
f83c89c202
examples: update google examples
2025-02-25 22:28:02 -08:00
Aleix Conchillo Flaqué
bb89a036e5
google: always send text part when sending inline audio
2025-02-25 22:27:38 -08:00
Aleix Conchillo Flaqué
b994a03466
examples: add more HTTP TTS services examples
2025-02-25 21:40:41 -08:00
Aleix Conchillo Flaqué
27161f8e3b
BaseOutputTransport: cleanup audio buffer after bot stops talking
2025-02-25 21:39:47 -08:00
Aleix Conchillo Flaqué
8acf9a488b
tts: some small HTTP-based services improvements
2025-02-25 21:39:47 -08:00
Aleix Conchillo Flaqué
96c6aeaada
Merge pull request #1295 from pipecat-ai/aleix/pipelinetask-keyword-arguments
...
PipelineTask: force constructor keyword arguments
2025-02-25 19:00:58 -08:00
Aleix Conchillo Flaqué
6722aae598
PipelineTask: force constructor keyword arguments
2025-02-25 18:58:47 -08:00
Aleix Conchillo Flaqué
66564392a6
Merge pull request #1293 from pipecat-ai/aleix/log-pipecat-version
...
log pipecat version on application startup
2025-02-25 18:57:52 -08:00
Aleix Conchillo Flaqué
f258f5ab66
Merge pull request #1292 from pipecat-ai/aleix/audiocontext-terminate-nicely
...
AudioContextWordTTSService: wait for all requested audio
2025-02-25 18:56:41 -08:00
Aleix Conchillo Flaqué
f8f0578c3d
log pipecat version on application startup
2025-02-25 18:55:45 -08:00
Aleix Conchillo Flaqué
aa60a413f3
Merge pull request #1294 from pipecat-ai/aleix/improve-test-requirements
...
improve test-requirements.txt
2025-02-25 18:55:18 -08:00
Aleix Conchillo Flaqué
3e66f2378d
improve test-requirements.txt
2025-02-25 17:34:33 -08:00
Aleix Conchillo Flaqué
9a50f33e36
AudioContextWordTTSService: wait for all requested audio
2025-02-25 15:35:47 -08:00
Aleix Conchillo Flaqué
4bd5e9c0a7
Merge pull request #1285 from pipecat-ai/aleix/handle-stop-task-gracefully
...
handle stop task gracefully
2025-02-25 11:25:38 -08:00
Mark Backman
12092c8715
Merge pull request #1288 from pipecat-ai/mb/clean-up-tts-text-input
...
TTSService: Remove newlines before sending text to TTS service to gen…
2025-02-25 14:00:43 -05:00
Mark Backman
92cc6d39f2
TTSService: Remove newlines before sending text to TTS service to generate
2025-02-25 13:37:25 -05:00
Aleix Conchillo Flaqué
34a50033cb
tk: use TkTransportParams in examples
2025-02-25 10:24:24 -08:00
Aleix Conchillo Flaqué
e60b65228b
allow multiple StartFrames
2025-02-25 10:24:04 -08:00
Mark Backman
e74864335b
Merge pull request #1287 from pipecat-ai/mb/30-observer-pipeline-task
...
Example 30: Move observers to PipelineTask
2025-02-25 12:11:23 -05:00
Mark Backman
27a088a457
Merge pull request #1286 from pipecat-ai/mb/update-grok-2
...
Set grok-2 as default model for GrokLLMSService
2025-02-25 12:11:09 -05:00
Mark Backman
cfe72143b8
Example 30: Move observers to PipelineTask
2025-02-25 10:54:25 -05:00
Mark Backman
36a729cbfe
Set grok-2 as default model for GrokLLMSService
2025-02-25 10:00:45 -05:00
Aleix Conchillo Flaqué
d2f006682c
introduce new BaseTaskManager
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
fb7fe540f5
tts: don't connect to websocket if already connected
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
1ec68bd071
make sure we don't create tasks if already created
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
4536d03e82
FrameProcessor: cancel input/push tasks on CancelFrame
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
699704732c
asyncio: re-raise CancelledError in wait_for_task()
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
376d969a77
task: handle StopFrame and StopTaskFrame gracefully
2025-02-24 23:38:51 -08:00
Aleix Conchillo Flaqué
68789dfcf0
frames: add new StopFrame
2025-02-24 21:34:23 -08:00
Aleix Conchillo Flaqué
fe9fc61c4e
Merge pull request #1282 from pipecat-ai/aleix/pipelinetask-observers-constructor
...
PipelineTask: pass observers in contructor parameter
2025-02-24 21:29:46 -08:00
Aleix Conchillo Flaqué
6028f0f23a
PipelineTask: pass observers in contructor parameter
2025-02-24 21:29:17 -08:00
Aleix Conchillo Flaqué
e9a0959e28
Merge pull request #1283 from pipecat-ai/aleix/check-dangling-tasks
...
PipelineTask: add check_dangling_tasks parameter
2025-02-24 21:26:32 -08:00
Dominic Stewart
f66be2cfa7
Dom/gemini system prompt switching ( #1260 )
...
* Updated example to use Gemini
* Fixed typo
* Based on feedback, made the gemini file something that can be called separately
* Updated the readme
* Updated the readme
* Changed example to use gemini 2.0 flash lite
* This works
* Improvement
* I think this works
* Updated the code to use the correct prompt broken down into smaller pieces
* Added a few more things to detect in the prompt
* Fixed import ordering
* Updated prompt for non gemini bot to look for more voicemail examples, plus added logic to detect if we're doing dialin or not to avoid a non-fatal dialin related error
* moved terminate call to handlers class
* Simplified logic for dialin
* Forgot to use the same logic for the openai bot
* Starting to add logic for native audio input for flash lite
* Fixed logic
* Fixed some code based on suggestions
2025-02-24 22:29:55 -06:00
Aleix Conchillo Flaqué
f818bed58f
Merge pull request #1281 from pipecat-ai/aleix/google-context-aggregator-upgrade-context
...
google: updgrade OpenAILLMContext to GoogleLLMContext
2025-02-24 17:37:26 -08:00
Aleix Conchillo Flaqué
07b9be5308
PipelineTask: add check_dangling_tasks parameter
2025-02-24 17:33:10 -08:00
Aleix Conchillo Flaqué
40c2452d6e
google: updgrade OpenAILLMContext to GoogleLLMContext
2025-02-24 15:35:18 -08:00
Aleix Conchillo Flaqué
30cdd1b71a
Merge pull request #1280 from pipecat-ai/aleix/add-completion-timeout
...
services(llm): add on_completion_timeout event
2025-02-24 15:07:20 -08:00
Aleix Conchillo Flaqué
2110b79507
services(llm): add on_completion_timeout event
2025-02-24 14:55:36 -08:00
Aleix Conchillo Flaqué
fc544fa61c
Merge pull request #1272 from pipecat-ai/aleix/tts-websocket-interruptions
...
services: fix some TTS websocket service interruption handling
2025-02-24 14:54:41 -08:00
Mark Backman
976fe95304
Merge pull request #1279 from pipecat-ai/mb/remove-open-optional-dep
...
Remove `openai` optional dependency from services as it's now required
2025-02-24 17:42:53 -05:00
Aleix Conchillo Flaqué
408270b647
lmnt: don't send "eof" before closing the socket
2025-02-24 14:37:37 -08:00
Mark Backman
1dfb75bc9d
Merge pull request #1278 from pipecat-ai/mb/claude-3-7
...
Update AnthropicLLMService to use claude-3-7-sonnet-20250219 by default
2025-02-24 15:41:28 -05:00
Mark Backman
cefc2a1088
Fix test-requirements.text ordering
2025-02-24 15:06:13 -05:00
Mark Backman
3b9b9200ea
Remove openai optional dependency from services as it's now required
2025-02-24 15:05:42 -05:00
Mark Backman
d6f29a0f4b
Update AnthropicLLMService to use claude-3-7-sonnet-20250219 by default
2025-02-24 14:32:00 -05:00
Aleix Conchillo Flaqué
5b762d11ef
Merge pull request #1228 from CarlKho-Minerva/main
...
Missing Cartesia~=1.3.1 → `test-requirements`
2025-02-24 08:47:41 -08:00
Aleix Conchillo Flaqué
2f3e2da6b9
Merge pull request #1259 from pipecat-ai/openai-not-optional
...
Since the `openai` package is used by pretty much everything in pipec…
2025-02-24 08:45:45 -08:00
allenmylath
45058d4a94
Update audio_buffer_processor.py ( #1266 )
2025-02-24 08:41:19 -08:00
Aleix Conchillo Flaqué
5b637bd826
services: fix some TTS websocket service interruption handling
2025-02-24 08:37:22 -08:00
Mark Backman
2d4fd7e903
Merge pull request #1274 from pipecat-ai/mb/add-ellipsis-test
...
Add one additional ellipsis test to test_utils_string
2025-02-23 11:26:20 -05:00
Mark Backman
b5662520aa
Add one additional ellipsis test to test_utils_string
2025-02-23 11:04:24 -05:00
Aleix Conchillo Flaqué
af45c170b5
Merge pull request #1264 from pipecat-ai/aleix/add-log-observers
...
add initial log observers
2025-02-21 15:20:45 -08:00
Aleix Conchillo Flaqué
65f548b2ec
examples(30-observer): update to use LLMLogObserver
2025-02-21 15:15:16 -08:00
Aleix Conchillo Flaqué
b29ab8c608
observers: add LLMLogObserver and TranscriptionLogObserver
2025-02-21 15:15:16 -08:00
Aleix Conchillo Flaqué
d6dc37f0b6
Merge pull request #1269 from pipecat-ai/aleix/endofsentence-support-ellipses
...
utils: add support for ellipses in match_endofsentence()
2025-02-21 15:08:22 -08:00
Aleix Conchillo Flaqué
12bce2e8c0
utils: add support for ellipses in match_endofsentence()
2025-02-21 15:05:50 -08:00
Aleix Conchillo Flaqué
4acf7296e0
Merge pull request #1261 from pipecat-ai/aleix/emualted-frames-being-triggered-prematurely
...
LLMUserContextAggregator: don't reset timer with interim transcription
2025-02-21 10:15:28 -08:00
Aleix Conchillo Flaqué
98706d429c
LLMUserContextAggregator: make sure incoming transcription has text
2025-02-21 10:12:54 -08:00
Aleix Conchillo Flaqué
41720b1a13
LLMUserContextAggregator: don't reset timer with interim transcription
...
It turns out that in some cases we only get interim transcriptions (e.g. someone
is speaking very very softly or someone is talking in the background). In those
cases we don't want to interrupt the bot because there's really nothing to
interrupt the bot for.
We originally thought we should interrupt the bot right at the time we got an
interim frame, but this is causing too many false positives. It's actually
better to simply wait for a real transcription before interrupting (in case VAD
didn't interrupt).
2025-02-21 09:05:56 -08:00
Aleix Conchillo Flaqué
3ef4245166
Merge pull request #1265 from pipecat-ai/aleix/transport-remove-audio-out-is-live
2025-02-21 06:51:09 -08:00
Filipi da Silva Fuchter
3bb0797922
Merge pull request #1257 from pipecat-ai/fastapi_disconnect_issue
...
Fixed an issue where FastAPI was not triggering on_client_disconnected.
2025-02-21 09:15:15 -03:00
Filipi Fuchter
7c7b4c52af
Fixed an issue where EndTaskFrame was not triggering on_client_disconnected or closing the WebSocket in FastAPI.
2025-02-21 09:11:58 -03:00
Aleix Conchillo Flaqué
01f083b7fc
transports: remove TransportParams.audio_out_is_live
2025-02-20 23:33:06 -08:00
Aleix Conchillo Flaqué
91fcaebe25
Merge pull request #1263 from Vaibhav159/vl_fix_deepgram_sample_rate_mismatch
...
fixing deepgram mismatch
2025-02-20 22:39:06 -08:00
Vaibhav159
9c5fe5c85e
fixing deepgram mismatch
2025-02-21 09:32:40 +05:30
Aleix Conchillo Flaqué
7e5e167a4b
Merge pull request #1250 from pipecat-ai/aleix/context-aggregation-simulatenous-text-tools
...
AssistantContextAggregator: append aggregation and tools in the same turn
2025-02-20 17:32:57 -08:00
Aleix Conchillo Flaqué
d04c4b36f3
AssistantContextAggregator: append aggregation and tools in the same turn
2025-02-20 17:29:43 -08:00
Aleix Conchillo Flaqué
a811e53626
Merge pull request #1253 from pipecat-ai/aleix/http-tts-services-stopped-frame
...
HTTP TTS services stopped frame
2025-02-20 17:28:05 -08:00
Paul Kompfner
df57202a05
Since the openai package is used by pretty much everything in pipecat (due to OpenAILLMContext being the standard context representation), let's make it a non-optional dependency.
...
This change solves an issue faced by users who aren't intending to use OpenAI getting scary error messages saying that they need the `openai` optional dependency "in order to use OpenAI", along with an instruction to set the OPENAI_API_KEY environment variable.
Note that with this change we could theoretically remove from pyproject.toml a number of defined optional dependencies that list only the `openai` package as a dependency (like `deepseek`, for example), but I didn't want to "break the API" in terms of how users install/consume pipecat and its set of built-in services.
Finally, I removed the `python-deepcompare` dependency from the `openai` optional dependency, since it appears to me like it was added by mistake (my guess is it was used for debugging during development and then never removed).
2025-02-20 15:21:35 -05:00
Aleix Conchillo Flaqué
69e6f3fdb7
rime: pass aiohttp session to constructor
2025-02-20 07:36:24 -08:00
Aleix Conchillo Flaqué
6809254963
tts: fix metrics and TTSStoppedFrame frame in HTTP services
...
Fixes #1247
2025-02-20 07:36:21 -08:00
Aleix Conchillo Flaqué
81093d3bed
Merge pull request #1252 from pipecat-ai/aleix/remove-vad-extra-logging
...
BaseInputTransport: remove VAD logging
2025-02-20 07:32:20 -08:00
Aleix Conchillo Flaqué
d9a67164f6
Merge pull request #1251 from pipecat-ai/aleix/fish-tts-service-push-stop-frame
...
FishAudioTTSService should push TTSStoppedFrame
2025-02-20 07:32:05 -08:00
Aleix Conchillo Flaqué
98259af54e
update CHANGELOG
2025-02-19 22:05:48 -08:00
Dominic Stewart
039d144c79
examples(phone-bot): updated example to use Gemini ( #1233 )
2025-02-19 22:03:37 -08:00
Aleix Conchillo Flaqué
d0f67fc189
BaseInputTransport: remove VAD logging
...
These logs are very verbose. They were added to try to find an issue that
resulted in being because of low CPU/memory resources, but these logs were not
helpful to determine that.
2025-02-19 21:55:11 -08:00
Aleix Conchillo Flaqué
6e3f96aa83
fish: automatically send TTSStoppedFrame after timeout
2025-02-19 21:41:18 -08:00
Aleix Conchillo Flaqué
293677588d
tts: make push_stop_frames default to 2.0s
2025-02-19 21:39:00 -08:00
Filipi da Silva Fuchter
77e777b1ce
Merge pull request #1249 from pipecat-ai/invoking_call_start_function
...
Fixed an issue that `start_callback` was not invoked for some LLM services
2025-02-19 18:09:00 -03:00
Filipi Fuchter
7e7926059c
Fixed an issue that start_callback was not invoked for some LLM services.
2025-02-19 18:04:20 -03:00
Aleix Conchillo Flaqué
c948754eff
Merge pull request #1248 from pipecat-ai/aleix/daily-transport-room-url
...
daily: add room_url property
2025-02-19 09:46:46 -08:00
Aleix Conchillo Flaqué
83f1a8830d
daily: add room_url property
2025-02-19 09:29:53 -08:00
James Hush
80f8e05fcf
docs: fix transcripts in translation chatbot example ( #1199 )
2025-02-19 16:07:22 +08:00
Aleix Conchillo Flaqué
afd1a1e80b
Merge pull request #1245 from pipecat-ai/aleix/stt-mute-filter-trace-logging
2025-02-18 21:21:55 -08:00
Aleix Conchillo Flaqué
84ac88cad7
STTMuteFilter: change suppressed logging to trace
2025-02-18 18:03:37 -08:00
Aleix Conchillo Flaqué
211163e5c7
Merge pull request #1241 from pipecat-ai/aleix/deepgram-nova-3
...
deepgram: use the new nova-3 model as default
2025-02-18 17:53:04 -08:00
Aleix Conchillo Flaqué
1b0bcebef6
deepgram: use the new nova-3 model as default
2025-02-18 17:51:54 -08:00
Aleix Conchillo Flaqué
89736b03c4
Merge pull request #1243 from pipecat-ai/aleix/add-deepgram-addons
...
deepgram: add ability to provide custom addons
2025-02-18 17:47:48 -08:00
Aleix Conchillo Flaqué
4edda718ed
deepgram: add ability to provide custom addons
2025-02-18 17:45:41 -08:00
Aleix Conchillo Flaqué
22a62edc9e
Merge pull request #1242 from pipecat-ai/aleix/utils-network-exponential
...
network: added exponential_backoff_time() function
2025-02-18 17:44:21 -08:00
Aleix Conchillo Flaqué
50b6cc8135
network: added exponential_backoff_time() function
2025-02-18 17:42:43 -08:00
Aleix Conchillo Flaqué
45cf36925a
Merge pull request #1240 from pipecat-ai/aleix/handle-deepgram-on-error
...
deepgram: handle error event and reconnect
2025-02-18 17:41:29 -08:00
Filipi da Silva Fuchter
83a71e1fec
Merge pull request #1112 from pipecat-ai/bot-ready-signalling-rn
...
React Native client for the bot ready example.
2025-02-18 15:17:38 -03:00
Filipi Fuchter
e809c8680e
Upgrading to use the latest node stable version
2025-02-18 15:12:44 -03:00
Aleix Conchillo Flaqué
c926063d74
deepgram: handle error event and reconnect
2025-02-18 09:52:18 -08:00
Aleix Conchillo Flaqué
0334550356
Merge pull request #1238 from pipecat-ai/aleix/stt-mute-filter-ignore-input-audio-frames
...
STTMuteFilter: ignore audio frames so no transcriptions are generated
2025-02-18 09:48:13 -08:00
Aleix Conchillo Flaqué
90b9dce710
STTMuteFilter: ignore audio frames so no transcriptions are generated
2025-02-17 19:59:05 -08:00
Carl Kho
a5cdd5f1b8
Add Cartesia API key to dot-env.template
2025-02-14 21:29:37 -08:00
Carl Kho
5f937b8479
Update test requirements to include Cartesia version 1.3.1
2025-02-14 21:14:32 -08:00
Aleix Conchillo Flaqué
b45f7fee6f
Merge pull request #1225 from pipecat-ai/aleix/prepare-0.0.57
...
update CHANGELOG for 0.0.57
2025-02-14 18:50:08 -08:00
Aleix Conchillo Flaqué
01c06c5cac
update CHANGELOG for 0.0.57
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
329e89c1d9
TTSService: push BotStoppedSpeakingFrame
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
883410d8ac
FrameProcessor: no need to create an input event every time
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
1f5b790dd0
TTSService: reset processing text during interruptions
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
a107b1cb4b
examples(06a): use CartesiaTTSService
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
63950912f0
LLMAssistantContextAggregator: add missing variable initialization
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
2ce9402571
LLMAssistantResponseAggregator: initialize messages
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
f6912c0f9a
utils: don't consider colon an end of sentence
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
633a4d4c58
FalImageGenService: load image async to not block the event loop
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
67da745bb3
tts: make frame pausing/resuming optional
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
5126d4de92
tts: handle incoming frames pausing/resuming from base TTSService class
2025-02-14 18:47:33 -08:00
Aleix Conchillo Flaqué
426d7ac213
transports: some local audio and tk updates
2025-02-14 18:47:33 -08:00
Mark Backman
9115692c72
Merge pull request #1227 from pipecat-ai/mb/fix-25-error
...
fix: ensure proper Google message format conversion in transcription …
2025-02-14 21:01:05 -05:00
Mark Backman
c26fe3f277
fix: ensure proper Google message format conversion in transcription filter
2025-02-14 20:28:26 -05:00
Mark Backman
47b059d387
Merge pull request #1226 from pipecat-ai/mb/add-transcript-processor-tests
...
tests: add tests for TranscriptProcessor
2025-02-14 19:50:38 -05:00
Mark Backman
a49d81e519
tests: add tests for TranscriptProcessor
2025-02-14 17:10:40 -05:00
Aleix Conchillo Flaqué
b3a575c7c7
Merge pull request #1212 from Vaibhav159/vl_fix_incorrect_has_regular_messages_check
...
fixing google llm service error
2025-02-14 13:16:37 -08:00
Aleix Conchillo Flaqué
790d0c1256
Merge pull request #1224 from M1ngXU/patch-1
...
Update openai.py
2025-02-14 13:13:00 -08:00
Aleix Conchillo Flaqué
ee7e0dc3f7
Merge pull request #1223 from pipecat-ai/aleix/audio-context-tts-service
...
audio context tts service and cartesia fixes
2025-02-14 12:12:42 -08:00
Aleix Conchillo Flaqué
f53ee79ddb
RimeTTSService: use AudioContextWordTTSService
2025-02-14 11:55:54 -08:00
Aleix Conchillo Flaqué
aeadb40c3f
CartesiaTTSService: use AudioContextWordTTSService
...
By supporting multiple audio requests we fix an issue that was causing audio
overlapping.
2025-02-14 11:55:54 -08:00
Aleix Conchillo Flaqué
cacb07f4c2
introduce AudioContextWordTTSService
2025-02-14 11:55:54 -08:00
M1ngXU
0b91d821fb
Update openai.py
...
d
2025-02-14 20:27:08 +01:00
Aleix Conchillo Flaqué
af66a43056
Merge pull request #1222 from pipecat-ai/aleix/websocket-service-handle-clean-disconnection
...
WebsocketService: handle clean server disconnection
2025-02-14 10:33:54 -08:00
Aleix Conchillo Flaqué
e006dcf172
WebsocketService: handle clean server disconnection
...
The websocket async iterator doesn't raise an exception when the server
disconnects cleanly. We should handle that and raise an exception so we can
reconnect.
2025-02-14 10:11:56 -08:00
Filipi da Silva Fuchter
8588f8b0d8
Merge pull request #1220 from pipecat-ai/instant_voice_demo_example
...
Instant voice example.
2025-02-14 14:24:13 -03:00
Filipi Fuchter
bff54547b0
Instant voice example.
2025-02-14 14:19:17 -03:00
Mark Backman
b2754bf208
Merge pull request #1219 from pipecat-ai/mb/markdown-text-filter-tests
...
Add MarkdownTextFilter tests
2025-02-13 21:10:52 -05:00
Mark Backman
9a4942b0d0
Merge pull request #1218 from pipecat-ai/mb/user-idle-tests
...
Add UserIdleProcessor tests
2025-02-13 18:53:22 -05:00
Mark Backman
ed6201910b
Add MarkdownTextFilter tests
2025-02-13 18:51:46 -05:00
Mark Backman
ac5ebc587e
Add tests for UserIdleProcessor
2025-02-13 18:47:29 -05:00
Aleix Conchillo Flaqué
dff4c54e57
Merge pull request #1209 from pipecat-ai/aleix/reimplement-llm-response-aggregators
...
reimplement LLM response aggregators
2025-02-13 15:30:40 -08:00
Aleix Conchillo Flaqué
c744409651
SegmentedSTTService: fix process_audio_frame() arguments
2025-02-13 15:25:22 -08:00
Aleix Conchillo Flaqué
7578fbeaef
update google requirements
2025-02-13 15:25:22 -08:00
Aleix Conchillo Flaqué
5909dff423
LLMContextResponseAggregator: add VAD emulation support
2025-02-13 15:25:22 -08:00
Aleix Conchillo Flaqué
a6502df72c
services: forgot to pass context instead of user aggregator
2025-02-13 13:50:33 -08:00
Aleix Conchillo Flaqué
e0d24d7fc0
update CHANGELOG
2025-02-13 13:21:32 -08:00
Aleix Conchillo Flaqué
99779046a8
services: use push_context_frame()
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
67cdc0063a
BaseTransportOutput: allow pushing frames upstream
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
b28f752afa
tests: add anthropic and google aggregator tests
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
463078e375
initialize assistant aggregators with context and push upstream instead
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
84510fd521
LLMUserContextAggregator: add space between transcriptions
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
9f6a1c093a
LLMUserContextAggregator: reset user speaking time after bot interruption
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
b602e78625
tests: add OpenAI context aggregator tests
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
7c815121ea
LLMContextResponseAggregator: add missing reset() implementation
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
16a107948b
services: missing kwargs in anthropic/openai user context aggregator
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
839aa7d935
llm_response: add some initial docstrings to LLM aggregators
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
4cbcfe2b0b
LLMUserContextAggregator: interrupt the bot if VAD happened a while back
2025-02-13 13:20:38 -08:00
Aleix Conchillo Flaqué
91a628d1ba
UserResponseAggregator: implement on top of LLMUserResponseAggregator
2025-02-13 13:20:37 -08:00
Aleix Conchillo Flaqué
50288eeaaa
tests: add LLM response aggregators tests
2025-02-13 13:20:37 -08:00
Aleix Conchillo Flaqué
e1f2bbceb3
reimplement LLM response aggregators
2025-02-13 13:20:37 -08:00
Aleix Conchillo Flaqué
8bdd7ed0ed
tests: implement langchain tests with run_test()
2025-02-13 13:20:37 -08:00
Aleix Conchillo Flaqué
1b7dfe8126
tests: add a new SleepFrame
...
The new SleepFrame allow us to control when system frames are pushed to the
pipeline.
2025-02-13 13:20:37 -08:00
Aleix Conchillo Flaqué
d1ee851a65
tests: rename some variables to make things clearer
2025-02-13 13:20:37 -08:00
Filipi da Silva Fuchter
0358673b46
Merge pull request #1215 from pipecat-ai/instant_voice_demo
...
Instant voice demo improvements - part 02
2025-02-13 18:14:15 -03:00
Filipi Fuchter
16fe1b10e9
- Added support for the RTVIProcessor to handle buffered audio in base64 format, converting it into InputAudioRawFrame for transport.
...
- Added support for the `RTVIProcessor` to trigger `start_audio_in_streaming` only after the `client-ready` message.
2025-02-13 18:08:55 -03:00
Filipi Fuchter
f001819df8
- Added a new audio_in_stream_on_start field to TransportParams.
...
- Added a new method `start_audio_in_streaming` in the `BaseInputTransport`.
- Updated `DailyTransport` to respect the `audio_in_stream_on_start` field, ensuring it only starts receiving the audio input if it is enabled.
2025-02-13 18:08:36 -03:00
Filipi Fuchter
dceec60186
Updated FastAPIWebsocketOutputTransport to send TransportMessageFrame and TransportMessageUrgentFrame to the serializer.
2025-02-13 18:07:33 -03:00
Filipi Fuchter
b96979a4ed
Update WebsocketServer to not wrap the message inside a text frame.
2025-02-13 18:07:04 -03:00
Mark Backman
745c40def4
Merge pull request #1214 from pipecat-ai/mb/stt-mute-tests
...
Improve STTMuteFilter, add tests
2025-02-13 09:50:43 -05:00
Mark Backman
42ab62716d
Merge pull request #1198 from pipecat-ai/mb/more-whisper-params
...
Add prompt and temperature args to OpenAI and Groq hosted Whisper STT…
2025-02-13 09:16:38 -05:00
Mark Backman
16ba2010aa
Refactor process_frame to be more consistent
2025-02-13 09:15:29 -05:00
Mark Backman
ec0ca46617
Fix temperature docstrings to reference optional
2025-02-13 09:04:20 -05:00
Mark Backman
6ff1f526ff
Merge pull request #1216 from pipecat-ai/mb/google-cloud-speech
...
Add the google-cloud-speech package to the google dependency
2025-02-13 07:04:34 -05:00
Mark Backman
84143cc80c
self._muted now returns from STT process_audio_frames
2025-02-13 07:00:44 -05:00
Mark Backman
229dccedc6
Add the google-cloud-speech package to the google dependency
2025-02-12 23:19:17 -05:00
Aleix Conchillo Flaqué
68aaa1f8f4
Merge pull request #1213 from pipecat-ai/aleix/base-transport-output-bot-vad-stop-secs
...
BaseOutputTransport: use specific VAD stop secs for the bot
2025-02-12 19:01:56 -08:00
Aleix Conchillo Flaqué
f110a45c85
BaseOutputTransport: use specific VAD stop secs for the bot
2025-02-12 19:01:39 -08:00
Mark Backman
1e8a86de63
Handle starting muted, add tests
2025-02-12 19:01:49 -05:00
Mark Backman
ee93e2a2b1
Reorder frame pushing for STTMuteFilter, update STTMuteFrame to SystemFrame
2025-02-12 15:51:18 -05:00
Mark Backman
2e87a019a8
Merge pull request #1208 from pipecat-ai/mb/stt-mute-first-bot-speech
...
Add new STTMuteStrategy: MUTE_UNTIL_FIRST_BOT_COMPLETE
2025-02-12 12:21:02 -05:00
Vaibhav159
687b3d9d4c
fixing google llm service error
2025-02-12 22:22:04 +05:30
Mark Backman
397768d872
Add new STTMuteStrategy: MUTE_UNTIL_FIRST_BOT_COMPLETE
2025-02-12 10:59:28 -05:00
Mark Backman
24cdcd74e6
Merge pull request #1197 from pipecat-ai/mb/google-stt
...
Add GoogleSTTService
2025-02-12 10:16:18 -05:00
Mark Backman
5d6370690c
Add _reconnect_if_needed to simplify reconnect logic
2025-02-12 10:11:18 -05:00
Mark Backman
9f728aa623
Add reconnect logic to handle Google's 5 min time limit
2025-02-12 10:11:18 -05:00
Mark Backman
32d8f6153f
Update InputParams to languages: support str or List of Languages
2025-02-12 10:11:18 -05:00
Mark Backman
8c2071f248
Add ClientOptions for region selection
2025-02-12 10:11:18 -05:00
Mark Backman
a9c2197dc6
Add ability to update options
2025-02-12 10:11:18 -05:00
Mark Backman
ce0358804b
Docstrings and cleanup
2025-02-12 10:11:18 -05:00
Mark Backman
66a6a6a295
Enable interim transcriptions, add VAD events option
2025-02-12 10:11:18 -05:00
Mark Backman
9f1732c390
Update CHANGELOG and README
2025-02-12 10:11:17 -05:00
Mark Backman
b44ddf2456
07n uses all Google services
2025-02-12 10:09:36 -05:00
Mark Backman
17420f4d0c
Update language support
2025-02-12 10:09:36 -05:00
Mark Backman
6cb55ec2cb
Add GoogleSTTService
2025-02-12 10:09:36 -05:00
Filipi da Silva Fuchter
e2b4554a54
Merge pull request #1129 from pipecat-ai/instant_voice_demo
...
Pipecat improvements for the instant voice demo
2025-02-12 11:53:40 -03:00
Mark Backman
fd68b82e48
Merge pull request #1163 from pipecat-ai/mb/rime-websocket
...
Add RimeTTSService
2025-02-12 09:51:56 -05:00
Filipi Fuchter
cc90f5ab9f
Sending the RTVI messages to the websocket
2025-02-12 11:46:49 -03:00
Filipi Fuchter
08f40d9179
Adding support to DailyTransport receive raw-audio through appMessage
2025-02-12 11:46:37 -03:00
Aleix Conchillo Flaqué
80e1325621
include codecov.yml
2025-02-11 23:46:19 -08:00
Aleix Conchillo Flaqué
ed76a5bfa5
Merge pull request #1202 from pipecat-ai/aleix/fix-simli-audiolayout-error
...
simli: fix audio layout error
2025-02-11 22:24:22 -08:00
Mark Backman
69b0d9035f
Mark end_time as unused
2025-02-11 17:44:52 -05:00
Mark Backman
dcc63dd648
Use the vendor default for temperature
2025-02-11 14:29:33 -05:00
Aleix Conchillo Flaqué
2d08f42870
Merge pull request #1204 from pipecat-ai/aleix/add-coverage-support
...
github: add coverage support
2025-02-11 11:09:25 -08:00
Mark Backman
0814c0bc82
Merge pull request #1203 from pipecat-ai/expose-update-remote-participants-on-daily-transport
...
Expose `update_remote_participants()` from `DailyTransport`
2025-02-11 13:57:08 -05:00
Paul Kompfner
28e233b195
Update CHANGELOG to reflect the addition of update_remote_participants()
2025-02-11 13:23:47 -05:00
Aleix Conchillo Flaqué
6e4d2d6ade
examples: fix more dependabot warnings
2025-02-11 10:09:33 -08:00
Aleix Conchillo Flaqué
266135ec54
examples: fix dependabot warnings
2025-02-11 10:07:05 -08:00
Aleix Conchillo Flaqué
d81aa48262
test-requirements: update transformers to 4.48.0
2025-02-11 10:04:21 -08:00
Aleix Conchillo Flaqué
8c7752fbc2
github: add coverage support
2025-02-11 09:58:21 -08:00
Julien Le Bourg
77fb63372a
fix: incorrectly changed the base type in my last pull request for L… ( #1184 )
...
* fix: incorrectly changed the base type in my last pull request for LocalAudioTransport
* update examples to use the new LocalTransportParams
* add local device select example
2025-02-11 08:35:57 -08:00
Paul Kompfner
5a8279d3c2
Expose update_remote_participants() from DailyTransport
2025-02-11 11:28:03 -05:00
Aleix Conchillo Flaqué
4db620198a
simli: fix audio layout error
...
Fixes #1201
2025-02-11 07:05:35 -08:00
Mark Backman
d35f4c6b99
Add prompt and temperature args to OpenAI and Groq hosted Whisper STT services
2025-02-10 21:06:37 -05:00
Aleix Conchillo Flaqué
0a990b2aaa
Merge pull request #1196 from pipecat-ai/aleix/audio-buffer-processor-continuous-intermittent-stream
...
AudioBufferProcessor: handle continuous and intermittent user audio
2025-02-10 16:07:12 -08:00
Mark Backman
97586b132d
Simplify _calculate_word_times
2025-02-10 18:45:49 -05:00
Mark Backman
8020db350e
Update RimeHttpTTSService to use mistv2 model by default
2025-02-10 18:45:48 -05:00
Mark Backman
54f64b8dad
Code review feedback
2025-02-10 18:45:08 -05:00
Mark Backman
8f8a3ae7f9
Add RimeTTSService
2025-02-10 18:45:06 -05:00
Mark Backman
344aff5681
Merge pull request #1191 from pipecat-ai/mb/azure-tts-error-handling
...
Improve AzureTTSService error handling
2025-02-10 18:01:39 -05:00
Mark Backman
0d2e90cff1
Merge pull request #1190 from pipecat-ai/mb/languages-hosted-whisper
...
Add language support to OpenAI and Groq hosted Whisper
2025-02-10 17:49:38 -05:00
Mark Backman
1a8dd6b713
Improve AzureTTSService error handling
2025-02-10 17:48:55 -05:00
Mark Backman
2dc585aee0
Merge pull request #1185 from pipecat-ai/mb/update-readme-hacking
...
Add missing pip install -e . step to the README, and clarify steps
2025-02-10 17:45:58 -05:00
Mark Backman
a64fa44811
Merge pull request #1186 from pipecat-ai/mb/whisper-multilingual
...
Add language support to WhisperSTTService
2025-02-10 17:26:10 -05:00
Aleix Conchillo Flaqué
baeb83484d
Merge pull request #1194 from Vaibhav159/vl_fix_elevenlabs_disconnect_issue
...
fixing disconnect issue
2025-02-10 13:41:59 -08:00
Vaibhav159
b0c3f80963
resolve merge conf
2025-02-11 03:03:32 +05:30
Aleix Conchillo Flaqué
eb3c9b1e75
AudioBufferProcessor: handle continuous and intermittent user audio
...
Fixes #1172
2025-02-10 11:26:31 -08:00
Mark Backman
ad4cbdb1ec
Merge pull request #1159 from Canonical-AI-Inc/gemini-rag
...
Gemini 2.0 Flash Lite RAG example
2025-02-10 13:42:11 -05:00
Aleix Conchillo Flaqué
32baee924b
RTVI: fix premature bot-tts-text messages ( #1193 )
2025-02-10 10:37:54 -08:00
Adrian Cowham
9cc53509d1
PR feedback: renamed file, added docstring, changed file read logic
2025-02-10 09:39:01 -08:00
Vaibhav159
2c62d3bf32
break once ConnectionClosed error
2025-02-10 23:04:05 +05:30
Vaibhav159
b06b16adb7
fixing disconnect issue
2025-02-10 22:55:20 +05:30
Mark Backman
cd52d73027
Add language support to OpenAI and Groq hosted Whisper
2025-02-10 10:18:00 -05:00
Mark Backman
c9d8c572c7
Add language support to WhisperSTTService
2025-02-09 10:51:23 -05:00
Mark Backman
d9439fd398
Add missing pip install -e . step to the README, and clarify steps
2025-02-09 09:15:10 -05:00
Mark Backman
081abcedb3
Merge pull request #1176 from pipecat-ai/mb/stt-mute-deprecate-stt-service
...
Deprecate stt_service parameter in STTMuteFilter
2025-02-09 08:35:22 -05:00
Mark Backman
1455e24ad1
Add keyword args, collocated warnings import with the deprecation
2025-02-09 08:29:20 -05:00
Mark Backman
4613cf4790
Merge pull request #1181 from pipecat-ai/mb/daily-docstrings
...
Add docstrings to daily.py
2025-02-09 08:05:59 -05:00
Mark Backman
7aa2e1209d
Merge pull request #1177 from pipecat-ai/mb/perplexity
...
Add PerplexityLLMService
2025-02-09 08:05:46 -05:00
Mark Backman
76daaab6ca
Add PerplexityLLMService
2025-02-09 08:00:31 -05:00
Mark Backman
37cfe870cc
Merge pull request #1183 from pipecat-ai/mb/add-groq-stt
...
Add GroqSTTService, BaseWhisperSTTService, and refactor OpenAISTTService
2025-02-09 07:56:35 -05:00
Mark Backman
160167758b
Add docstrings to daily.py
2025-02-09 07:53:51 -05:00
Mark Backman
4b634713a5
Merge pull request #1182 from pipecat-ai/mb/28c-optional-db
...
Update 28c option to output to log line only by default
2025-02-09 07:52:21 -05:00
Mark Backman
72954d5f15
Remove to base_whisper.py
2025-02-09 07:51:30 -05:00
Mark Backman
f2b07271c1
Update GroqLLMService to use llama-3.3-70b-versatile as the default model
2025-02-09 07:51:30 -05:00
Mark Backman
32b9de5f51
Add GroqSTTService, BaseWhisperSTTService, and refactor OpenAISTTService
2025-02-09 07:51:28 -05:00
Mark Backman
71ce8f9bcf
Merge pull request #1179 from pipecat-ai/mb/remove-command-dash-badge
...
Remove CommandDash badge from README
2025-02-09 07:47:32 -05:00
Mark Backman
7d05728e2f
Update 28c option to output to log line only by default
2025-02-08 10:00:45 -05:00
Mark Backman
dee5448b57
Merge pull request #1123 from pipecat-ai/cb/sqlite
...
Add SQLite storage to the Gemini persistent storage example
2025-02-08 09:07:52 -05:00
Mark Backman
d67861925a
Merge pull request #1128 from golbin/whisper-api
...
Add Whisper STT service using OpenAI API
2025-02-08 08:35:26 -05:00
Mark Backman
0180619d44
Merge pull request #1173 from TheCodingLand/local-pyaudio-device-ids
...
adds configurable device ids for local audio transport
2025-02-08 08:04:00 -05:00
Mark Backman
f07e498612
Remove CommandDash badge from README
2025-02-08 07:59:39 -05:00
TheCodingLand
57964cb929
fix LocalAudioTransport param type
2025-02-08 12:32:20 +01:00
TheCodingLand
6840c77684
apply ruff formatting
2025-02-08 12:03:23 +01:00
Mark Backman
a1b58115ce
Deprecate stt_service parameter in STTMuteFilter
2025-02-07 19:24:03 -05:00
chadbailey59
23eb6e3d46
storybot fixes ( #1175 )
...
* storybot fixes
* readme cleanup
2025-02-07 13:58:02 -06:00
Mark Backman
74a2c38c6c
Merge pull request #1174 from pipecat-ai/mb/bump-google-genai-version
...
Bump google-genai version to 1.0.0
2025-02-07 14:53:44 -05:00
Mark Backman
90b217fda8
Bump google-genai version to 1.0.0
2025-02-07 14:32:37 -05:00
Aleix Conchillo Flaqué
6855bc0ada
Merge pull request #1166 from pipecat-ai/aleix/google-rtvi-observer
...
rtvi: separate specific google RTVI into a GoogleRTVIObserver
2025-02-08 03:19:02 +08:00
TheCodingLand
a359434307
remove Doc and Annotated imports
2025-02-07 19:42:34 +01:00
TheCodingLand
856c8959c3
enhance doc
2025-02-07 19:38:26 +01:00
TheCodingLand
8da7a42137
adds configurable input and output device ids for local audio
2025-02-07 19:23:18 +01:00
Aleix Conchillo Flaqué
510a0f5ef5
rtvi: deprecate RTVI.observer()
2025-02-07 09:19:43 -08:00
Aleix Conchillo Flaqué
03ac744bcf
rtvi: deprecate frame processors
2025-02-07 09:17:29 -08:00
Aleix Conchillo Flaqué
b058461a7d
GoogleRTVIObserver: add explicit constructor
2025-02-07 09:15:32 -08:00
Mark Backman
abd9f16b90
Export .rtvi, update new-chatbot example, rename and update foundational 32
2025-02-07 09:15:32 -08:00
Aleix Conchillo Flaqué
d07732f2e8
rtvi: separate specific google RTVI into a GoogleRTVIObserver
2025-02-07 09:15:32 -08:00
Aleix Conchillo Flaqué
4d25582e16
dev-requirements: update pyright and ruff
2025-02-06 21:51:57 -08:00
Aleix Conchillo Flaqué
d4b2160f9c
Merge pull request #1161 from pipecat-ai/aleix/prepare-0.0.56
...
update CHANGELOG for 0.0.56
2025-02-06 13:50:04 -08:00
Aleix Conchillo Flaqué
dd7926aab5
update CHANGELOG for 0.0.56
2025-02-06 13:45:13 -08:00
Aleix Conchillo Flaqué
070bf66980
transports: fix local transports audio cleanup
2025-02-06 13:45:13 -08:00
Aleix Conchillo Flaqué
962fc27dbd
Merge pull request #1160 from pipecat-ai/aleix/fix-unit-test-logging
...
tests: remove logger from tests.utils
2025-02-06 13:26:37 -08:00
Mark Backman
3d4d6132fc
Merge pull request #1158 from pipecat-ai/mb/update-22c
...
Update foundation examples 22b, 22c, and 22d to be ready for function…
2025-02-06 16:25:05 -05:00
Aleix Conchillo Flaqué
a96d9294b7
tests: remove logger from tests.utils
2025-02-06 13:18:28 -08:00
Aleix Conchillo Flaqué
a6e78550d5
Merge pull request #1156 from pipecat-ai/aleix/prefer-optional
...
prefer Optional over to "| None"
2025-02-06 13:08:48 -08:00
Adrian Cowham
d9f6b7b93c
added an example using using Gemini's large context window for RAG
2025-02-06 12:49:29 -08:00
Mark Backman
969de92ad9
Update foundation examples 22b, 22c, and 22d to be ready for function calling
2025-02-06 15:36:16 -05:00
Aleix Conchillo Flaqué
c4dbe92b30
prefer Optional over to "| None"
2025-02-06 11:11:37 -08:00
Aleix Conchillo Flaqué
684764fece
Merge pull request #1155 from pipecat-ai/aleix/sentry-fixes-and-example
...
sentry fixes and example
2025-02-06 11:09:31 -08:00
Aleix Conchillo Flaqué
c4be07693f
examples: added sentry-metrics example
2025-02-06 10:46:04 -08:00
Aleix Conchillo Flaqué
c5d5ca8232
SentryMetrics: use transactions and call parent methods
2025-02-06 10:44:38 -08:00
Mark Backman
428e763814
Merge pull request #1149 from pipecat-ai/mb/update-google-default-llm-model
...
Use gemini-2.0-flash-001 as the default model for GoogleLLMService
2025-02-06 12:41:13 -05:00
Mark Backman
0efa2711ff
Merge pull request #1152 from pipecat-ai/mb/docstrings
...
Add docstrings for PipelineTask and related classes/functions
2025-02-06 12:30:12 -05:00
Mark Backman
4904f52cee
Use gemini-2.0-flash-001 as the default model for GoogleLLMService
2025-02-06 12:29:15 -05:00
Aleix Conchillo Flaqué
dbcf14ddb4
Merge pull request #1154 from pipecat-ai/aleix/twilio-telnyx-sample-rates
...
serializers: don't update twilio/telnyx sample rates
2025-02-06 09:27:42 -08:00
Aleix Conchillo Flaqué
7c13ec10d9
examples: cleanup ElevenLabsTTSService constructor arguments
2025-02-06 09:25:52 -08:00
Aleix Conchillo Flaqué
29b9dccc53
serializers: don't update twilio/telnyx sample rates
2025-02-06 09:25:52 -08:00
Aleix Conchillo Flaqué
e8ce826473
Merge pull request #1151 from pipecat-ai/aleix/base-output-transport-resample
...
BaseOutputTransport: resample incoming audio if needed
2025-02-06 09:25:07 -08:00
Aleix Conchillo Flaqué
bbb991dfd8
Merge pull request #1153 from pipecat-ai/aleix/base-input-transport-show-vad
...
BaseInputTransport: show VAD results when interruptions not allowed
2025-02-06 09:24:12 -08:00
Mark Backman
4432e7e4f7
Add docstrings for PipelineTask and related classes/functions
2025-02-06 11:04:54 -05:00
Aleix Conchillo Flaqué
ee9cce64b2
BaseInputTransport: show VAD results when interruptions not allowed
2025-02-06 07:40:03 -08:00
Aleix Conchillo Flaqué
1ae4f0150d
BaseOutputTransport: resample incoming audio if needed
2025-02-06 07:37:43 -08:00
Mark Backman
4c77c3ed34
Merge pull request #1148 from pipecat-ai/mb/fix-twilio-serializer
...
Fix sample rate handling in Twilio and Telnyx serializers
2025-02-06 10:25:13 -05:00
Aleix Conchillo Flaqué
975b97472a
Merge pull request #1144 from pipecat-ai/aleix/frame-processor-missing-init-warning
...
FrameProcessor: add an error about missing super().process_frame(...)
2025-02-06 07:18:35 -08:00
Mark Backman
c8ccf13bc7
fix: Use audio_in_sample_rate to deserialize data for TelnyxFrameSerializer
2025-02-06 09:59:21 -05:00
Mark Backman
ba59736f87
fix: Use audio_in_sample_rate to deserialize data for TwilioFrameSerializer
2025-02-06 09:55:15 -05:00
Jin Kim
5989e1ed16
Merge branch 'main' into whisper-api
2025-02-06 13:14:36 +09:00
Aleix Conchillo Flaqué
bc21a0b817
FrameProcessor: add an error about missing super().process_frame(...)
2025-02-05 18:33:03 -08:00
Aleix Conchillo Flaqué
99d3227ff5
Merge pull request #1126 from pipecat-ai/aleix/prepare-0.0.55
...
update CHANGELOG for 0.0.55
2025-02-05 11:32:39 -08:00
Aleix Conchillo Flaqué
7730f59635
update CHANGELOG for 0.0.55
2025-02-05 11:30:40 -08:00
Aleix Conchillo Flaqué
ba31546c32
Merge pull request #1139 from pipecat-ai/aleix/task-start-metadata
...
pipeline task start metadata and unit test improvements
2025-02-05 10:51:51 -08:00
Aleix Conchillo Flaqué
a363d12d1f
dev-requirements: fix conflicts because of nvidia-riva-client
2025-02-05 10:34:46 -08:00
Aleix Conchillo Flaqué
feab9c8fa2
tests: run_test() now uses PipelineTask
2025-02-05 10:34:38 -08:00
Aleix Conchillo Flaqué
61f6669926
task: allow passing StartFrame metadata via start_metadata param
2025-02-05 10:34:38 -08:00
Aleix Conchillo Flaqué
3be69908d2
Merge pull request #1131 from pipecat-ai/aleix/global-audio-sample-rates
...
introduce PipelineParams audio input/output sample rates
2025-02-05 08:11:25 -08:00
Aleix Conchillo Flaqué
fcb80ec330
playht: don't set sample_rate in _settings
2025-02-05 07:46:24 -08:00
Mark Backman
c9f5684e2f
OpenAITTSService: Add warning about changing sample_rate
2025-02-05 10:13:46 -05:00
Mark Backman
c257fa1573
AzureTTSService, AzureHttpTTSService: add start() method
2025-02-05 10:05:19 -05:00
Mark Backman
97c55da29f
PlayHTHttpTTSService: add start() method to set sample_rate
2025-02-05 09:54:41 -05:00
Aleix Conchillo Flaqué
49426aa9a1
transport(websocket): improve exception logging
2025-02-04 23:50:45 -08:00
Aleix Conchillo Flaqué
0a333c26da
services(elevenlabs): warn if sample rate not supported
2025-02-04 23:50:21 -08:00
Aleix Conchillo Flaqué
75a29424ff
examples(telnyx-chatbot): use cartesia so we can use 8khz
2025-02-04 23:49:50 -08:00
Filipi da Silva Fuchter
cd1b429308
Merge pull request #1133 from pipecat-ai/fixing_krisp_issue
...
Fixing the issue in Krisp when trying to create more than one
2025-02-04 20:44:29 -03:00
Filipi Fuchter
7f1ae4b8cc
Fixing the issue in Krisp when trying to create more than one filter in the same process.
2025-02-04 20:10:56 -03:00
Aleix Conchillo Flaqué
af9fd811cd
examples(moondream-chatbot): fix UserImageRequester
2025-02-04 14:37:53 -08:00
Aleix Conchillo Flaqué
69f5c9b9d3
update anthropic and openpipe versions
2025-02-04 14:37:36 -08:00
Aleix Conchillo Flaqué
ab45e481be
introduce PipelineParams audio input/output sample rates
2025-02-04 14:12:56 -08:00
Pedro Moreira
79ac696973
Add support for Piper TTS
2025-02-04 13:51:33 -03:00
Jin Kim
ef1e4277d3
Add an example for Whisper using OpenAI API
2025-02-04 10:32:55 +09:00
Jin Kim
823b763b25
Change OpenAI example file name
2025-02-04 10:28:06 +09:00
Jin Kim
3cb189eb1f
Add whisper STT service using OpenAI API
2025-02-04 10:27:28 +09:00
Aleix Conchillo Flaqué
cc54255c41
Merge pull request #1125 from pipecat-ai/aleix/twilio-chatbot-improvements
2025-02-03 11:10:33 -08:00
Aleix Conchillo Flaqué
1cdb66f889
examples(twilio-chatbot): create sample rate variable
2025-02-03 10:58:06 -08:00
Aleix Conchillo Flaqué
51a86a509c
examples: multiple twilio-chatbot improvements
2025-02-03 10:36:24 -08:00
Aleix Conchillo Flaqué
824898f7b7
Merge pull request #1121 from pipecat-ai/aleix/audio-resamplers
...
introduce audio resamplers
2025-02-03 10:32:55 -08:00
Aleix Conchillo Flaqué
57dadb6359
audio(utils): some variable renames
2025-02-03 09:33:04 -08:00
Aleix Conchillo Flaqué
5dcdc68ef5
examples: fix 22 series initial gate state
2025-02-03 09:16:58 -08:00
Aleix Conchillo Flaqué
aafb2db620
GatedOpenAILLMContextAggregator: use keyword argument and add start_open
2025-02-03 09:16:44 -08:00
Aleix Conchillo Flaqué
f3f22cf61c
AudioBufferProcessor: add start_recording()/stop_recording()
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
371c2f3704
canonical: do not reset audio buffers
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
1f14f62696
AudioBufferProcessor: fix audio buffer silence computation
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
06449eff2c
BaseAudioResampler: make resample() async
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
dcfb86583d
serializers: serialize()/deserialize() are now async
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
cda34a1320
AudioBufferProcessor: fix user/bot audio buffers silence padding
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
13611fd8e1
AudioBufferProcessor: call callback on CancelFrame
2025-02-01 11:06:58 -08:00
Aleix Conchillo Flaqué
fc89aad469
introduce audio resamplers
2025-02-01 11:06:55 -08:00
Aleix Conchillo Flaqué
6c7474e1a2
frames: add pass to DTMFFrames
2025-01-31 18:37:40 -08:00
Aleix Conchillo Flaqué
95f0dbf3f3
CHANGELOG.md: task.cancel() and EndFrame clarification
2025-01-31 18:35:35 -08:00
Aleix Conchillo Flaqué
11aeb68ddb
frames: fix type s/OuputDTMFFrame/OutputDTMFFrame/
2025-01-31 18:28:38 -08:00
Aleix Conchillo Flaqué
a43c102fc8
Merge pull request #1064 from jcbjoe/jg/additional_dtmf_frames
...
Added: Additional DTMF frames
2025-01-31 18:25:08 -08:00
Chad Bailey
d236973c0f
moved sqlite code back to a single example
2025-01-31 23:18:06 +00:00
Mark Backman
16b49bdce6
Merge pull request #1122 from pipecat-ai/mb/openai-org-id
...
Add organization and project level auth in OpenAILLMService
2025-01-31 14:35:26 -05:00
Mark Backman
41477c8f78
Add organization and project level auth in OpenAILLMService
2025-01-31 14:27:25 -05:00
Aleix Conchillo Flaqué
bb9a2560c3
Merge pull request #1118 from pipecat-ai/aleix/task-manager
...
introduce TaskManager
2025-01-31 10:24:52 -08:00
Aleix Conchillo Flaqué
002699f16c
rtvi: delay creating tasks until we get StartFrame
2025-01-31 10:06:11 -08:00
chadbailey59
a17243bc1e
More Storybot updates ( #1116 )
...
* initial changes for gemini storybot
* storybot updates for gemini
* more storybot updates
* interim interruptible commit
* cleanup
* cleanup
* cleanup
* first draft
* wip
* more storybot fixes
* more storybot updates WIP
* committing before changing the image prompting strategy
* wip
* prompt updating
* cleanup
* cleanup
* cleanup
* readme cleanup
* fixup
2025-01-30 20:13:18 -06:00
Aleix Conchillo Flaqué
d95819746a
tests: make sure QueuedFrameProcessor push frames
2025-01-30 13:48:44 -08:00
Aleix Conchillo Flaqué
b65f32e8e1
task: start TaskObserver when tasks can be created
...
We have to start proxy observer tasks once we know the TaskManager has an event
loop.
2025-01-30 13:46:56 -08:00
Aleix Conchillo Flaqué
0131d0a531
examples: make sure unhandled frames are always pushed
2025-01-30 13:15:49 -08:00
Aleix Conchillo Flaqué
642affb2fe
add missing super().process_frame() calls
2025-01-30 13:15:17 -08:00
Aleix Conchillo Flaqué
a145005498
SyncParallelPipeline: cleanup source/sink processors
2025-01-30 13:13:02 -08:00
Aleix Conchillo Flaqué
241f241ed9
SyncParallelPipeline: don't add source/sink processors inside pipeline
2025-01-30 13:12:37 -08:00
Aleix Conchillo Flaqué
85e572e2d8
gladia: cleanup receive messages task
2025-01-30 13:10:47 -08:00
Aleix Conchillo Flaqué
10716e8ec1
utils: protect obj_id() and obj_count() with a lock
2025-01-30 13:10:36 -08:00
Aleix Conchillo Flaqué
41d60a14cc
introduce TaskManager and PipelineRunner event loop
2025-01-30 13:10:36 -08:00
Aleix Conchillo Flaqué
e69c065a86
update CHANGELOG and fix formatting
2025-01-30 08:55:29 -08:00
Aleix Conchillo Flaqué
f90c17ab30
Merge pull request #1083 from team-telnyx/creating_telnyx_chatbot
...
Creating telnyx chatbot
2025-01-30 08:49:20 -08:00
Aleix Conchillo Flaqué
bc4fdd587a
Merge pull request #1103 from pipecat-ai/aleix/tts-service-push-silence-before-tts-stop-frame
...
services(tts): allow pushing silence audio before TTSStoppedFrame
2025-01-30 08:48:41 -08:00
Aleix Conchillo Flaqué
665a6017f9
services(tts): allow pushing silence audio before TTSStoppedFrame
2025-01-30 08:46:56 -08:00
Aleix Conchillo Flaqué
4119d7a115
Merge pull request #1104 from pipecat-ai/aleix/twilio-transport-message-frames
...
serializers(twilio): handle transport message frames
2025-01-30 08:45:55 -08:00
Aleix Conchillo Flaqué
2634b03ffa
serializers(twilio): handle transport message frames
2025-01-30 08:30:09 -08:00
Aleix Conchillo Flaqué
6a50759b9f
Merge pull request #1105 from pipecat-ai/aleix/websocket-client
...
added new websocket client transport
2025-01-30 08:28:26 -08:00
Mark Backman
7982faba67
Merge pull request #1115 from pipecat-ai/mb/elevenlabs-language-fixes
...
Improve ElevenLabs language checking logic
2025-01-30 10:03:22 -05:00
Mark Backman
2b4bf57c04
Improve ElevenLabs language checking logic
2025-01-30 09:52:36 -05:00
Filipi Fuchter
7e3e126730
Migrating the base API URL for the react native example to an .env file.
2025-01-30 10:42:16 -03:00
Filipi Fuchter
75ca0571bb
Improving the layout from the bot ready react native demo.
2025-01-30 10:31:04 -03:00
Filipi Fuchter
a48e5d0714
Only sending the message when it is a remote audio track.
2025-01-30 10:14:37 -03:00
Filipi Fuchter
2b6a992207
Sending the app-message to start playing audio once the track has started.
2025-01-30 09:37:33 -03:00
Filipi Fuchter
24cf106ed2
Refactoring the code to ask for the room that it should connect.
2025-01-30 09:14:18 -03:00
Rafal Skorski
b93e4ab9cb
Formatting adjusted and the encoding selection moved from TelnyFrameSerilaizer to websocket_endpoint function in server.py
2025-01-30 12:52:30 +01:00
Dominic Stewart
c140c04b9a
Merge pull request #1080 from DominicStewart/dom/voicemail-detection-bot
...
Add voicemail detection example
2025-01-30 09:20:12 +09:00
Dominic
a7c8d2af8e
Removed extra space too
2025-01-30 09:18:29 +09:00
Dominic
f3f520a76a
Removed formatting that vs code automatically adds to readme file
2025-01-30 09:17:27 +09:00
Mark Backman
5e0f42a3e0
Merge pull request #1111 from pipecat-ai/mb/gemini-restructure-messages
...
GoogleLLMContext: Allow _restructure_from_openai_messages to handle c…
2025-01-29 19:06:47 -05:00
Filipi Fuchter
95c8346cb5
Starting to create a react native client for the bot ready example.
2025-01-29 19:00:42 -03:00
Mark Backman
220ce9fd0f
GoogleLLMContext: Allow _restructure_from_openai_messages to handle context frames that contain function call data and / or messages
2025-01-29 16:01:39 -05:00
Filipi da Silva Fuchter
5d0486a26f
Merge pull request #1008 from pipecat-ai/cutting_initial_words
...
Avoid cutting off the beginning of the audio
2025-01-29 17:02:40 -03:00
Chad Bailey
bc98c2e36c
added sqlite storage example
2025-01-29 19:12:15 +00:00
Aleix Conchillo Flaqué
091258f617
improve create_task names
2025-01-29 11:11:40 -08:00
Aleix Conchillo Flaqué
2a1408eb2a
transports(websocket server): remove unused variable
2025-01-29 11:11:40 -08:00
Aleix Conchillo Flaqué
6393b41d58
transports(websocket): added WebsocketClientTransport
2025-01-29 11:11:37 -08:00
Filipi Fuchter
2a5728264c
Adding missing dependency to openai
2025-01-29 15:52:42 -03:00
Filipi Fuchter
2ef0735462
Adding readme to teach how to use.
2025-01-29 15:45:48 -03:00
Filipi Fuchter
80bbfff4be
Merge branch 'main' into cutting_initial_words
2025-01-29 15:36:52 -03:00
Aleix Conchillo Flaqué
4ff68e66b9
Merge pull request #1110 from pipecat-ai/aleix/frame-metadata
...
frames: added metadata field to Frame class
2025-01-29 10:30:59 -08:00
Aleix Conchillo Flaqué
3a688840fc
frames: added metadata field to Frame class
2025-01-29 09:53:21 -08:00
Aleix Conchillo Flaqué
2ca8b95bbf
Merge pull request #1106 from Vaibhav159/vl_moving_test_utils_to_pipecat_package
...
moving test utils inside of package
2025-01-29 09:44:34 -08:00
Mark Backman
2aafc6bd1d
Merge pull request #1107 from AngeloGiacco/angelo/increase-ws-connection
...
fix: elevenlabs tts increase websocket max message size limit to 16MB
2025-01-29 10:04:42 -05:00
Angelo Giacco
0ff9ef8707
fix: add changelog
2025-01-29 14:27:39 +00:00
Angelo Giacco
596cae994d
fix: elevenlabs tts increase websocket max message size limit to 16MB
2025-01-29 13:55:27 +00:00
Dominic
9ad9cb1ff8
Cleaned up formatting
2025-01-29 17:36:08 +09:00
Dominic Stewart
60e800e9ba
Merge branch 'main' into dom/voicemail-detection-bot
2025-01-29 17:30:56 +09:00
Dominic
1c8f0ed7da
Finalised code and added a bit about this example to the README
2025-01-29 17:27:44 +09:00
Vaibhav159
8407a86532
moving test utils inside of package
2025-01-29 12:46:43 +05:30
Dominic
417d661d28
Updated bot_runner and bot_daily with adjustments necessary to run voicemail detection from bot_daily code
2025-01-29 16:11:45 +09:00
Aleix Conchillo Flaqué
8cd23c42fc
Merge pull request #1100 from pipecat-ai/aleix/use-task-cancel-on-left-disconnected
...
use `task.cancel()` when participant leaves/disconnects
2025-01-28 16:02:02 -08:00
Aleix Conchillo Flaqué
0547a15695
task: allow queuing a CancelFrame to cancel the task
2025-01-28 15:59:36 -08:00
Aleix Conchillo Flaqué
3fe2124314
examples: use task.cancel() when participant leaves or disconnects
2025-01-28 15:46:20 -08:00
Aleix Conchillo Flaqué
ba358a4f0a
task: cleanup processors after task finishes running
2025-01-28 15:02:25 -08:00
Aleix Conchillo Flaqué
79ef8c947d
Merge pull request #1099 from pipecat-ai/aleix/daily-transport-queue-events
...
transports(daily): queue events until join completes
2025-01-28 14:38:25 -08:00
Aleix Conchillo Flaqué
f024476b08
transports(daily): queue events until join completes
2025-01-28 11:22:42 -08:00
Dominic
73690a13d9
Moved voicemail detection to phone-chatbot and working on that now
2025-01-28 22:31:08 +09:00
Dominic
6ebf06a6fb
Removed start_terminate_call function as unnecessary
2025-01-28 10:39:10 +09:00
Dominic
2f4f779c91
Fixed a few things
2025-01-28 10:39:10 +09:00
Dominic
941ee6e5e8
Add voicemail detection example
2025-01-28 10:39:10 +09:00
Aleix Conchillo Flaqué
cd5075ed7a
Merge pull request #1097 from pipecat-ai/aleix/pipecat-0.0.57
...
prepare CHANGELOG for 0.0.54
2025-01-27 14:56:51 -08:00
Aleix Conchillo Flaqué
6f41a667c8
prepare CHANGELOG for 0.0.54
2025-01-27 14:48:56 -08:00
Aleix Conchillo Flaqué
0b222a7eae
Merge pull request #1085 from pipecat-ai/aleix/task-creation-and-cancellation
...
improve task creation and cancellation
2025-01-27 14:47:20 -08:00
Aleix Conchillo Flaqué
f09f4b8fc4
services(tavus): fix EndFrame and CancelFrame processing
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
cca241a2b7
examples(22c): fix cancel_task call
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
1489e44740
gemini(multimodal live): fix model audio queue variable
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
f55f78e70e
update CHANGELOG.md
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
10202dc529
transports(websockets): cancel or wait for tasks to finish
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
498805a34c
FrameProcessor: add wait_for_task()
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
509f143e1b
update CHANGELOG.md
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
737e4fa3bd
gemini(multimodal live): connect on StartFrame
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
8b5228a105
utils: move task functions to asyncio module
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
6cc01bc5b0
examples: update 14 series with TTSSpeakFrame
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
2a2928d96c
gemini: create transcribe tasks only once
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
a3a6adbd17
user_idle_processor: add missing parent cleanup()
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
bf5ced18b2
fix parallel pipelines cleanup
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
2eccd1b1e9
utils: update some logging levels
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
9374bed878
tests: langchain fixes
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
c03d0352b1
utils/tasks: added new documentation
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
af90b8b4fa
utils: add wait_for_task()
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
0a9daa2f56
task: avoid canceling tasks more than once
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
e48c0e52ef
transports(daily): avoid canceling task more than once
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
6bca8396d3
utils: error if we try to cancel the same task multiple times
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
c2d8a45a07
runner: warn about remaining dangling tasks
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
80a7f1b1e7
runner: improve signal handler task cancellation
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
aff6e24560
pipeline: fix pipeline cleanup
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
cb93f6b368
utils: store created tasks and add current_tasks()
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
ff0bcec33a
transports: improve task naming
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
5885fcc230
add id and name properties
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
57b186cde8
base_transport: add name and id fields
2025-01-27 14:42:23 -08:00
Aleix Conchillo Flaqué
d1a3f404a5
improve task creation and cancellation
...
If a FrameProcessor needs to create a task it should use
FrameProcessor.create_task() and FrameProcessor.cancel_task(). This gives
Pipecat more control over all the tasks that are created in Pipecat.
Both functions internally use the utils module: utils.create_task() and
utils.cancel_task() which should also be used outside of FrameProcessors. That
is, unless strictly necessary, we should avoid using asyncio.create_task().
2025-01-27 14:42:23 -08:00
chadbailey59
179ddbea7d
Add dialout to the Daily phone example ( #998 )
...
* added dialout to daily phone example
* cleanup
* cleanup
* pre-commit hook
* Fix typo
* More explicit README instructions
---------
Co-authored-by: Mark Backman <mark@daily.co >
2025-01-27 12:21:30 -06:00
Mark Backman
86c1e6a3bd
Merge pull request #1081 from pipecat-ai/mb/user-idle-add-retry
...
Added retry functionality and a new callback to the UserIdleProcessor
2025-01-27 10:30:45 -05:00
Mark Backman
9e9822f17d
Use inspect.signature to determine which callback to use
2025-01-27 10:24:58 -05:00
Mark Backman
5f9671e2ca
Added retry functionality and a new callback to the UserIdleProcessor
2025-01-27 10:24:57 -05:00
Mark Backman
aac8961ae5
Merge pull request #1078 from pipecat-ai/mb/improve-error-handling-truncate-audio
...
Add better error handling for OpenAIRealtimeBetaLLMService truncate errors
2025-01-27 08:54:39 -05:00
Mark Backman
3e6377346a
Merge pull request #1093 from pipecat-ai/mb/update-example-6a
2025-01-26 19:43:39 -05:00
Mark Backman
9d9a622b1a
Merge pull request #1094 from pipecat-ai/mb/readme-service-section
2025-01-26 19:43:12 -05:00
Mark Backman
3e9a6b6262
Merge pull request #1095 from pipecat-ai/mb/elevenlabs-lang-codes
2025-01-26 12:21:28 -05:00
Mark Backman
fb3097560f
Remove eleven_multilinguagal_v2 from language code list
2025-01-26 07:17:38 -05:00
Mark Backman
ff6368add0
Update README.md
...
Adding a section so that table can be linked to.
2025-01-25 16:12:53 -05:00
Mark Backman
89fd03d86f
Merge pull request #1090 from vengad-arrowhead/main
...
Adding hindi danda symbol as end of sentence marker
2025-01-25 09:36:19 -05:00
Mark Backman
0672530d6b
Fix foundational example 6a to switch images when the bot is speaking
2025-01-25 08:40:42 -05:00
vengadanathan srinivasan
7a0cfc8d3d
Adding hindi danda symbol as end of sentence marker
2025-01-25 14:55:51 +05:30
Mark Backman
b881dd57b3
Merge pull request #1086 from pipecat-ai/mb/fix-expiry-time-type-mismatch
2025-01-24 17:31:08 -05:00
Mark Backman
abf0d0d053
Improve token parameter construction using DailyMeetingTokenProperties
2025-01-24 17:22:31 -05:00
Mark Backman
1acdf7aff7
Fix expiry_time type validation in get_token REST API helper
2025-01-24 17:21:50 -05:00
Mark Backman
96b90abda6
Merge pull request #1082 from pipecat-ai/mb/update-function-calling-examples
...
Update function calling examples to push a TextFrame in the start_cal…
2025-01-24 17:21:13 -05:00
Filipi da Silva Fuchter
202a844eeb
Merge pull request #1051 from pipecat-ai/gemini_grounding_metadata_rtvi
...
Sending Search Response to RTVI
2025-01-24 19:20:50 -03:00
Filipi Fuchter
655d56f634
Fixing pydantic validation when creating meeting token.
2025-01-24 19:15:56 -03:00
Filipi Fuchter
07c84b733b
Sending Search Response to RTVI
2025-01-24 18:59:46 -03:00
Filipi da Silva Fuchter
7c52736ff6
Merge pull request #1030 from pipecat-ai/gemini_grounding_metadata
...
Introduce support for extracting and processing grounding metadata from GoogleLLMService.
2025-01-24 15:41:54 -03:00
Mark Backman
48ce751602
Merge pull request #1075 from Vaibhav159/vl_add_daily_meeting_token_v2
...
adding models to DailyRestHelper
2025-01-24 13:21:52 -05:00
Vaibhav159
1f1e2dac2b
wrapping things up
2025-01-24 23:44:23 +05:30
Vaibhav159
71c2dc3d05
minor typing change
2025-01-24 23:38:44 +05:30
Vaibhav159
ef02ece662
doc string
2025-01-24 22:47:40 +05:30
Vaibhav159
d5818fad5b
addressing comments
2025-01-24 22:46:54 +05:30
Rafal Skorski
9c22bd8df1
Improving read me and encoding support
2025-01-24 16:44:11 +01:00
Mark Backman
dbea86baae
Update function calling examples to push a TextFrame in the start_callback
2025-01-24 10:21:08 -05:00
Vaibhav159
c5faac1cf8
adding RecordingsBucketConfig
2025-01-24 15:14:20 +05:30
Vaibhav159
e106d7a215
adding line space
2025-01-24 09:12:07 +05:30
Vaibhav159
40c1a8369a
updated changelog
2025-01-24 09:11:15 +05:30
Vaibhav159
6ab2404a98
adding more properties to daily room
2025-01-24 09:10:25 +05:30
Mark Backman
e61c996a2e
Merge pull request #1079 from ecdeng/patch-1
...
Update cartesia.py to use the new model pointer `sonic`
2025-01-23 22:15:30 -05:00
Eric Deng
2c81dc1f06
Update cartesia.py to use the new model pointer sonic instead of sonic-english
...
We are now using `sonic` as a pointer to the latest stable release (https://docs.cartesia.ai/build-with-sonic/models#continuous-updates ). sonic-english will forever point to `sonic-2024-10-19`, which is already out of date.
2025-01-23 15:47:07 -08:00
Mark Backman
53251dcb88
Add better error handling for OpenAIRealtimeBetaLLMService truncate errors
2025-01-23 14:25:08 -05:00
Mark Backman
d4e4b12109
Merge pull request #1071 from porcelaincode/patch-1
...
Update runner.py
2025-01-23 13:19:22 -05:00
Mark Backman
466d26a4f2
Merge pull request #1077 from Vaibhav159/vl_fix_missing_leftover_audio
...
adding missing audio buffer fix
2025-01-23 13:16:41 -05:00
Vaibhav159
ef511d580d
adding missing audio buffer fix
2025-01-23 23:17:49 +05:30
Vaibhav159
5957ddb038
adding missing audio buffer fix
2025-01-23 23:17:18 +05:30
Vaibhav159
799c2d14b8
adding meeting token v2 func
2025-01-23 21:40:42 +05:30
Rafal Skorski
8eef21db6e
Adding telnyx serializer
2025-01-23 15:39:46 +01:00
vatsal
dee1224530
Update runner.py
2025-01-23 13:21:49 +05:30
Mark Backman
fc6aa6eae8
Merge pull request #1060 from chhao01/patch-1
...
[bug]TypeError: object of type 'NoneType' has no len()
2025-01-22 19:14:35 -05:00
Mark Backman
ddd5bf70ab
Merge pull request #1061 from Allenmylath/patch-21
...
Update README.md
2025-01-22 19:13:15 -05:00
allenmylath
aa59744444
Update examples/README.md
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-01-23 05:38:37 +05:30
chadbailey59
067ddfe505
Storytelling chatbot updates ( #1066 )
...
* initial changes for gemini storybot
* storybot updates for gemini
* more storybot updates
* interim interruptible commit
* cleanup
* cleanup
* cleanup
* cleanup
2025-01-22 15:20:21 -06:00
Mark Backman
a64df978e7
Merge pull request #1046 from pipecat-ai/mb/transcript-tts
...
Modified `TranscriptProcessor` to use `TTSTextFrame`s
2025-01-22 15:46:01 -05:00
Mark Backman
7167719761
Emit a transcription callback when receiving a CancelFrame, update examples accordingly
2025-01-22 14:56:29 -05:00
Mark Backman
e1430be9f9
Code review fixes
2025-01-22 14:56:29 -05:00
Mark Backman
c2fe8e7fdb
Updated CHANGELOG
2025-01-22 14:56:28 -05:00
Mark Backman
31c77d8e35
Update examples for the updated TranscriptProcessor
2025-01-22 14:56:00 -05:00
Mark Backman
2a60d54830
Update the AssistantTranscriptProcessor to use TTSTextFrames in place of OpenAILLMContextFrames
2025-01-22 14:56:00 -05:00
Aleix Conchillo Flaqué
b3c99887dc
Merge pull request #1068 from Canonical-AI-Inc/import-fix
...
Fixing missing import
2025-01-22 11:37:49 -08:00
Mark Backman
38ad75cc17
Merge pull request #1065 from pipecat-ai/mb/fix-openai_realtime-function-calling
...
OpenAIRealtimeBetaLLMService: Fixed an error in function calling
2025-01-22 14:37:01 -05:00
Adrian Cowham
2debac314c
fixing missing import
2025-01-22 11:06:53 -08:00
Mark Backman
e0c9a1a1a2
Merge pull request #1041 from Allenmylath/patch-20
...
Update bot.py
2025-01-22 09:18:19 -05:00
allenmylath
4cdcca588e
Update examples/moondream-chatbot/bot.py
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-01-22 19:40:12 +05:30
allenmylath
a90e81e2eb
Update examples/moondream-chatbot/bot.py
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-01-22 19:38:36 +05:30
Mark Backman
0ba60c9e28
Merge pull request #975 from imsakg/main
...
fix(gemini): prevent non-audio modality processing
2025-01-22 09:03:18 -05:00
Mark Backman
5ca5fbd825
OpenAIRealtimeBetaLLMService: Fixed an error in function calling
2025-01-22 08:54:03 -05:00
Joe Garlick
b72504f1cb
Added: Additional DTMF frames
2025-01-22 13:47:23 +00:00
allenmylath
2b52e2c109
Update README.md
...
Silero-tts changed to VAD, also description regarding session handling added to websocket chatbot
2025-01-22 14:42:35 +05:30
Cheng Hao
7e8fc2e7e2
[bug]TypeError: object of type 'NoneType' has no len()
...
Sometimes the chunk.choices is None, and I got exception like:
```
TypeError: object of type 'NoneType' has no len()
```
2025-01-22 15:31:27 +08:00
Aleix Conchillo Flaqué
0d79a9eaa6
update CHANGELOG.md
2025-01-21 18:00:10 -08:00
Aleix Conchillo Flaqué
f89b9ec23f
Merge pull request #1057 from pipecat-ai/aleix/replace-resampy-soxr
...
improve audio resampling by switching from resampy to soxr
2025-01-21 17:52:49 -08:00
Mark Backman
20d5824e56
Merge pull request #1058 from pipecat-ai/mb/fix-trace-log
2025-01-21 20:44:50 -05:00
Aleix Conchillo Flaqué
f23baa78d8
test-requirements: add soxr and remove resampy
2025-01-21 17:40:17 -08:00
Aleix Conchillo Flaqué
cacd6ba3fa
improve audio resampling by switching from resampy to soxr
2025-01-21 17:40:17 -08:00
Aleix Conchillo Flaqué
f87ecd3a51
Merge pull request #1048 from pipecat-ai/aleix/add-unittest-utils
...
tests: add some initial run_test() utilities
2025-01-21 17:39:06 -08:00
Mark Backman
b96a922aa8
Fix trace log line for resume_processing_frames
2025-01-21 18:15:03 -05:00
Aleix Conchillo Flaqué
401d3ff267
tests: added PipelineTask tests
2025-01-21 11:45:43 -08:00
Aleix Conchillo Flaqué
ab4221a4db
task: added BaseTask
2025-01-21 11:45:43 -08:00
Aleix Conchillo Flaqué
bd6f82cf94
task: allow specifying heartbeat period
2025-01-21 11:45:43 -08:00
Aleix Conchillo Flaqué
dd21b424d6
pyproject: ignore 'audioop' deprecation warning
2025-01-21 10:27:34 -08:00
Aleix Conchillo Flaqué
76884877dd
tests: add pytest-asyncio dependency
2025-01-21 10:23:19 -08:00
Aleix Conchillo Flaqué
0d6c680133
README: add unit tests badge
2025-01-21 10:14:37 -08:00
Aleix Conchillo Flaqué
a27fe4bde2
tests: move test_ai_services to test_utils_string
2025-01-21 10:06:14 -08:00
Aleix Conchillo Flaqué
177cb2ca8b
tests: initial pipeline and parallelpipeline tests
2025-01-21 09:57:54 -08:00
Aleix Conchillo Flaqué
3c970a3cee
tests: add more filter tests
2025-01-21 09:43:57 -08:00
Aleix Conchillo Flaqué
af02f8f1cd
filters(frame_filter): allow more than one frame
2025-01-21 09:43:33 -08:00
Aleix Conchillo Flaqué
2e0fb198bf
frame_processor: allow pushing more frames after EndFrame
...
This can be useful for testing purposes. In real practice, there shouldn't be
any frames after an EndFrame is pushed.
2025-01-21 09:42:15 -08:00
Filipi da Silva Fuchter
4f758c5a3b
Merge pull request #1050 from pipecat-ai/fix_rtvi_warning_msg
...
Ignoring transport messages that are not intended to RTVI.
2025-01-21 13:36:50 -03:00
Rafal Skorski
89b87289e2
elevenlabs key added to env.example
2025-01-21 17:12:27 +01:00
Rafal Skorski
e0e190a1a2
Create telnyx chat bot example application
2025-01-21 17:09:55 +01:00
Filipi Fuchter
3e0836b340
Ignoring transport messages that are not intended to RTVI.
2025-01-21 10:08:14 -03:00
Aleix Conchillo Flaqué
2f23693bf3
tests: fix test_protobuf_serializer.py
2025-01-20 18:39:59 -08:00
Aleix Conchillo Flaqué
b7dd9748cf
serializers: fix special fix initialization
2025-01-20 18:39:41 -08:00
Aleix Conchillo Flaqué
d4d9c3b7ae
tests: fix test_aggregators.py
2025-01-20 18:16:14 -08:00
Aleix Conchillo Flaqué
090bc81ec5
tests: add some initial run_test() utilities
2025-01-20 17:41:21 -08:00
Filipi Fuchter
9b61633aa0
Introduce support for extracting and processing grounding metadata from Google LLM responses.
2025-01-20 11:28:12 -03:00
Mark Backman
e3d53d3d9a
Merge pull request #1044 from pipecat-ai/mb/elevenlabs-http-fix-voice-settings
...
Fixed a type error when using voice_settings in ElevenLabsHttpTTSService
2025-01-20 08:11:38 -05:00
Mark Backman
262d3a19c9
Fixed a type error when using voice_settings in ElevenLabsHttpTTSService
2025-01-20 07:57:02 -05:00
allenmylath
491feb691c
Update bot.py
...
quiet and talking frames are determined based on BotStartedSpeakingFrame and BotStoppedSpeakingFrame not ttsframe
2025-01-20 14:00:17 +05:30
Aleix Conchillo Flaqué
e4f83b237e
update CHANGELOG (remove 07d-interruptible-elevenlabs-http.py)
2025-01-19 11:36:18 -08:00
Aleix Conchillo Flaqué
a169e0cde9
Merge pull request #1035 from pipecat-ai/aleix/prepare-0.0.53
...
update CHANGELOG for 0.0.53
2025-01-18 14:50:35 -08:00
Aleix Conchillo Flaqué
c6d643d4ec
update CHANGELOG for 0.0.53
2025-01-18 14:48:48 -08:00
Aleix Conchillo Flaqué
2abbd4bb27
Merge pull request #1039 from pipecat-ai/aleix/fish-audio-websocket-service
...
services(fish): FishAudioTTSService to use WebsocketService
2025-01-18 14:48:20 -08:00
Aleix Conchillo Flaqué
e0011a3996
services(fish): FishAudioTTSService to use WebsocketService
2025-01-18 14:29:45 -08:00
Aleix Conchillo Flaqué
ea44c59ddd
Merge pull request #1037 from Vaibhav159/fixing_unused_11labs_package
...
removing unused 11labs package imports
2025-01-17 22:08:04 -08:00
Vaibhav159
a9c7dbbc05
removing unused code
2025-01-18 10:58:07 +05:30
Vaibhav159
8a87e92b2b
adding missing 11labs package
2025-01-18 10:48:57 +05:30
Mark Backman
982f2becc6
Merge pull request #1002 from pipecat-ai/mb/add-on-error-callback
...
Register the on_error handler
2025-01-17 21:58:59 -05:00
Mark Backman
e049ae470d
Register the on_error handler
2025-01-17 21:49:42 -05:00
Mark Backman
e159f2dce1
Merge pull request #1024 from pipecat-ai/mb/elevenlabs-http
...
Add ElevenLabsHttpTTSService
2025-01-17 21:30:31 -05:00
Aleix Conchillo Flaqué
e9162ae467
Merge pull request #1004 from Fluentsai/feature/dtmf_input
...
Twilio serializer reading dtmf websocket messages
2025-01-17 18:14:46 -08:00
Aleix Conchillo Flaqué
bb65512ff4
Merge pull request #1034 from pipecat-ai/aleix/ulaw-resample-update
...
ulaw resample update
2025-01-17 17:47:18 -08:00
Mark Backman
b81323d676
Code review fixes + docstrings
2025-01-17 20:12:43 -05:00
Aleix Conchillo Flaqué
65fa77dfa5
audio: use resample_audio to resample ulaw bytes
2025-01-17 15:24:41 -08:00
Aleix Conchillo Flaqué
9ddd9ae27c
Merge pull request #1011 from Vaibhav159/vl_deepgram_metrics_without_vad
...
adding metric generation without deepgram VAD
2025-01-17 14:47:19 -08:00
Aleix Conchillo Flaqué
12fc6e17ef
Merge pull request #1033 from pipecat-ai/aleix/observers-performance
...
task: add TaskObserver and avoid pipeline blocking
2025-01-17 14:43:26 -08:00
Aleix Conchillo Flaqué
3e4020cdba
task: add TaskObserver and avoid pipeline blocking
...
Observers now process frames in separate tasks. This avoids blocking the
pipeline while the observer is processing the frame.
2025-01-17 11:15:52 -08:00
Aleix Conchillo Flaqué
4f883ee31f
Merge pull request #1023 from pipecat-ai/aleix/introduce-heartbeat-frames
...
introduce heartbeat frames
2025-01-17 10:31:07 -08:00
Mark Backman
3ff360f042
Merge pull request #1032 from pipecat-ai/mb/user-idle-fixes
...
Start UserIdleProcessor on speaking frame, fix bug not pushing EndFrame
2025-01-17 13:18:09 -05:00
Aleix Conchillo Flaqué
45cbad5b3e
task: add HEARTBEAT_MONITOR_SECONDS
2025-01-17 10:11:28 -08:00
Aleix Conchillo Flaqué
477d0d154b
frame_processor: make sure clock is initialized
2025-01-17 10:05:23 -08:00
Aleix Conchillo Flaqué
4b3c776f58
task: don't use push queue to send a heartbeat
...
This is because we might be waiting for the EndFrame. Currently, if we push an
EndFrame to the task, the task will block until the EndFrame traverses all the
pipeline.
2025-01-17 10:04:24 -08:00
Aleix Conchillo Flaqué
da0c4cfd99
task: increase heartbeat monitoring to 5 seconds
2025-01-17 10:04:05 -08:00
Aleix Conchillo Flaqué
f22a00570d
task: start heartbeats task when push task starts
2025-01-17 10:03:13 -08:00
Mark Backman
85f4663a41
Start UserIdleProcessor on speaking frame, fix bug not pushing EndFrame
2025-01-17 12:54:17 -05:00
Aleix Conchillo Flaqué
915e3bb3c7
Merge pull request #1029 from Vaibhav159/vl_fixing_idle_frame_processor_logic
...
fixing IdleFrameProcessor and UserIdleProcessor init logic
2025-01-17 06:48:13 -08:00
Vaibhav159
80779c48d6
sort fix
2025-01-17 20:07:25 +05:30
Vaibhav159
c444557965
fixing IdleFrameProcessor and UserIdleProcessor init logic
2025-01-17 19:50:53 +05:30
Mark Backman
d51893f61c
Refactor for aiohttp, correct use of settings
2025-01-16 23:49:53 -05:00
Mark Backman
740d2743df
Add TTFB metrics
2025-01-16 23:05:53 -05:00
Mark Backman
0dd22fb879
Merge pull request #1022 from pipecat-ai/mb/fix-abstractmethod
...
Remove @abstractmethod from set_model and set_model in TTSService class
2025-01-16 22:59:26 -05:00
Mark Backman
225b65c3d2
Add ElevenLabsHttpTTSService
2025-01-16 22:46:32 -05:00
Aleix Conchillo Flaqué
2503f76107
examples: add 31-heartbeats.py
2025-01-16 19:31:13 -08:00
Aleix Conchillo Flaqué
ff8aa68942
introduce heartbeat frames
2025-01-16 19:31:13 -08:00
Maxim Makatchev
c5edbf4b75
Made InputDTMFFrame a DataFrame and moved up to data frames
2025-01-17 12:27:04 +09:00
Aleix Conchillo Flaqué
799777774b
Merge pull request #1018 from pipecat-ai/aleix/streamline-thread-pool-executors
...
transports: streamline max_workers for ThreadPoolExecutors
2025-01-16 19:05:41 -08:00
Mark Backman
fdef8a97e2
Remove @abstractmethod from set_model and set_model in TTSService class
2025-01-16 21:36:51 -05:00
Mark Backman
0163247410
Merge pull request #1021 from pipecat-ai/mb/improve-30
...
Add a second observer to the 30-observer.py example
2025-01-16 21:19:35 -05:00
James Hush
221e044046
demo: Update translator bot example ( #1005 )
...
* docs: Update translator bot example
Updates the translator bot to do the following:
- Allow you to specify the in and out languages
- Uses TranscriptionProcessor to handle transcriptions
* Simplify the example, improve performance
---------
Co-authored-by: Mark Backman <mark@daily.co >
2025-01-17 10:08:15 +08:00
Mark Backman
532fd31fd7
Add a second observer to the 30-observer.py example
2025-01-16 19:46:18 -05:00
Mark Backman
3e178fd46f
Merge pull request #1020 from pipecat-ai/mb/observer-foundational
...
Add foundational example 30 to show how to use an Observer
2025-01-16 19:28:26 -05:00
Mark Backman
07cb8b7a89
Extend the example to include BotStartedSpeakingFrame and BotStoppedSpeakingFrame
2025-01-16 19:24:01 -05:00
Mark Backman
e805738d4c
Merge pull request #1009 from pipecat-ai/mb/tts-ignore-interim-transcripts
...
TTSService should only process LLMTextFrames
2025-01-16 17:09:24 -05:00
Mark Backman
119bc7e35f
Update check to exclude transcription frames
2025-01-16 16:43:46 -05:00
Mark Backman
b9b02845a3
Add foundational example 30 to show how to use an Observer
2025-01-16 16:37:32 -05:00
Aleix Conchillo Flaqué
3714f12edc
Merge pull request #1019 from Canonical-AI-Inc/canonical-transcripts
...
Add transcript to Canonical Metrics Service
2025-01-16 13:36:55 -08:00
Aleix Conchillo Flaqué
d2b8171197
transports: streamline max_workers for ThreadPoolExecutors
2025-01-16 13:34:04 -08:00
Filipi Fuchter
c4c15eff39
Sending a silence frame to prevent the audio from clipping.
2025-01-16 18:30:19 -03:00
Adrian Cowham
d0b48c95bb
updated the example to use stereo audio and pass in the context. also updated the service to send the transcripts if they're available
2025-01-16 13:12:38 -08:00
Aleix Conchillo Flaqué
73ed0c1ad7
Merge pull request #1017 from pipecat-ai/aleix/additional-trace-logging
...
additional trace logging
2025-01-16 12:38:47 -08:00
Vanessa Pyne
c211580fec
Merge pull request #1016 from pipecat-ai/vp-1007-nonetype
...
services(gemini_multimodal_live): set content to [] if not present in messages
2025-01-16 14:14:50 -06:00
Aleix Conchillo Flaqué
359b55a85e
additional trace logging
2025-01-16 11:19:42 -08:00
Filipi Fuchter
7efd00e0f7
Asking for the bot to send the audio only when the audio element is already on playing state.
2025-01-16 16:00:56 -03:00
kompfner
8b602a3f62
Merge pull request #1010 from pipecat-ai/ios-simplechatbot-assorted-improvements
...
iOS SimpleChatbot assorted improvements
2025-01-16 13:59:45 -05:00
kompfner
485c231f69
Merge pull request #1012 from pipecat-ai/simplechatbot-readme-local-pipecat
...
Add to the SimpleChatbot server README a step for pointing to the loc…
2025-01-16 13:46:19 -05:00
vipyne
8ba3b150eb
services(gemini_multimodal_live): set content to [] if not present in messages
...
... which it will be if the message is a tool call
2025-01-16 11:59:02 -06:00
Paul Kompfner
b5f72b4378
Add to the SimpleChatbot server README a step for pointing to the local version of pipecat
2025-01-16 11:59:44 -05:00
Vaibhav159
85e7d62f94
fixing log text
2025-01-16 21:36:51 +05:30
Vaibhav159
923d33eeff
fixing ruff
2025-01-16 21:32:48 +05:30
Vaibhav159
7ee6e7193d
adding metric generation without deepgram VAD
2025-01-16 21:23:56 +05:30
Paul Kompfner
156fffe6fc
In iOS SimpleChatbot demo, add clarifying note to Audio Settings section header explaining that "(No selection = system default)".
...
Ideally we could add a row showing that the system default is selected, but this is OK as a short-term fix. Also, the presence of that row might suggest that "system default" is selectable, but it's not: this is currently a limitation in the Pipecat Client.
2025-01-16 10:32:55 -05:00
Paul Kompfner
c9834e2712
In iOS SimpleChatbot demo, remove unused LLMHelperDelegate protocol conformance
2025-01-16 10:31:17 -05:00
Paul Kompfner
1e7e307f69
In iOS SimpleChatbot demo, call release() when disconnecting the voice client, since we're not using it after disconnecting
2025-01-16 10:30:06 -05:00
Mark Backman
67e47a388d
TTSService should only process LLMTextFrames
2025-01-16 10:03:24 -05:00
Filipi Fuchter
119c0da299
Configuring a proxy so we can test from mobile
2025-01-16 11:02:53 -03:00
Filipi Fuchter
ea1323723d
Handling the signalling to play the audio
2025-01-16 10:42:22 -03:00
Filipi Fuchter
d2efe27350
Improving the logs and updating status
2025-01-16 10:36:45 -03:00
Filipi Fuchter
5dc7d2a378
Creating the bot when pressing to connect.
2025-01-16 10:28:39 -03:00
Filipi Fuchter
88c540f9bc
Starting to create the example signalling through app message.
2025-01-16 10:14:38 -03:00
Maxim Makatchev
dcf317f2fa
Twilio serializer reading dtmf websocket messages and generating InputDTMFFrame containing the corresponding value of KeypadEntry
2025-01-16 17:43:12 +09:00
Aleix Conchillo Flaqué
b8ffd7b16b
Merge pull request #996 from pipecat-ai/aleix/introduce-observers
...
introduce pipeline frame observers
2025-01-15 18:05:33 -08:00
Aleix Conchillo Flaqué
08f1dda94e
observers: add a timestamp to on_push_frame()
2025-01-15 17:45:00 -08:00
Aleix Conchillo Flaqué
45039e7cde
update CHANGELOG.md
2025-01-15 17:45:00 -08:00
Aleix Conchillo Flaqué
e50c76d075
examples(simple-chatbot): use RTVIObserver for server-client messages
2025-01-15 17:45:00 -08:00
Aleix Conchillo Flaqué
dd9f9179cc
rtvi(RTVIObserver): use observers for RTVI server->client messages
2025-01-15 17:45:00 -08:00
Aleix Conchillo Flaqué
c8da531402
pipeline(task): add support for pipeline frame observers
2025-01-15 17:43:59 -08:00
Aleix Conchillo Flaqué
25bcaf5c7c
observers: introduce pipeline observers
2025-01-15 17:43:59 -08:00
Aleix Conchillo Flaqué
2d0f3341c3
frames: add LLMTextFrame and TTSTextFrame
...
This is to distinguish what type of service has generated the TextFrames.
2025-01-15 17:43:59 -08:00
Aleix Conchillo Flaqué
7626d7b04b
Merge pull request #999 from pipecat-ai/aleix/add-pre-commit-hooks
...
add pre-commit hooks
2025-01-15 17:39:34 -08:00
Aleix Conchillo Flaqué
f78520f7d0
add pre-commit hooks
...
Fixes #945
2025-01-15 13:44:21 -08:00
Aleix Conchillo Flaqué
bb4766455d
Merge pull request #997 from pipecat-ai/aleix/update-dependencies-01-15-25
...
update dependencies (go back to numpy1)
2025-01-15 13:35:46 -08:00
Aleix Conchillo Flaqué
9dacbbbbf4
fix ruff formatting
2025-01-15 13:02:13 -08:00
Aleix Conchillo Flaqué
4de192fbb0
update dependencies (go back to numpy1)
...
Fixes #911 , #913
2025-01-15 12:04:28 -08:00
kompfner
80b6c28431
Merge pull request #992 from pipecat-ai/live-updates-to-selected-and-available-mics
...
In the iOS SimpleChatbot demo, wire up live updates to the selected m…
2025-01-15 15:00:14 -05:00
Mark Backman
f471744bca
Merge pull request #995 from pipecat-ai/vp-riva-bump
...
deps(riva): bump to 2.18.0
2025-01-15 14:35:39 -05:00
Mark Backman
d5df4b064b
Merge pull request #987 from pipecat-ai/mb/deepseek-typo
...
Fix error log in DeepSeekLLMService and CerebrasLLMService
2025-01-15 14:31:34 -05:00
Mark Backman
06a0e29920
Merge pull request #991 from pipecat-ai/mb/update-web-simple-chatbot
...
Update simple-chatbot example to use the latest client SDKs
2025-01-15 13:36:03 -05:00
Aleix Conchillo Flaqué
64eb8e7262
Merge pull request #994 from Vaibhav159/vl_deepgram_with_vad
...
finalize on DeepgramSTTService on VAD
2025-01-15 10:28:11 -08:00
Filipi da Silva Fuchter
d8386c12dc
Merge pull request #990 from pipecat-ai/bumping_ios_example
...
Using PipecatClient version 0.3.2
2025-01-15 14:29:01 -03:00
vipyne
50e798bcd9
deps(riva): bump to 2.18.0
2025-01-15 10:24:57 -06:00
Vaibhav159
d1ac7751da
finalize on DeepgramSTTService
2025-01-15 20:43:23 +05:30
Paul Kompfner
110ce27c91
In the iOS SimpleChatbot demo, wire up live updates to the selected mic and available mics list. This is beneficial for a few reasons:
...
- Live updates are nice! We can now more easily see what's going on when we connect or disconnect a mic.
- Resolves an issue where the initial selected mic was not shown.
- Let us see when the Pipecat client automatically switches to a new mic, like when one is connected.
2025-01-15 09:56:27 -05:00
Mark Backman
8b657158ca
Update React simple-chatbot client to use latest client SDKs
2025-01-15 09:50:43 -05:00
Mark Backman
cce14fca97
Update JS simple-chatbot client to use latest client SDKs
2025-01-15 09:47:20 -05:00
Filipi Fuchter
7c051516d8
Using PipecatClient version 0.3.2
2025-01-15 09:57:57 -03:00
Mark Backman
5f402ad741
Merge pull request #988 from pipecat-ai/mb/readme-openrouter
...
Update README.md
2025-01-14 18:38:35 -05:00
Mark Backman
a80b186cea
Update README.md
...
Add OpenRouter to the README
2025-01-14 18:08:14 -05:00
Mark Backman
c65aaf3b2e
Merge pull request #967 from sahilsuman933/openrouter-integration
...
feat(services): Add OpenRouter LLM Service Integration
2025-01-14 18:06:13 -05:00
Mark Backman
e815d7776f
Fix error log in DeepSeekLLMService and CerebrasLLMService
2025-01-14 18:03:29 -05:00
sahil suman
11fc08ef24
fix changelog
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 02:57:09 +05:30
sahil suman
6f3b0fdf73
fix changelog
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 02:56:16 +05:30
sahil suman
885bc32827
added changes in changelog.
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 02:53:04 +05:30
sahil suman
7339cc7197
Merge remote-tracking branch 'origin/main' into openrouter-integration
2025-01-15 02:52:19 +05:30
sahil suman
62e9e6bc5a
changed the file name.
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 02:21:58 +05:30
Sahil Suman
329da50338
Update src/pipecat/services/openrouter.py
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2025-01-15 02:20:22 +05:30
sahil suman
4d307d26d8
made the required changes.
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 02:19:05 +05:30
Mark Backman
a74b9354ec
Merge pull request #962 from pipecat-ai/mb/improve-tts-reconnection-logic
...
Improve websocket based TTS service reconnection logic
2025-01-14 14:48:00 -05:00
sahil suman
11381a536f
added example for function calling and made the required changes.
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-15 01:00:33 +05:30
Mark Backman
b53bc8a879
_calculate_wait_times as private, add and use WebsocketServiceException
2025-01-14 13:20:13 -05:00
Mark Backman
e3d8910814
Update CHANGELOG
2025-01-14 13:12:40 -05:00
Mark Backman
e60a59434f
Refactor LMNTTTSService to make a websocket connection directly, then use the WebsocketService base class
2025-01-14 13:09:58 -05:00
Mark Backman
5e5de618f3
Update PlayHTTTSService to use the WebsocketService base class
2025-01-14 13:09:58 -05:00
Mark Backman
8af92f7923
Update ElevenLabsTTSService to use the WebsocketService base class
2025-01-14 13:09:58 -05:00
Mark Backman
f39e17857e
Add a WebsocketService base class to retry, ensure that retries reset after a successful connection, update Cartesia to use the new WebsocketService
2025-01-14 13:09:58 -05:00
Aleix Conchillo Flaqué
5b632de04a
Merge pull request #982 from pipecat-ai/aleix/pipelinetask-cleanup-sink
...
pipeline(task): cleanup Sink processor
2025-01-14 09:14:03 -08:00
Mark Backman
6bcc196489
Merge pull request #969 from pipecat-ai/mb/deepseek
...
Add support for DeepSeek LLM
2025-01-14 09:40:06 -05:00
Mark Backman
66375e9dff
Update dot-env.template API keys
2025-01-14 09:34:34 -05:00
Mark Backman
bc839492b6
Add support for DeepSeek LLM
2025-01-14 09:34:33 -05:00
Filipi da Silva Fuchter
4854645637
Merge pull request #960 from pipecat-ai/example_gemini_with_goolge_search
...
Example with Gemini using google search to retrieve news.
2025-01-14 10:07:15 -03:00
Mark Backman
98e80b7d4a
Merge pull request #970 from pipecat-ai/mb/user-controlled-run-llm
...
Add an override_run_llm option to optionally defer function call completion
2025-01-13 18:48:00 -05:00
Mark Backman
8c0ecb89de
Refactor for new on_context_updated callback and new frame properties
2025-01-13 17:20:41 -05:00
Aleix Conchillo Flaqué
4c8fcb2cfc
pipeline(task): cleanup Sink processor
...
Fixes #953
2025-01-13 13:29:44 -08:00
Aleix Conchillo Flaqué
92313d6ce7
Merge pull request #972 from pipecat-ai/aleix/simple-chatbot-android-workflow-update
...
github: only run android simple-chatbot worflow if android example modified
2025-01-13 13:26:12 -08:00
Mark Backman
1ca6ecc46e
Update CHANGELOG
2025-01-13 09:49:09 -05:00
Mark Backman
f1947d7d38
Update Anthropic and Gemini to allow overriding run_llm
2025-01-13 09:48:43 -05:00
Mark Backman
0852570212
Update Grok for function call override
2025-01-13 09:48:43 -05:00
Mark Backman
874b8bb136
Allow for an override of running a completion after a function call completes, OpenAI
2025-01-13 09:48:43 -05:00
Mark Backman
da1878537b
Merge pull request #974 from pipecat-ai/mb/26d-example
...
Align 26d example with foundation norms
2025-01-12 19:44:31 -05:00
Mark Backman
f406d93b0f
Align 26d example with foundation norms
2025-01-12 19:19:16 -05:00
Aleix Conchillo Flaqué
3cd2b90177
Merge pull request #971 from pipecat-ai/aleix/update-copyright-keep-original-year
...
update copyright keeping original year (2024)
2025-01-12 11:37:15 -08:00
Aleix Conchillo Flaqué
c4f0c7bcfd
github: only run android simple-chatbot worflow if android example modified
2025-01-12 11:35:34 -08:00
Aleix Conchillo Flaqué
95e69597f3
update copyright keeping original year (2024)
2025-01-12 11:34:00 -08:00
Aleix Conchillo Flaqué
710baa5e17
Merge pull request #973 from pipecat-ai/aleix/simple-chatbot-clients
...
examples/simple-chatbot: move clients to client directory
2025-01-12 11:28:21 -08:00
Mert Sefa AKGUN
14e5419913
fix(gemini): prevent non-audio modality processing
...
Add an early return in the _handle_transcribe_model_audio method to
prevent unnecessary processing when the modalities setting is not set
to audio. This change ensures that audio transcription only occurs
when appropriate.
2025-01-12 22:17:10 +03:00
Mark Backman
8c953bac41
Merge pull request #966 from imsakg/main
...
fix(services): handle TranscriptionFrame separately in TTSService
2025-01-12 11:33:38 -05:00
Mark Backman
4c0861ce39
Some addition links and README changes
2025-01-12 09:27:23 -05:00
Mark Backman
12b1e1db9d
Merge pull request #965 from pipecat-ai/mb/aws-add-session-token
...
Add optional aws_session_token for PollyTTSService
2025-01-12 09:13:03 -05:00
Mark Backman
53bfdfd83f
Merge pull request #963 from pipecat-ai/mb/cleanup-examples
...
Update examples to align with latest best practices
2025-01-12 09:12:34 -05:00
Mark Backman
2a5593afea
Merge pull request #968 from pipecat-ai/mb/readme-websocket
...
Update README.md
2025-01-12 09:12:19 -05:00
Aleix Conchillo Flaqué
a04a920e54
examples/simple-chatbot: move clients to client directory
2025-01-11 19:16:05 -08:00
Aleix Conchillo Flaqué
2ce6d92455
Merge pull request #959 from KevGTL/fix-livekit-transport
...
fix: push input audio frame only via push_audio_frame()
2025-01-11 19:03:35 -08:00
Mark Backman
1ecd5da219
Update README.md
...
Add websocket docs links to README.
2025-01-11 08:37:17 -05:00
sahil suman
e04da334d7
add support for openrouter.
...
Signed-off-by: sahil suman <sahilsuman933@gmail.com >
2025-01-11 17:50:58 +05:30
Mert Sefa AKGUN
7ec351813c
style(ai_services): fix import order with ruff
2025-01-11 13:04:26 +03:00
Mert Sefa AKGUN
df6c2fc403
fix(services): handle TranscriptionFrame separately in TTSService
...
Exclude TranscriptionFrame from text frame processing in TTSService by updating the type check condition. This resolves unintended processing behavior when handling different frame types.
2025-01-11 13:00:38 +03:00
Mark Backman
71e107725c
Add optional aws_session_token for PollyTTSService
2025-01-10 19:33:47 -05:00
Mark Backman
4d0c11fcab
Update examples to align with latest best practices
2025-01-10 15:07:06 -05:00
Mark Backman
a8ae79831e
Merge pull request #921 from pipecat-ai/mb/playht-http
...
PlayHTHttpTTSService fixes
2025-01-10 13:26:45 -05:00
Mark Backman
86516d2415
PlayHTHttpTTSService fixes
2025-01-10 13:21:27 -05:00
Vanessa Pyne
5cd9dab14b
Merge pull request #949 from imsakg/main
...
fix(examples): correct TTS service import and setup
2025-01-10 10:58:50 -06:00
Kwindla Hultman Kramer
a3e2e06975
Merge pull request #961 from pipecat-ai/khk/tiny-chatbot-readme-fix
...
fixed 404 in SimpleChatbot iOS example README
2025-01-10 08:45:05 -08:00
Kwindla Hultman Kramer
e7107b99c5
fixed 404 in SimpleChatbot iOS example README
2025-01-10 08:37:13 -08:00
Filipi Fuchter
aa1b8879ee
Fixing ruff format
2025-01-10 13:21:51 -03:00
Mark Backman
6802459165
Merge pull request #956 from pipecat-ai/mb/tavus
...
Update the Tavus example and comment about using the PERSONA_ID
2025-01-10 11:18:05 -05:00
Filipi Fuchter
6719d1fddc
Example with Gemini using google search to retrieve news.
2025-01-10 13:13:59 -03:00
kompfner
a798bf18f2
Merge pull request #955 from pipecat-ai/ios-simple-chatbot-mainactor-fixes
...
iOS SimpleChatbot @MainActor fixes
2025-01-10 09:37:02 -05:00
Kevin Oury
f9d0cca60f
fix: push input audio frame only via push_audio_frame()
2025-01-10 15:02:38 +01:00
Mark Backman
cb22de0d13
Update the Tavus example and comment about using the PERSONA_ID
2025-01-10 08:01:00 -05:00
marcus-daily
7d161cc53b
Setting target SDK to 35
2025-01-10 09:50:37 +00:00
marcus-daily
255abf46ef
Updating Gradle and AGP
2025-01-10 09:50:37 +00:00
marcus-daily
27579bcb70
Fixing imports
2025-01-10 09:50:37 +00:00
marcus-daily
1295b64879
Updating library dependencies
2025-01-10 09:50:37 +00:00
marcus-daily
ca57670f65
Removing unnecessary drawables
2025-01-10 09:50:37 +00:00
marcus-daily
06d0a231b9
Android demo app for simple-chatbot example
2025-01-10 09:50:37 +00:00
Mert Sefa AKGUN
67af4e619b
style(examples): fix ruff formatting in Gemini text example
...
Refactor `CartesiaTTSService` instantiation to comply with line
length requirements from the ruff linter.
2025-01-10 12:32:53 +03:00
Mert Sefa AKGUN
21c274944e
Update examples/foundational/26d-gemini-multimodal-live-text.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-01-10 12:28:13 +03:00
Paul Kompfner
3239249feb
In the iOS SimpleChatbot, fix @MainActor-related warnings (which would be errors in Swift 6). The delegate methods aren't contractually guaranteed to run on the main thread, so we can't mark them as @MainActor.
2025-01-09 17:35:44 -05:00
Paul Kompfner
216979c377
Bump iOS SimpleChatbot's pipecat-client-ios-daily dependency to version 0.3.1
2025-01-09 16:22:26 -05:00
Filipi da Silva Fuchter
b9db53d3cd
Merge pull request #952 from pipecat-ai/fixing_gemini_function_calling
...
Fixing GeminiMultimodalLiveLLMService function calling to work with pipecat-flows
2025-01-09 17:50:25 -03:00
Filipi Fuchter
58bfcc8370
Fixing GeminiMultimodalLiveLLMService function calling when using with pipecat-flows.
2025-01-09 12:22:37 -03:00
Mert Sefa AKGUN
6664c492ac
feat(gemini): enable audio transcription in live text example
...
Add options to transcribe both user and model audio during the GeminiMultimodalLiveLLMService setup in the 26d-gemini-multimodal-live-text.py example.
2025-01-09 15:38:33 +03:00
Mert Sefa AKGUN
7634058f97
fix(examples): correct TTS service import and setup
...
- Update import to use CartesiaTTSService instead of CartesiaMultiLingualTTSService.
- Adjust GeminiMultimodalLiveLLMService setup to use set_model_modalities with TEXT modality.
2025-01-09 02:19:08 +03:00
Mark Backman
39c6446bdc
Merge pull request #947 from pipecat-ai/mb/add-rime-set-voices
...
Add setters for model and voice to RimeHttpTTSService
2025-01-08 14:25:24 -05:00
Filipi da Silva Fuchter
2df7dfcc91
Merge pull request #943 from pipecat-ai/simple_chat_bot_ios
...
SimpleChatbot iOS app.
2025-01-08 16:17:39 -03:00
Mark Backman
c23c9e046c
Add setters for model and voice to RimeHttpTTSService
2025-01-08 14:17:32 -05:00
Mark Backman
9dae753e8c
Merge pull request #926 from imsakg/main
...
feat(gemini): add text handling to GeminiMultimodalLive
2025-01-08 13:42:17 -05:00
Mert Sefa AKGUN
40e9ee6d63
fix(examples): correct import order in Gemini example
...
- Move `CartesiaMultiLingualTTSService` import to maintain proper order.
- Reorganize `enum` import to adhere to styling standards.
2025-01-08 21:14:29 +03:00
Mert Sefa AKGUN
a342fe732e
docs: update CHANGELOG with Gemini modalities and examples
2025-01-08 19:34:42 +03:00
Mert Sefa AKGUN
a729834482
refactor(gemini): reposition WebSocket connection code
...
Move WebSocket connection setup earlier in the function for better
organization and to prepare for subsequent configuration steps.
2025-01-08 19:29:36 +03:00
Mert Sefa AKGUN
94a6f1086e
feat(gemini): change default modality to AUDIO
...
Modify the default modality in the `InputParams` class from TEXT to AUDIO
to better align with the intended use case for GeminiMultimodalLive
service.
2025-01-08 19:29:36 +03:00
Mert Sefa AKGUN
b42d3a8257
feat(gemini): add modality configuration for GeminiMultimodalLive
...
- Introduce `GeminiMultimodalModalities` enum for modality options.
- Add modality field to `InputParams`, defaulting to text.
- Simplify modality setup with `set_model_modalities` method.
- Refactor WebSocket configuration to support dynamic response modalities.
2025-01-08 19:29:36 +03:00
Mert Sefa AKGUN
12ae980abe
feat(gemini): handle full text response in GeminiMultimodalLive
...
- Add a buffer to store bot text responses.
- Push a `LLMFullResponseStartFrame` when text begins.
- Clear the text buffer and send `LLMFullResponseEndFrame` after processing.
2025-01-08 19:29:36 +03:00
Mert Sefa AKGUN
cdb909958c
feat(examples): add Gemini multimodal live text example
...
Introduce a new example `26d-gemini-multimodal-live-text.py` to
demonstrate the use of GeminiMultimodalLiveLLMService with text-only
responses. This example sets up a pipeline for audio input via DailyTransport,
processing with Gemini, and output via Cartesia TTS.
2025-01-08 19:29:35 +03:00
Mert Sefa AKGUN
c72c3025f6
feat(gemini): add configuration methods for response modalities
...
- Introduce `set_model_only_audio` and `set_model_only_text` methods
to toggle between audio-only and text-only response modes in
`GeminiMultimodalLiveLLMService`.
- Refactor configuration setup to a class attribute for improved
reusability and maintenance.
- Remove redundant configuration instantiation in the WebSocket
connection setup process.
2025-01-08 19:29:35 +03:00
Mert Sefa AKGUN
5cbd719780
feat(gemini): add text handling to GeminiMultimodalLive
...
- Introduce text attribute in Part class for handling string data.
- Incorporate text processing in GeminiMultimodalLiveLLMService to push TextFrame if text is present.
2025-01-08 19:29:35 +03:00
Filipi Fuchter
23d6290672
Removing not used class.
2025-01-08 12:05:04 -03:00
Filipi Fuchter
d4e7e11981
SimpleChatbot iOS app.
2025-01-08 12:00:11 -03:00
Mark Backman
8057fe3fcf
Merge pull request #742 from Vaibhav159/vl_feature_websocket_fastapi_timeout
...
adding session_timeout param
2025-01-08 09:05:41 -05:00
Vaibhav159
3b446234a7
fix hyperlink
2025-01-08 10:54:27 +05:30
Vaibhav159
768487ffb3
final changelog
2025-01-08 10:53:32 +05:30
Vaibhav159
2da5620d10
adding changelog
2025-01-08 10:50:09 +05:30
Vaibhav159
af90d65b3b
adding session timeout example in websocket-server example
2025-01-08 10:43:10 +05:30
Vaibhav159
c8569a7b67
Merge remote-tracking branch 'upstream/main' into vl_feature_websocket_fastapi_timeout
2025-01-08 10:21:36 +05:30
Vaibhav159
0ecd98c873
Merge branch 'main' into vl_feature_websocket_fastapi_timeout
2025-01-08 10:20:55 +05:30
Mark Backman
6f863ba2c6
Merge pull request #938 from jcbjoe/jg/optional-authentication-polly
...
Changed Polly authentication params to be optional
2025-01-07 15:37:23 -05:00
Mark Backman
602ca5ebe6
Merge pull request #939 from Vaibhav159/vl_adding_daily_room_properties
...
adding more daily room params
2025-01-07 14:33:59 -05:00
Vaibhav159
787ade41f3
adding missing doc string
2025-01-08 00:58:01 +05:30
Joe Garlick
bb767831d5
Added: Changelog entry
2025-01-07 19:05:02 +00:00
Mark Backman
bc25a771dc
Merge pull request #935 from pipecat-ai/hush/modalUpdate
...
docs: update dependencies for modal demo
2025-01-07 13:57:46 -05:00
Vaibhav159
f37626f81d
adding more daily room params
2025-01-07 21:38:05 +05:30
Mark Backman
9d54578e65
Merge pull request #934 from pipecat-ai/mb/bump-open-ai-version
...
Bump openai version to 1.59.0 for realtime and model updates
2025-01-07 08:29:45 -05:00
Joe Garlick
79afe7ec2a
Changed: Polly authentication information to be optional
2025-01-07 11:43:57 +00:00
James Hush
2c1fd3c3cc
docs: update dependencies for modal demo
2025-01-07 15:45:55 +08:00
Mark Backman
b0dd8e03a6
Bump openai version to 1.59.0 for realtime and model updates
2025-01-06 17:05:22 -05:00
Mark Backman
ee20e48ef8
Merge pull request #931 from pipecat-ai/mb/fix-openai-realtime-
...
Fix truncation timing of OpenAIRealtimeBetaLLMService
2025-01-06 16:25:09 -05:00
Mark Backman
12b5c5a646
Fix truncation timing of OpenAIRealtimeBetaLLMService
2025-01-06 15:37:58 -05:00
Mark Backman
7a021cc82d
Merge pull request #929 from pipecat-ai/mb/add-google-journey-support
...
Added support for Google Journey TTS voices
2025-01-06 15:13:00 -05:00
Mark Backman
3e1ec4a8ee
Added support for Google Journey TTS voices
2025-01-06 14:54:34 -05:00
Mark Backman
a1377b7f1a
Merge pull request #924 from xtreme-sameer-vohra/patch-1
...
Update frames.py
2025-01-06 14:13:10 -05:00
Mark Backman
d6335886e2
Merge pull request #848 from Vaibhav159/vl_add_audio_and_chat_livekit_example
...
adding example for livekit audio and chat version
2025-01-06 13:27:38 -05:00
Vaibhav159
b3b7a5f023
adding 2025 license
2025-01-06 22:10:46 +05:30
Vaibhav159
5138017b57
ruff changes
2025-01-06 22:07:59 +05:30
Vaibhav159
87670067d7
adding changelog
2025-01-06 22:03:11 +05:30
Vaibhav159
656cd2859e
Merge branch 'main' into vl_add_audio_and_chat_livekit_example
2025-01-06 21:57:43 +05:30
Mark Backman
15b2cc210c
Merge pull request #927 from pipecat-ai/mb/update-copyright
...
Update copyright to 2025
2025-01-06 10:33:04 -05:00
Mark Backman
4667624b60
Update copyright to 2025
2025-01-06 10:19:37 -05:00
Sameer Vohra
d07ba80572
Update frames.py
...
fix minor typo in docs
2025-01-05 22:57:54 -05:00
Aleix Conchillo Flaqué
386ba61483
Merge pull request #909 from pipecat-ai/aleix/pipecat-0.0.52
...
update CHANGELOG for 0.0.52
2024-12-24 08:16:05 -08:00
Aleix Conchillo Flaqué
e9d275f270
update CHANGELOG for 0.0.52
2024-12-23 19:52:34 -08:00
Aleix Conchillo Flaqué
3a4994370c
update README
2024-12-23 19:20:23 -08:00
Aleix Conchillo Flaqué
6125ea882d
update README
2024-12-23 19:19:39 -08:00
Aleix Conchillo Flaqué
0a1ce1bb63
update CHANGELOG
2024-12-23 19:13:59 -08:00
Kwindla Hultman Kramer
ab3bcde5f7
Merge pull request #907 from pipecat-ai/khk/gemini-20241221
...
Gemini unary API fixes and natural conversation demo
2024-12-23 17:34:57 -08:00
Kwindla Hultman Kramer
1368d3db5c
revert elevenlabs example changes
2024-12-23 17:33:59 -08:00
Aleix Conchillo Flaqué
cd7dec7391
Merge pull request #906 from pipecat-ai/aleix/fix-duplicate-base-output-frames
...
transports(base_output): fix duplicate push_frame()
2024-12-23 06:12:31 -08:00
Kwindla Hultman Kramer
a5e985094b
remove stray line
2024-12-22 19:45:57 -08:00
Aleix Conchillo Flaqué
c04c69df95
transports(base_output): fix duplicate push_frame()
2024-12-22 14:43:38 -08:00
Aleix Conchillo Flaqué
9c105e25ac
Merge pull request #905 from pipecat-ai/aleix/daily-python-0.14.2
...
pyproject: update daily-python to 0.14.2
2024-12-22 13:03:25 -08:00
Aleix Conchillo Flaqué
6901c4fa57
pyproject: update daily-python to 0.14.2
2024-12-22 12:30:17 -08:00
Mark Backman
469c13c07e
Merge pull request #903 from pipecat-ai/mb/send-prebuilt-chat
...
Add the ability to send_prebuilt_chat_message when using the DailyTra…
2024-12-22 14:33:50 -05:00
Mark Backman
46871ae686
Merge pull request #899 from pipecat-ai/mb/add-fish-audio
...
Add Fish Audio TTS service
2024-12-22 14:26:59 -05:00
Kwindla Hultman Kramer
ab5df1a236
feature complete gemini audio, transcription, and phrase endpointing demo
2024-12-22 11:19:02 -08:00
Kwindla Hultman Kramer
f5f0de00e4
still some cleanup to do
2024-12-21 23:04:00 -08:00
Kwindla Hultman Kramer
f3dd35bfd9
working but needs cleanup
2024-12-21 22:18:56 -08:00
Kwindla Hultman Kramer
53a5e63990
function calling dead-end
2024-12-21 18:10:25 -08:00
Kwindla Hultman Kramer
d435a6a6d6
fixes to audio buffer
2024-12-21 16:22:53 -08:00
Kwindla Hultman Kramer
59240c7b96
delay gemini multimodal live websocket connect
2024-12-21 14:36:37 -08:00
Mark Backman
6c11753985
Add the ability to send_prebuilt_chat_message when using the DailyTransport
2024-12-21 14:04:46 -05:00
Mark Backman
6fabb7e7d5
Fix metrics calculations
2024-12-21 13:25:43 -05:00
Mark Backman
bce218915e
Add Fish to the README
2024-12-21 12:54:07 -05:00
Mark Backman
627c91f4a6
Flush the audio
2024-12-21 12:52:28 -05:00
Mark Backman
dac4468ca1
Add Fish Audio TTS service
2024-12-21 12:42:56 -05:00
Mark Backman
503eddf7d6
Merge pull request #897 from pipecat-ai/mb/update-playht
...
Update PlayHT to use the latest Websocket connection endpoint
2024-12-20 20:31:41 -05:00
Aleix Conchillo Flaqué
1a0f6f2a21
Merge pull request #898 from pipecat-ai/aleix/reset-input-queue-flag-if-interruption
...
frame_processor: reset input queue flag with interruptions
2024-12-20 13:58:12 -08:00
Aleix Conchillo Flaqué
43759295cc
frame_processor: reset input queue flag with interruptions
2024-12-20 09:33:20 -08:00
Mark Backman
900b95eb92
Update PlayHT to use the latest Websocket connection endpoint
2024-12-20 10:44:47 -05:00
marcus-daily
41d07692ca
Fix import order
2024-12-20 14:30:38 +00:00
marcus-daily
dcf6b6e120
Add an RTVIProcessor to the simple-chatbot pipeline
2024-12-20 14:30:38 +00:00
Mark Backman
99dba3b6b9
Merge pull request #893 from pipecat-ai/mb/changelog-11L
...
Added an `auto_mode` input parameter to `ElevenLabsTTSService`
2024-12-19 21:38:06 -05:00
Aleix Conchillo Flaqué
4547609ffb
examples(01a): remove unused import
2024-12-19 17:49:27 -08:00
Mark Backman
9554804a49
Update 11L default model, allow language to be used by more models
2024-12-19 20:33:58 -05:00
Mark Backman
656cbc35e1
Make auto_mode an input parametere for ElevenLabsTTSService; add changelog entry
2024-12-19 20:33:56 -05:00
Aleix Conchillo Flaqué
6f7c4dd998
Merge pull request #894 from pipecat-ai/aleix/daily-python-0.14.0
...
transports(daily): update to daily-python 0.14.0
2024-12-19 17:14:31 -08:00
Aleix Conchillo Flaqué
8b496f8c6f
transports(daily): daily-python 0.14.0 (SIP transfer/refer, DTMF)
2024-12-19 17:08:29 -08:00
Aleix Conchillo Flaqué
15047f5f0a
Merge pull request #885 from pipecat-ai/aleix/parallelpipeline-wait-for-slowest-endframe
...
pipeline(parallel): wait for slowest endframe
2024-12-19 15:18:22 -08:00
Aleix Conchillo Flaqué
e08c24dc41
Merge pull request #883 from pipecat-ai/aleix/base-output-transport-avoid-pushing-endframe
...
transport(base output): avoid pushing EndFrame twice
2024-12-19 11:26:31 -08:00
Aleix Conchillo Flaqué
5341739ece
transport(base output): avoid pushing EndFrame twice
2024-12-19 11:19:49 -08:00
Mark Backman
5b0fc3fa15
Merge pull request #891 from louisjoecodes/louis/flush-shorter-messages-elevenlabs
...
feat: set auto_mode=true - ElevenLabs tts WSS
2024-12-19 12:08:04 -05:00
Louis Jordan
b7b8e59e9e
feat: set auto_mode=true - ElevenLabs tts WSS
2024-12-19 16:57:17 +00:00
Mark Backman
6e0d3aef32
Merge pull request #860 from pipecat-ai/mb/transcription
...
Add a TranscriptProcessor and new frames
2024-12-19 08:15:53 -05:00
Mark Backman
1ccc84dd7a
Merge pull request #888 from pipecat-ai/mb/add-cerebras
...
Add CerebrasLLMService and foundational example
2024-12-19 08:14:53 -05:00
Mark Backman
c9dd906057
Tailor chat completion inputs to Cerebras API
2024-12-19 08:10:33 -05:00
Mark Backman
4f093f11db
Add CerebrasLLMService and foundational example
2024-12-19 08:10:31 -05:00
Mark Backman
887a9170b2
Merge pull request #889 from pipecat-ai/mb/openai-realtime-model
...
Add model parameter to OpenAI realtime service constructor, update de…
2024-12-19 08:08:52 -05:00
Aleix Conchillo Flaqué
f2e191855a
Merge pull request #881 from pipecat-ai/aleix/langchain-updates
...
pyproject: update langchaing to 0.3.12
2024-12-18 19:42:39 -08:00
Aleix Conchillo Flaqué
78b90e9591
Merge pull request #884 from pipecat-ai/aleix/filters-handle-endframe
...
processors(filters): allow passing EndFrame
2024-12-18 19:35:56 -08:00
Aleix Conchillo Flaqué
17decee788
Merge pull request #882 from pipecat-ai/aleix/stop-transport-parent-first
...
transports: call parent stop() before disconnecting
2024-12-18 19:35:39 -08:00
Aleix Conchillo Flaqué
f89014d100
pyproject: update langchaing to 0.3.12
2024-12-18 19:34:49 -08:00
Mark Backman
3b3e22fe7c
Add model parameter to OpenAI realtime service constructor, update default model
2024-12-18 18:12:51 -05:00
Aleix Conchillo Flaqué
0df0194cc1
Merge pull request #886 from pipecat-ai/aleix/koala-noise-suppression
...
audio(koala): add new audio filter KoalaFilter
2024-12-18 14:02:04 -08:00
Mark Backman
8a7a61914e
Code review feedback
2024-12-17 22:35:13 -05:00
Mark Backman
1117c21483
Refactor TranscriptProcessor into user and assistant processors
2024-12-17 22:34:22 -05:00
Mark Backman
4211664a77
TranscriptProcessor to handle simple and list content
2024-12-17 22:34:03 -05:00
Mark Backman
1f8a217cd1
Code review changes
2024-12-17 22:34:02 -05:00
Mark Backman
b5bd662fe1
Add changelog and rename examples
2024-12-17 22:33:39 -05:00
Mark Backman
dd2703317a
Add timestamp frames and include timestamps in the transcription event and frame
2024-12-17 22:31:15 -05:00
Mark Backman
77aeda36eb
Update OpenAI's from_standard_message to convert back to OpenAI's simple format
2024-12-17 22:31:15 -05:00
Mark Backman
51b235df4b
Add docstrings for Google and Anthropic's to_standard_messages and from_standard_message functions
2024-12-17 22:31:15 -05:00
Mark Backman
4f2aee5fba
Update OpenAI's to_standard_messages to return the verboase message format
2024-12-17 22:31:15 -05:00
Mark Backman
55879bf365
Add TranscriptionProcessor
2024-12-17 22:31:15 -05:00
Aleix Conchillo Flaqué
7322badbe7
audio(koala): add new audio filter KoalaFilter
2024-12-17 18:45:10 -08:00
Aleix Conchillo Flaqué
42bea578e8
pipeline(parallel): wait for slowest endframe
...
If we are sending an EndFrame and a ParallelPipeline has multiple pipelines we
want to wait before pushing the EndFrame downstream until the slowest pipeline
is finished. Otherwise, we could be disconnecting from the transport too early.
2024-12-17 17:05:11 -08:00
Aleix Conchillo Flaqué
2dfdceb9e6
processors(filters): allow passing EndFrame
2024-12-17 16:22:19 -08:00
Aleix Conchillo Flaqué
5bfcac1f5c
transports: call parent stop() before disconnecting
...
This rollbacks a previous change https://github.com/pipecat-ai/pipecat/pull/855
which was trying to fix an issue in the wrong way.
The reasoning behind this fix is that the parent class might be sending audio or
messages (through the subclass) and if we disconnect before all the data is sent
we will run into incomplete audio or even errors. Therefore, we first make sure
the parent tasks stop and then it will be safe to disconnect.
2024-12-17 16:02:33 -08:00
Aleix Conchillo Flaqué
fb9f72d38b
Merge pull request #880 from pipecat-ai/aleix/ruff-check-import-linter
...
ruff check import linter
2024-12-17 14:14:47 -08:00
Aleix Conchillo Flaqué
146a341a38
Merge pull request #879 from Vaibhav159/vl_add_readme_for_ruff_formatter_in_pycharm
...
updating readme to support auto-formatting of ruff in pycharm
2024-12-17 11:49:01 -08:00
Aleix Conchillo Flaqué
b9ca667d31
pyproject: use tool.ruff.lint sections
2024-12-17 11:40:43 -08:00
Aleix Conchillo Flaqué
5c57cccea3
github: run ruff check import linter
2024-12-17 11:29:28 -08:00
Aleix Conchillo Flaqué
17162258a2
fix ruff linter import organization
2024-12-17 11:28:58 -08:00
Aleix Conchillo Flaqué
da3fb98101
examples(storytelling-chatbot): update dependencies
2024-12-17 11:24:50 -08:00
Aleix Conchillo Flaqué
6244124d14
README: added Emacs import re-organization with Ruff
2024-12-17 11:20:18 -08:00
Vaibhav159
53049adeea
removing --config flag
2024-12-18 00:47:00 +05:30
Vaibhav159
4208d2d7c4
updating readme to support auto-formatting of ruff in pycharm
2024-12-17 23:38:36 +05:30
Mark Backman
9f7f74e4d8
Merge pull request #869 from Vaibhav159/vl_fixing_deepgram_language_bug_#868
...
fixing [#868 ] bug where deepgram client fails due to langauge
2024-12-17 12:50:57 -05:00
Vaibhav159
f14d32d09e
fixing ruff issue
2024-12-17 23:11:18 +05:30
Vaibhav159
7351e281e2
ruff change
2024-12-17 22:21:56 +05:30
Vaibhav159
b94b10f7d6
added change log
2024-12-17 22:11:52 +05:30
Vaibhav159
1cc90eb1a3
Merge branch 'main' into vl_fixing_deepgram_language_bug_#868
2024-12-17 22:09:30 +05:30
Vaibhav159
5f7d28bb05
adding type check and value check
2024-12-17 22:07:35 +05:30
Mark Backman
204a08ab8f
Merge pull request #877 from pipecat-ai/mb/grok-function-calling-fix
...
Add custom assistant context aggregator for Grok due to content requi…
2024-12-17 10:51:19 -05:00
Aleix Conchillo Flaqué
141b0a6560
sentry: fix formatting
2024-12-17 07:14:31 -08:00
Mark Backman
ca086a856f
Add custom assistant context aggregator for Grok due to content requirement in function calling
2024-12-17 09:11:21 -05:00
Aleix Conchillo Flaqué
fe0a7d07bd
update CHANGELOG
2024-12-16 21:02:38 -08:00
Aleix Conchillo Flaqué
79eb29d614
Merge pull request #875 from pipecat-ai/aleix/update-dependencies
...
update dependencies
2024-12-16 20:58:30 -08:00
Aleix Conchillo Flaqué
da15c83bab
fix ruff formatting
2024-12-16 20:52:40 -08:00
Aleix Conchillo Flaqué
d6bac77b3c
pyproject: add audioop-lts for python 3.13
2024-12-16 20:50:25 -08:00
Aleix Conchillo Flaqué
7faa4eb295
update dev-requirements
2024-12-16 20:50:25 -08:00
Aleix Conchillo Flaqué
0e31413851
pyproject: update numpy, pydantic, loguru
2024-12-16 19:20:34 -08:00
Aleix Conchillo Flaqué
16948b251d
services: fix infinite websocket-bases TTS services retries
...
Fixes #871
2024-12-16 16:36:44 -08:00
Mark Backman
f3112a8638
Merge pull request #866 from pipecat-ai/mb/readme-links
...
Fix a bunch of README docs links
2024-12-16 10:51:01 -05:00
Mark Backman
0293d40e4e
Merge pull request #870 from pipecat-ai/mb/dotenv
...
Add python-dotenv to dev-requirements.txt
2024-12-16 10:50:46 -05:00
Mark Backman
64038442ed
Add python-dotenv to dev-requirements.txt
2024-12-16 09:23:12 -05:00
Vaibhav159
facc280599
fixing [ #868 ] bug where deepgram client fails due to langauge
2024-12-16 17:47:50 +05:30
Mark Backman
f90cbe8086
Fix a bunch of README docs links
2024-12-15 14:30:20 -05:00
Mark Backman
09a611d44b
Merge pull request #856 from pipecat-ai/mb/daily-rest-helpers
...
Remove default 5 min exp time for created rooms, add docstrings
2024-12-13 12:08:58 -05:00
Mark Backman
16d7fb2c4a
Remove default 5 min exp time for created rooms, add docstrings
2024-12-13 12:02:26 -05:00
Aleix Conchillo Flaqué
643160c960
Merge pull request #858 from pipecat-ai/aleix/fastpitch-timeout
...
riva: make sure we don't block on fastpitch
2024-12-13 08:20:38 -08:00
Aleix Conchillo Flaqué
aac907aadb
riva: make sure we don't block on fastpitch
2024-12-13 07:32:51 -08:00
Aleix Conchillo Flaqué
8f24ca4e58
Merge pull request #857 from pipecat-ai/aleix/fix-riva-tts-audio-stuttering
...
riva: fix FastPitchTTSService audio stuttering
2024-12-12 22:20:00 -08:00
Aleix Conchillo Flaqué
420ce16807
riva: fix FastPitchTTSService audio stuttering
2024-12-12 22:15:44 -08:00
Aleix Conchillo Flaqué
2b8c35c681
Merge pull request #855 from pipecat-ai/aleix/transport-services-disconnect-fixes
...
transports(services): disconnect client first
2024-12-12 19:40:03 -08:00
Mark Backman
3d96369193
Merge pull request #852 from pipecat-ai/mb/readme-docs-badge
...
Add docs badge to README
2024-12-12 22:21:41 -05:00
Aleix Conchillo Flaqué
d44b36a07c
Merge pull request #854 from pipecat-ai/aleix/aiservice-add-missing-process-frame
...
AIService: add missing super().process_frame()
2024-12-12 19:10:21 -08:00
Aleix Conchillo Flaqué
ccc96994e9
pyproject: update livekit
2024-12-12 19:09:36 -08:00
Aleix Conchillo Flaqué
337d421338
transports: disconnect client first
2024-12-12 19:09:06 -08:00
Aleix Conchillo Flaqué
752720b4d5
AIService: add missing super().process_frame()
2024-12-12 17:25:38 -08:00
Aleix Conchillo Flaqué
f8e69cfa00
Merge pull request #853 from pipecat-ai/revert-849-aleix/no-need-for-super-process-frame
...
Revert "no longer necessary to call super().process_frame(frame, direction)"
2024-12-12 17:21:20 -08:00
Aleix Conchillo Flaqué
6d11911d83
Revert "no longer necessary to call super().process_frame(frame, direction)"
2024-12-12 17:03:40 -08:00
Mark Backman
ec6e71c8ea
Add docs badge to README
2024-12-12 18:08:24 -05:00
Aleix Conchillo Flaqué
10f854aeba
Merge pull request #846 from pipecat-ai/aleix/base-output-transport-audio-sync
...
transport(output): fix non-audio frames sync after audio frames
2024-12-12 14:29:42 -08:00
Aleix Conchillo Flaqué
d8caf007b0
Merge pull request #849 from pipecat-ai/aleix/no-need-for-super-process-frame
...
no longer necessary to call super().process_frame(frame, direction)
2024-12-12 14:29:10 -08:00
Mark Backman
26ea64ef12
Merge pull request #850 from pipecat-ai/mb/fix-docs-builds
...
Fix docs generation build issues
2024-12-12 17:27:00 -05:00
Mark Backman
19c178ebc7
Fix docs generation build issues
2024-12-12 17:18:04 -05:00
Aleix Conchillo Flaqué
3c3fd67d96
no longer necessary to call super().process_frame(frame, direction)
2024-12-12 13:03:41 -08:00
Mark Backman
7bbc0ee8df
Merge pull request #845 from pipecat-ai/mb/more-docs-updates
...
Docs auto-gen improvements
2024-12-12 15:42:34 -05:00
Mark Backman
67804edce6
Remove formats from .readthedocs.yaml
2024-12-12 15:41:11 -05:00
Mark Backman
ec082d0888
Remove deprecated VAD module
2024-12-12 15:32:38 -05:00
Mark Backman
8631d71d5a
Fix more missing docs
2024-12-12 15:16:37 -05:00
Vaibhav159
62fc95300b
adding livekit audio and chat version
2024-12-13 01:09:47 +05:30
Aleix Conchillo Flaqué
db7eaed980
transport(output): fix non-audio frames sync after audio frames
2024-12-12 10:56:02 -08:00
Mark Backman
44c5220104
Update README
2024-12-12 13:28:05 -05:00
Mark Backman
276fd86ecb
More fixes for missing packages
2024-12-12 13:25:13 -05:00
Mark Backman
2de0737056
Merge pull request #844 from pipecat-ai/cb-gemini-example-fix
...
Update requirements.txt for simple-chatbot
2024-12-12 11:18:58 -05:00
Mark Backman
b5d5a0e923
Add special cases for displaying some names
2024-12-12 11:15:36 -05:00
Mark Backman
f3ed12c30b
Clean up module and package display names
2024-12-12 11:11:53 -05:00
Mark Backman
e14399727b
Add README and build script for local testing
2024-12-12 11:06:53 -05:00
Mark Backman
414dcf9810
Improve TOC in sidebar, fix missing services
2024-12-12 11:06:09 -05:00
chadbailey59
88d530e840
Update requirements.txt for simple-chatbot
...
The gemini example doesn't actually work from a fresh install, because the requirements.txt file doesn't include google :)
2024-12-12 09:31:15 -06:00
Aleix Conchillo Flaqué
af821d8e95
Merge pull request #841 from pipecat-ai/aleix/aws-to-polly
...
polly: renamed AWSTTSService to PollyTTSService
2024-12-11 18:13:02 -08:00
Aleix Conchillo Flaqué
133e1aff6c
polly: renamed AWSTTSService to PollyTTSService
2024-12-11 17:56:43 -08:00
Aleix Conchillo Flaqué
def415f476
Merge pull request #840 from pipecat-ai/aleix/11labs-playht-more-languages
...
tts: support more languages in playht and elevenlabs
2024-12-11 14:58:03 -08:00
Aleix Conchillo Flaqué
a34d16dabe
tts: support more languages in playht and elevenlabs
2024-12-11 14:53:24 -08:00
Mark Backman
ec7260b237
Merge pull request #839 from pipecat-ai/mb/bump-versions
...
Bump openai and aiohttp package versions
2024-12-11 17:06:15 -05:00
Mark Backman
96c6c71d5b
Bump openai and aiohttp package versions
2024-12-11 16:48:36 -05:00
Aleix Conchillo Flaqué
8e140b2be6
Merge pull request #838 from pipecat-ai/aleix/prepare-0.0.50
...
update CHANGELOG fot 0.0.50
2024-12-11 11:49:15 -08:00
Aleix Conchillo Flaqué
a70c785b2e
update CHANGELOG fot 0.0.50
2024-12-11 11:33:13 -08:00
Aleix Conchillo Flaqué
f1d3c5e9ad
Merge pull request #837 from pipecat-ai/aleix/update-protobuf-to-5.29.1
...
pyproject: update protobuf to 5.29.1
2024-12-11 11:31:49 -08:00
Aleix Conchillo Flaqué
346329ba73
pyproject: update protobuf to 5.29.1
2024-12-11 11:29:48 -08:00
Aleix Conchillo Flaqué
6089d4255c
Merge pull request #836 from pipecat-ai/aleix/moondream-studypal-fixes
...
examples: fixes for moondream-chatbot and studypal
2024-12-11 11:16:09 -08:00
Aleix Conchillo Flaqué
cff9bb6068
Merge pull request #835 from pipecat-ai/aleix/even-more-parallel-pipeline-fixes
...
parallel_pipeline: fix system frames and parallel pipelines again
2024-12-11 11:15:59 -08:00
Aleix Conchillo Flaqué
fdefdc9d68
Merge pull request #834 from pipecat-ai/aleix/transcription-are-text
...
frames: transcriptions should be TextFrames as before
2024-12-11 11:15:43 -08:00
Aleix Conchillo Flaqué
2dd418a38d
parallel_pipeline: fix system frames and parallel pipelines again
...
The previous fixes didn't take into account that system frames can be generated
inside the internal pipelines.
2024-12-11 10:55:04 -08:00
Aleix Conchillo Flaqué
42f5ec20f6
examples: fixes for moondream-chatbot and studypal
2024-12-11 10:46:38 -08:00
Aleix Conchillo Flaqué
5b5125b74c
frames: transcriptions should be TextFrames as before
2024-12-11 10:42:38 -08:00
Mark Backman
be4df5f713
Merge pull request #833 from pipecat-ai/mb/update-changelog-for-gemini
...
Update the CHANGELOG and README for Gemini Multimodal Live
2024-12-11 11:41:42 -05:00
Mark Backman
5418cdc4d1
Update the CHANGELOG and README for Gemini Multimodal Live
2024-12-11 11:40:16 -05:00
Mark Backman
6c9f5a81dc
Merge pull request #832 from pipecat-ai/khk/gemini-live-function-calling
...
Gemini Multimodal Live function calling example
2024-12-11 11:39:19 -05:00
Mark Backman
027e360436
Fix demo numbering and prompt the bot to say hi in 26b
2024-12-11 11:36:38 -05:00
Kwindla Hultman Kramer
c219172266
Gemini Multimodal Live function calling example
2024-12-11 08:29:09 -08:00
Mark Backman
7b040be209
Merge pull request #830 from pipecat-ai/khk/gemini-multimodal-live
...
Gemini Multimodal Live API service
2024-12-11 11:25:55 -05:00
Mark Backman
0d74531f36
Minor changes to demos
2024-12-11 11:23:59 -05:00
Mark Backman
3341c4f608
Merge pull request #831 from pipecat-ai/mb/gemini-simple-chatbot
...
Gemini updates to the simple-chatbot demo
2024-12-11 11:15:15 -05:00
Mark Backman
1e45e55528
Add copyright block to audio_transcriber
2024-12-11 11:06:48 -05:00
Mark Backman
8086a94e49
Renumber foundational demos
2024-12-11 10:56:51 -05:00
Kwindla Hultman Kramer
81895f4a5c
Gemini Multimodal Live API service
2024-12-11 07:38:23 -08:00
Mark Backman
2846d6f461
Update READMEs and comment files
2024-12-11 00:06:35 -05:00
Mark Backman
14f309ce2b
Add Gemini Live bot file
2024-12-10 22:25:17 -05:00
Aleix Conchillo Flaqué
62ec2f5d1e
Merge pull request #814 from pipecat-ai/aleix/simli-updates
...
minor simli updates
2024-12-10 18:48:29 -08:00
Aleix Conchillo Flaqué
4f9a4ebce2
Merge pull request #820 from pipecat-ai/aleix/more-parallelpipeline-fixes
...
parallel_pipeline: fix system frames again
2024-12-10 18:43:34 -08:00
Aleix Conchillo Flaqué
5b478a5c7a
add SimliVideoService to CHANGELOG
2024-12-10 18:42:26 -08:00
Aleix Conchillo Flaqué
87c1f2bcce
services(simli): remove ready flag, events vs sleep, handle CancelledError
2024-12-10 18:42:12 -08:00
Aleix Conchillo Flaqué
b85072637f
examples(26-simli-layer): use room returned by configure()
2024-12-10 18:42:12 -08:00
Aleix Conchillo Flaqué
ffe1e023e7
Merge pull request #819 from pipecat-ai/aleix/fix-openaillmcontext-from-image-frame
...
fix OpenAILLMContext from image frame
2024-12-10 18:39:55 -08:00
Aleix Conchillo Flaqué
9a358b2e86
Merge pull request #824 from pipecat-ai/aleix/openpipe-use-openai-base-service
...
services(openpipe): use OpenAILLMService to get access to aggregators
2024-12-10 18:34:46 -08:00
Aleix Conchillo Flaqué
b034c6e247
Merge pull request #821 from pipecat-ai/aleix/update-pyproject
...
pyproject: update onnxruntime, whisper and azure
2024-12-10 18:34:27 -08:00
Aleix Conchillo Flaqué
c7ca0eea0f
Merge pull request #823 from pipecat-ai/aleix/fix-15a-switch-languages
...
examples: fix 15a-switch-languages pipeline
2024-12-10 18:34:13 -08:00
Aleix Conchillo Flaqué
29d931cdcd
Merge pull request #822 from pipecat-ai/aleix/fix-11-sound-effects
...
examples: fix 11-sound-effects
2024-12-10 18:33:53 -08:00
Aleix Conchillo Flaqué
ecf0c61af9
services(openpipe): use OpenAILLMService to get access to aggregators
2024-12-10 18:29:03 -08:00
Aleix Conchillo Flaqué
67e8252d76
examples: fix 15a-switch-languages pipeline
2024-12-10 18:27:49 -08:00
Aleix Conchillo Flaqué
775aa9493e
examples: fix 11-sound-effects
2024-12-10 18:25:43 -08:00
Aleix Conchillo Flaqué
c446f91d4a
pyproject: update onnxruntime, whisper and azure
2024-12-10 18:16:27 -08:00
Aleix Conchillo Flaqué
7b6bbc29ed
parallel_pipeline: fix system frames again
2024-12-10 18:12:33 -08:00
Aleix Conchillo Flaqué
9e7ecccf1e
google: fix VisionImageRawFrame context
2024-12-10 17:39:52 -08:00
Aleix Conchillo Flaqué
a618bd3fa6
openai: remove from_image_frame() and use add_image_frame_message()
2024-12-10 17:39:52 -08:00
Aleix Conchillo Flaqué
246c825a82
examples: rename 07p-interruptible-google-audio-in to 07s
2024-12-10 17:07:17 -08:00
Aleix Conchillo Flaqué
9e6fabf110
Merge pull request #818 from pipecat-ai/aleix/fastpitch-rename
...
riva: rename FastpitchTTSService to FastPitchTTSService
2024-12-10 13:36:38 -08:00
Aleix Conchillo Flaqué
d2dabe4358
riva: rename FastpitchTTSService to FastPitchTTSService
2024-12-10 13:30:43 -08:00
Vanessa Pyne
1db624575f
Merge pull request #795 from pipecat-ai/vp-nvidia-riva
...
[WIP] add nvidia riva
2024-12-10 15:17:26 -06:00
vipyne
a49b4e450b
services(riva): check service config before running tts
2024-12-10 15:15:46 -06:00
vipyne
9211a37efc
services(riva): convention tweaks
2024-12-10 15:15:46 -06:00
vipyne
3f9d39329c
services(riva): model -> function_id
2024-12-10 15:15:46 -06:00
vipyne
5a98ae6380
chore: update test-requirements
2024-12-10 15:15:46 -06:00
vipyne
8caad15e9b
examples trivial update
2024-12-10 15:15:46 -06:00
vipyne
9222d9f721
services(riva): cleanup
2024-12-10 15:15:46 -06:00
vipyne
5a467a30a3
add nvidia riva - fastpitch
2024-12-10 15:15:46 -06:00
Aleix Conchillo Flaqué
d74e728332
pyproject: update google-cloud-texttospeech to 2.21.1
2024-12-10 15:15:46 -06:00
vipyne
8a9fdaf441
services(riva): cleanup
2024-12-10 15:15:46 -06:00
Aleix Conchillo Flaqué
4b55c73fbe
services(riva): make FastpitchTTSService asyncio
2024-12-10 15:15:46 -06:00
Aleix Conchillo Flaqué
7e407e5548
services(riva): first working version of ParakeetSTTService
2024-12-10 15:15:46 -06:00
Aleix Conchillo Flaqué
ce94421c90
pyproject: add riva option and update protobuf and playht
2024-12-10 15:15:46 -06:00
vipyne
49ce3dcb27
add nvidia riva - fastpitch
2024-12-10 15:15:46 -06:00
Aleix Conchillo Flaqué
6ba2dea6f0
Merge pull request #812 from zzz-heygen/zzz/fix_serializer_backward_compat
...
fix: make ProtobufFrameSerializer backwards compatible
2024-12-10 13:11:09 -08:00
Aleix Conchillo Flaqué
9ac34ac371
Merge pull request #816 from pipecat-ai/aleix/rtvi-version-update
...
rtvi: update protocol version to 0.3.0
2024-12-10 11:52:28 -08:00
Aleix Conchillo Flaqué
a8644d2129
Merge pull request #815 from pipecat-ai/aleix/identity-filter
...
processors(filters): add IdentityFilter
2024-12-10 11:09:20 -08:00
Aleix Conchillo Flaqué
3bf15476a4
processors(filters): add IdentityFilter
2024-12-10 11:01:59 -08:00
Aleix Conchillo Flaqué
acb3e21432
rtvi: update protocol version to 0.3.0
2024-12-10 10:57:42 -08:00
Mark Backman
8c9c81d84b
Merge pull request #810 from pipecat-ai/mb/read-the-docs
...
Changes for Read the Docs hosting
2024-12-10 12:48:26 -05:00
Aleix Conchillo Flaqué
e51e2f781d
Merge pull request #765 from simliai/simli
...
Add Simli Service
2024-12-10 09:23:06 -08:00
Dan Goodman
af6f5ecc86
customize Anthropic client via kwargs, also bumps default model version ( #813 )
...
* customize Anthropic client via kwargs
* bump default model
2024-12-10 09:13:44 -08:00
antonyesk601
81a18633ca
Remove duplicate frame push if simli connection isn't ready
2024-12-10 10:18:31 +00:00
antonyesk601
397342d0b9
Inizialize simli_client on StartFrame; Follow variable naming scheme; Use logger instead of print statements;
2024-12-10 10:11:07 +00:00
zzz
d6b3a50108
x
2024-12-10 07:50:50 +00:00
Mark Backman
66b08161f1
Changes for Read the Docs hosting
2024-12-10 00:54:21 -05:00
Mark Backman
e7fa1cacce
Merge pull request #800 from pipecat-ai/mb/autogen-docs
...
Auto-generate API reference docs
2024-12-09 22:05:08 -05:00
Mark Backman
2d3864ee09
Move API docs generation to docs/api
2024-12-09 20:44:10 -05:00
Aleix Conchillo Flaqué
0287f06379
Merge pull request #809 from pipecat-ai/aleix/parallel-pipeline-fix-system-frames
...
fix system frames parallel pipeline
2024-12-09 15:48:27 -08:00
Mark Backman
681c8ffb1d
Merge pull request #807 from pipecat-ai/mb/stt-mute-strategy
...
Add new STT mute strategy, accept a set of strategies
2024-12-09 18:34:30 -05:00
Mark Backman
676643d558
Code review fixes
2024-12-09 18:27:07 -05:00
Mark Backman
0c4cbc2615
Push FunctionCall Frames upstream and downstream; update example
2024-12-09 18:27:07 -05:00
Aleix Conchillo Flaqué
e690c98230
transports(daily): no need for joining flag
...
This was put back because of an issue in ParallelPipeline but that issue is now
fixed so the joining check is not really necessary.
2024-12-09 09:38:30 -08:00
Aleix Conchillo Flaqué
e0a6c6871c
parallel_pipeline: don't queue system frames
2024-12-09 09:38:30 -08:00
Mark Backman
29a042a101
Add changelog entry
2024-12-09 10:52:32 -05:00
Mark Backman
1cc2da571e
Add new STT mute strategy, accept a set of strategies
2024-12-09 10:50:08 -05:00
Kwindla Hultman Kramer
c6b401b5d1
Merge pull request #805 from pipecat-ai/khk/parallel-pipeline-fix
...
Check to avoid double-join in ParallelPipeline case
2024-12-07 21:49:16 -08:00
Kwindla Hultman Kramer
315b7fcc34
check to avoid double-join
2024-12-07 21:22:36 -08:00
Mark Backman
e9f5fe0f37
Merge pull request #802 from Allenmylath/patch-22
...
Update README.md
2024-12-07 10:14:44 -05:00
allenmylath
64faf2218e
Update examples/patient-intake/README.md
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2024-12-07 19:08:00 +05:30
allenmylath
e77a785a7d
Update README.md
2024-12-07 13:36:50 +05:30
Mark Backman
03a269fb87
Merge pull request #801 from pipecat-ai/aleix/rtvi-handle-transport-urgent-frames
...
rtvi: handle transport urgent frames
2024-12-06 21:33:18 -05:00
Aleix Conchillo Flaqué
d1a55c6063
rtvi: handle transport urgent frames
2024-12-06 17:51:09 -08:00
Mark Backman
61d0fa42f1
Add a workflow to generate the docs
2024-12-06 20:32:33 -05:00
Mark Backman
16de1fca9b
Add Read the Docs config
2024-12-06 20:15:17 -05:00
Mark Backman
2ad83f23c8
Initial reference docs commit
2024-12-06 19:44:44 -05:00
Aleix Conchillo Flaqué
422ee98db0
Merge pull request #798 from pipecat-ai/aleix/functioncall-data-frames
...
frames: FunctionCallResultFrame should be a DataFrame as before
2024-12-06 16:38:23 -08:00
Aleix Conchillo Flaqué
3d4620cf95
frames: FunctionCallResultFrame should be a DataFrame as before
2024-12-06 11:54:50 -08:00
Aleix Conchillo Flaqué
752a6f02b5
Merge pull request #799 from pipecat-ai/aleix/cartesia-interruptions-fix
...
cartesia: fix broken interruptions
2024-12-06 11:52:22 -08:00
Aleix Conchillo Flaqué
7e41809ec2
cartesia: fix broken interruptions
2024-12-06 11:49:03 -08:00
Aleix Conchillo Flaqué
e344a73d14
Merge pull request #797 from pipecat-ai/aleix/xtts-default-language
...
services(xtts): default language to Language.EN
2024-12-06 11:00:53 -08:00
Aleix Conchillo Flaqué
d6f480fa50
Merge pull request #791 from pipecat-ai/aleix/fastapi-generic-websocket
...
FastAPIWebsocketTransport: fix to work with text and binary
2024-12-06 10:46:16 -08:00
Aleix Conchillo Flaqué
423d6485f8
services(xtts): default language to Language.EN
2024-12-06 10:45:20 -08:00
Aleix Conchillo Flaqué
842b3de7f5
FastAPIWebsocketTransport: fix to work with text and binary
2024-12-06 10:31:42 -08:00
Aleix Conchillo Flaqué
3cb7829624
update CHANGELOG
2024-12-06 10:31:11 -08:00
Aleix Conchillo Flaqué
4292507616
Merge pull request #793 from balalofernandez/send-interruption-to-cartesia
...
fix: Send interruption to cartesia
2024-12-06 10:26:34 -08:00
Aleix Conchillo Flaqué
98c9759f41
Merge pull request #796 from pipecat-ai/aleix/improve-tts-reconnection
...
services: improve Cartesia, 11Labs, PlayHT and LMNT TTS reconnection
2024-12-06 10:22:54 -08:00
Aleix Conchillo Flaqué
bafb867ffc
services: improve Cartesia, 11Labs, PlayHT and LMNT TTS reconnection
2024-12-06 10:11:59 -08:00
Mark Backman
b05809be2e
Merge pull request #794 from pipecat-ai/mb/upgrade-anthropic
...
Upgrade Anthropic to the latest to avoid collision with aiohttp 3.11.9
2024-12-06 12:01:51 -05:00
Mark Backman
57d346ce13
Upgrade Anthropic to the latest to avoid collision with aiohttp 3.11.9
2024-12-06 11:59:19 -05:00
balalo
9001cb17ce
Fix interruption frame to avoid issues with sending None
2024-12-06 17:42:46 +01:00
Mark Backman
40cfd9776f
Merge pull request #792 from pipecat-ai/mb/cartesia-languages
...
Add additional languages for Cartesia
2024-12-06 09:57:38 -05:00
Mark Backman
d68b3ad1b2
Add additional languages for Cartesia
2024-12-06 09:22:05 -05:00
Kwindla Hultman Kramer
9b51588b92
Merge pull request #782 from pipecat-ai/khk/flash-transcription
...
Async Google LLM + Gemini Flash transcription example
2024-12-05 12:50:18 -08:00
Aleix Conchillo Flaqué
9a36a4ca32
Merge pull request #790 from pipecat-ai/aleix/base-output-transport-wait-for-output-tasks
...
transports(base_output): wait for output tasks on EndFrame
2024-12-05 11:30:55 -08:00
Aleix Conchillo Flaqué
f80a97b545
transports(base_output): wait for output tasks on EndFrame
2024-12-05 11:26:18 -08:00
Mark Backman
274278e229
Merge pull request #789 from pipecat-ai/mb/update-simple-chatbot-demo
...
Add RTVI transcripts, align styling
2024-12-05 11:56:07 -05:00
Mark Backman
6b94bcac03
Add RTVI transcripts, align styling
2024-12-05 11:12:48 -05:00
Aleix Conchillo Flaqué
969b87dee9
update aiohttp version to 3.11.9
2024-12-05 07:35:21 -08:00
balalo
bc699735a3
Send interruption message to cartesia
2024-12-05 16:23:40 +01:00
Mark Backman
00fd381808
Merge pull request #745 from pipecat-ai/mb/user-idle
...
Only run the UserIdleProcessor while pipeline is running
2024-12-05 10:12:02 -05:00
Mark Backman
672b1c6d73
Merge pull request #786 from Allenmylath/patch-21
...
Update README.md
2024-12-05 09:15:24 -05:00
Mark Backman
f455eb171b
Merge pull request #784 from pipecat-ai/mb/simple-bot-client
...
Update the simple-chatbot demo to have JS and React clients
2024-12-05 08:34:33 -05:00
allenmylath
62c8c90e17
Update README.md
2024-12-05 13:23:05 +05:30
Aleix Conchillo Flaqué
28bb448605
Merge pull request #783 from pipecat-ai/aleix/deepgram-vad-event-handlers
...
deepgram: add VAD event handlers
2024-12-04 19:35:22 -08:00
Aleix Conchillo Flaqué
3d76b30a7c
deepgram: add VAD event handlers
2024-12-04 19:31:09 -08:00
Aleix Conchillo Flaqué
0ae8ca0813
Merge pull request #781 from pipecat-ai/aleix/websocket-transports-mixer-fixes
...
websocket transports mixer fixes
2024-12-04 19:12:20 -08:00
Aleix Conchillo Flaqué
0935d773f5
transport(websockets): fix initial busy loop when using audio mixers
2024-12-04 19:10:39 -08:00
Aleix Conchillo Flaqué
e0f7a8a9f4
audio(mixer): SoundfileMixer doesn't resample files anymore
2024-12-04 19:09:50 -08:00
Aleix Conchillo Flaqué
2a0e01898f
Merge pull request #780 from pipecat-ai/aleix/gstreamer-default-sample-rate
...
gstreamer: update default sample rate to 24000
2024-12-04 19:09:02 -08:00
Aleix Conchillo Flaqué
9d25e325dd
Merge pull request #779 from pipecat-ai/aleix/websocket-server-audio-mixins-fix
...
frames: fix AudioRawFrame mixin
2024-12-04 19:08:41 -08:00
Aleix Conchillo Flaqué
37c21426bf
Merge pull request #778 from pipecat-ai/aleix/transports-disconnect-on-last-transport
...
transports: fix premature input transport closing
2024-12-04 19:08:23 -08:00
Mark Backman
c467ec8ded
Merge pull request #772 from pipecat-ai/mb/nim-llm
...
Add a NIM LLM service
2024-12-04 21:41:09 -05:00
Kwindla Hultman Kramer
a367a038f1
fix for finally clause
2024-12-04 18:31:30 -08:00
Mark Backman
e45a123eab
Add image to README
2024-12-04 21:29:22 -05:00
Mark Backman
2ecc0e2b13
Remove node modules
2024-12-04 21:28:17 -05:00
Mark Backman
d532e924cd
Add .gitignore
2024-12-04 21:28:17 -05:00
Mark Backman
36208049dc
Update changelog
2024-12-04 21:28:17 -05:00
Mark Backman
1d11419691
Update the simple-chatbot demo to have JS and React clients
2024-12-04 21:13:14 -05:00
Mark Backman
05451f882d
Merge pull request #777 from pipecat-ai/mb/twilio-example
...
Improve twilio-chatbot README
2024-12-04 20:26:45 -05:00
Kwindla Hultman Kramer
9c22f5b81b
async google llm
2024-12-04 15:52:52 -08:00
Aleix Conchillo Flaqué
891f261191
gstreamer: update default sample rate to 24000
2024-12-04 14:41:44 -08:00
Aleix Conchillo Flaqué
13c27eaa1d
frames: fix AudioRawFrame mixin
2024-12-04 13:25:37 -08:00
Mark Backman
c395d1a234
Merge pull request #773 from Allenmylath/patch-20
...
Update README.md
2024-12-04 14:45:38 -05:00
Mark Backman
49639c8631
Improve the twilio-chatbot README
2024-12-04 14:42:05 -05:00
Mark Backman
695a98a1f7
Remove streams.xml from version control
2024-12-04 14:26:10 -05:00
Mark Backman
5cbc37472c
Update .gitignore to exclude streams.xml
2024-12-04 14:25:10 -05:00
Aleix Conchillo Flaqué
5b6d9a1050
transports: fix premature input transport closing
2024-12-04 10:56:57 -08:00
allenmylath
332d36475b
Update examples/patient-intake/README.md
...
Co-authored-by: Mark Backman <m.backman@gmail.com >
2024-12-04 23:27:25 +05:30
Mark Backman
29b67578e3
Update README
2024-12-04 12:52:09 -05:00
Mark Backman
9db3743901
Update pyproject.toml with a nim optional dep
2024-12-04 12:52:09 -05:00
Mark Backman
496aded031
Update changelog
2024-12-04 12:38:05 -05:00
Mark Backman
1c1fa0db65
Add a NIM LLM service
2024-12-04 12:35:24 -05:00
Mark Backman
a2ad40d7e0
Merge pull request #775 from pipecat-ai/mb/llm-stubs
...
Added LLM services for GroqLLMService and GrokLLMService
2024-12-04 12:26:19 -05:00
Mark Backman
2bb3682d88
Update README
2024-12-04 12:24:39 -05:00
Kwindla Hultman Kramer
f33f08d667
partially working audio+transcription parallel pipelines
2024-12-04 08:51:35 -08:00
Mark Backman
d9bc2b618f
Update FireworksLLMService to use OpenAILLMService
2024-12-04 11:51:05 -05:00
Mark Backman
d5a50e2cad
Update AzureLLMService to use OpenAILLMService
2024-12-04 11:01:56 -05:00
Mark Backman
7013343bf0
Update the changelog
2024-12-04 10:10:55 -05:00
Mark Backman
728acba8a5
Add LLMService stubs for Grok and Groq, add examples
2024-12-04 10:08:28 -05:00
allenmylath
3b2c78747c
Update README.md
2024-12-04 10:24:17 +05:30
allenmylath
44a0acffc8
Update README.md
2024-12-04 10:21:17 +05:30
Aleix Conchillo Flaqué
c31d5a4f1a
Merge pull request #771 from pipecat-ai/aleix/daily-execute-callbacks-from-task
...
transports(daily): use a task to execute callbacks
2024-12-03 19:55:38 -08:00
Aleix Conchillo Flaqué
52caaa4afb
transports(daily): use a task to execute callbacks
...
This commit fixes an issue where we were not waiting for
`asyncio.run_coroutine_threadsafe` to complete which can cause a series of
undesired issues (e.g. not actually executing the coroutine).
2024-12-03 18:58:54 -08:00
Aleix Conchillo Flaqué
115e75d808
Merge pull request #770 from pipecat-ai/aleix/system-input-frames-and-audio-buffer-processor
...
system input frames and audio buffer processor fixes
2024-12-03 18:58:13 -08:00
Mark Backman
897e024dd8
Only run the UserIdleProcessor while pipeline is running
2024-12-03 21:09:03 -05:00
Aleix Conchillo Flaqué
1cf93f1dcb
FrameProcessor: ignore other frames during CancelFrame
2024-12-03 16:26:29 -08:00
Aleix Conchillo Flaqué
d278996d5b
updated CHANGELOG
2024-12-03 16:12:40 -08:00
Aleix Conchillo Flaqué
322dd0cea1
AudioBufferProcessor: use on_audio_data event handler to retrieve audio
2024-12-03 16:12:40 -08:00
Aleix Conchillo Flaqué
a6a4910931
transports(services): incoming transport messages should be urgent
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
52cefaa9d6
frames: remove AppFrame
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
42658ecd92
frames: use mixins for audio and image data
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
a6606a4040
transports(base_output): remove unused code
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
d6c944cdc1
processors(audio): fix AudioBufferProcessor interruptions
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
a5c7b02a73
frames: input frames are now system frames
...
Input frames from a transport should be processed fast and there's no need for
them to be queued internally in each element.
2024-12-03 14:30:15 -08:00
Aleix Conchillo Flaqué
6b9223d87e
Merge pull request #768 from pipecat-ai/aleix/websocket-server-interruptions
...
transports(websockets): use frame serializers during interruptions
2024-12-02 19:18:20 -08:00
Aleix Conchillo Flaqué
c2135cbe11
transports(websockets): use frame serializers during interruptions
2024-12-02 19:17:17 -08:00
Aleix Conchillo Flaqué
32495ddd0b
Merge pull request #769 from pipecat-ai/aleix/daily-subscribe-video-source
...
transports(daily): subscribe to the desired video source
2024-12-02 19:16:14 -08:00
Aleix Conchillo Flaqué
4301f0abf7
Merge pull request #767 from pipecat-ai/aleix/warn-transcription-no-token
...
transports(daily): warn if transcription enabled but no token provided
2024-12-02 15:06:35 -08:00
Aleix Conchillo Flaqué
5e854c4d03
transports(daily): subscribe to the desired video source
2024-12-02 12:13:23 -08:00
Aleix Conchillo Flaqué
bec46a87ae
Merge pull request #766 from Allenmylath/patch-20
...
Update requirements.txt
2024-12-02 10:32:36 -08:00
Aleix Conchillo Flaqué
71cf94e936
transports(daily): warn if transcription enabled but no token provided
2024-12-02 09:55:17 -08:00
allenmylath
acbecf1c4c
Update requirements.txt
...
daily is not used here.transport is fastapi websocket.
2024-12-02 21:36:29 +05:30
Mark Backman
6095fd342e
Merge pull request #763 from Allenmylath/patch-19
...
Update README.md
2024-12-02 09:30:36 -05:00
Waleed
bf40b4936b
updated env template; added simli variables
2024-12-02 12:05:55 +01:00
Waleed
c60dd8d4d2
updated environment variable name for cartesia
2024-12-02 12:05:32 +01:00
Waleed
d472aaf391
updated readme. Added simli
2024-12-02 11:50:51 +01:00
Waleed
6cc0b74e6c
integrated simli
2024-12-02 11:35:46 +01:00
allenmylath
23316fbcf9
Update README.md
2024-12-02 13:35:44 +05:30
James Hush
5e22ef251d
fix: add logging and error handling for issue #721 ( #755 )
2024-11-29 13:06:45 +08:00
Mark Backman
c5324df807
Merge pull request #752 from pipecat-ai/mb/google-context-message-conversion
...
Use Google Gemini message format when adding message to the LLM context
2024-11-27 14:13:17 -05:00
Mark Backman
3c19a7ae3d
Use Google Gemini message format when adding message to the LLM context
2024-11-27 12:46:51 -05:00
Mark Backman
98c0a6e047
Merge pull request #749 from pipecat-ai/mb/pipecat-flows-standalone
...
Make Pipecat Flows an independent package
2024-11-25 17:09:11 -05:00
Mark Backman
f599e160de
Make Pipecat Flows an independent package
2024-11-25 13:42:08 -05:00
Mark Backman
11c5d822f9
Merge pull request #746 from pipecat-ai/mb/update-flows
...
Bumping pipecat-ai-flows version
2024-11-22 11:25:03 -05:00
Mark Backman
c3e22f0931
Bumping pipecat-ai-flows version
2024-11-22 11:21:40 -05:00
Kwindla Hultman Kramer
9409546f90
Merge pull request #743 from pipecat-ai/khk/gemini-exp
...
Empty text content bug fix for Gemini
2024-11-21 14:04:28 -08:00
Kwindla Hultman Kramer
8ddac0ccd8
Testing with gemini-exp-1114. Bug fix
2024-11-21 10:33:12 -08:00
Vaibhav159
6e8e7fa19a
adding session_timeout in fastapi
2024-11-21 14:56:42 +05:30
Vaibhav159
7dfa886669
moving logic to WebsocketServerInputTransport
2024-11-21 14:45:24 +05:30
Vaibhav159
da254c5143
correcting _monitor_websocket
2024-11-21 12:36:51 +05:30
Vaibhav159
e11f128110
adding on_session_timeout
2024-11-21 12:34:32 +05:30
Vaibhav-Lodha
3aa89fb13a
adding session_timeout param
2024-11-21 12:20:51 +05:30
Mark Backman
f938960d50
Merge pull request #736 from pipecat-ai/mb/language-support
...
Make language support more robust
2024-11-20 13:03:47 -05:00
Mark Backman
2981d87bc1
Update changelog
2024-11-20 12:56:35 -05:00
Mark Backman
106042bbb2
Make language support more robust
2024-11-20 12:56:11 -05:00
Filipi da Silva Fuchter
d25ddeb962
Merge pull request #739 from pipecat-ai/krisp_v7
...
bumping krisp to support v7
2024-11-20 11:39:39 -03:00
Filipi Fuchter
c441baa692
bumping krisp to support v7
2024-11-20 11:37:45 -03:00
Mark Backman
676ff14913
Merge pull request #735 from pipecat-ai/vp-internal-push-frame-fix
...
internal push frame fix
2024-11-20 06:34:40 -05:00
Vanessa Pyne
14893ade92
Update src/pipecat/processors/frame_processor.py
...
Co-authored-by: Mark Backman <mark@daily.co >
2024-11-19 22:37:58 -06:00
Mark Backman
2a39ff69d6
Merge pull request #720 from pipecat-ai/mb/conversation-flow
2024-11-19 21:46:20 -05:00
Mark Backman
e79289454a
Merge pull request #734 from pipecat-ai/mb/fix-cartesia
2024-11-19 21:27:52 -05:00
Mark Backman
25d02da1b2
Merge pull request #738 from pipecat-ai/mb/natural-conversation-demo
2024-11-19 21:27:38 -05:00
Mark Backman
a36fc370fa
Improve the 22c foundational example
2024-11-19 15:49:40 -05:00
Mark Backman
e4c2f6d4c2
Update changelog
2024-11-18 21:32:53 -05:00
Mark Backman
97659ca3f0
Use the new pipecat-ai-flows module
2024-11-18 21:29:35 -05:00
vipyne
e00c75ce3f
fix: raise exception in internal_push_frame
2024-11-18 16:01:04 -06:00
Mark Backman
cf62167f54
Revert: services(cartesia): generated TTSStoppedFrame after no more audio
2024-11-18 12:25:04 -05:00
Mark Backman
b3dfeb61c4
Add CHANGELOG entry
2024-11-18 12:18:20 -05:00
Mark Backman
bd020320cd
Support a list of messages
2024-11-18 12:18:20 -05:00
Mark Backman
7a55d2d7db
Add end session handler and update example
2024-11-18 12:18:20 -05:00
Mark Backman
b7308dca5d
Fix issue where actions would execute on terminating nodes
2024-11-18 12:18:20 -05:00
Mark Backman
5301f44b3b
Add pre- and post-actions
2024-11-18 12:18:20 -05:00
Mark Backman
686165b95a
Add ability to register actions
2024-11-18 12:18:20 -05:00
Mark Backman
4e0ecdd673
Class name updates and remove FrameProcessor base class
2024-11-18 12:18:20 -05:00
Mark Backman
1b74560f9d
Move function registration into the ConversationFlowProcessor class
2024-11-18 12:18:20 -05:00
Mark Backman
0c1070433f
Clean up and commenting
2024-11-18 12:18:20 -05:00
Mark Backman
ece2c08cde
debugging
2024-11-18 12:18:20 -05:00
Mark Backman
0b9742da9e
Add a conversation flow processor
2024-11-18 12:18:20 -05:00
Aleix Conchillo Flaqué
635aa6eb5b
Merge pull request #729 from pipecat-ai/aleix/fastapi-websocket-dont-close
...
transports(fastapi): don't try to close socket
2024-11-18 16:01:41 +01:00
Mark Backman
1ff17cc2b6
Merge pull request #733 from pipecat-ai/aleix/add-missing-init-files
...
processors: add missing __init__.py
2024-11-18 09:44:56 -05:00
Mark Backman
41ce9e9087
Merge pull request #697 from pipecat-ai/cst/leave-message
...
add handler for disconnect-bot message
2024-11-18 09:38:11 -05:00
Mark Backman
4803c54ecf
Update CHANGELOG
2024-11-18 09:36:19 -05:00
Christian Stuff
5d7b3f2b38
add handler for disconnect-bot message
2024-11-18 09:33:30 -05:00
Aleix Conchillo Flaqué
23e5b1ec4d
processors: add missing __init__.py
2024-11-18 11:32:20 +01:00
Aleix Conchillo Flaqué
7f5a8928b8
transports(fastapi): don't try to close socket
...
The websocket is passed from outside (in the transport constructor) so we should
not be trying to close it. FastAPI does actually close it later. We didn't see
any issue because these functions were not implemented properly. The value to
check was `application_state` instead of `client_state`. But in any case,
Pipecat should not be responsible for closing things passed from outside.
2024-11-18 01:15:19 +01:00
Aleix Conchillo Flaqué
53f675f5cf
Merge pull request #727 from pipecat-ai/aleix/pipecat-0.0.49
...
update CHANGELOG for 0.0.49
2024-11-18 06:27:12 +08:00
Aleix Conchillo Flaqué
8173e4ce55
update CHANGELOG for 0.0.49
2024-11-17 23:26:09 +01:00
Aleix Conchillo Flaqué
5445bb0363
rtvi: add on_bot_started event
2024-11-17 22:40:00 +01:00
Mark Backman
a2a94724e5
Merge pull request #725 from pipecat-ai/mb/fix-simple-chatbot
...
Fix simple-chatbot example
2024-11-16 12:10:05 -05:00
Aleix Conchillo Flaqué
a8f9b0635a
Merge pull request #722 from pipecat-ai/aleix/more-dailin-events
...
transports(daily): add more dial-in events
2024-11-17 01:09:01 +08:00
Mark Backman
4273a31fd5
Fix simple-chatbot example
2024-11-16 07:48:42 -05:00
Aleix Conchillo Flaqué
67f975a2c8
transports(daily): add more dial-in events
2024-11-16 01:22:50 +01:00
Mark Backman
d0bca67666
Merge pull request #716 from pipecat-ai/mb/mute-stt-service
...
Add STTMuteFilter to un/mute the STT
2024-11-14 19:55:00 -05:00
Mark Backman
966974bfc6
Change STTMuteProcessor to STTMuteFilter
2024-11-14 19:47:37 -05:00
Mark Backman
f807f233bd
Suppress UserStartedSpeakingFrame and UserStoppedSpeakingFrame when muted
2024-11-14 17:11:51 -05:00
Mark Backman
33108f5798
Code review feedback
2024-11-14 17:05:08 -05:00
Mark Backman
52de825af8
Update CHANGELOG
2024-11-14 13:47:08 -05:00
Mark Backman
5fe679039c
Add STTMuteProcessor to un/mute the STT
2024-11-14 13:35:02 -05:00
Kwindla Hultman Kramer
534f710f5d
Merge pull request #688 from pipecat-ai/khk/natural-conversation
...
More work on llm-as-judge phrase endpointing
2024-11-14 09:15:16 -08:00
Mark Backman
53a11744a8
Merge pull request #712 from pipecat-ai/aleix/some-languages-tweaks
...
some languages tweaks
2024-11-14 09:33:26 -05:00
Mark Backman
72412cc0c4
Code review feedback
2024-11-14 09:31:04 -05:00
Mark Backman
b77ac07bc6
Merge pull request #715 from pipecat-ai/mb/update-readme-2
...
Add visual divider below Pipecat README image
2024-11-14 08:54:25 -05:00
Mark Backman
eb6926e0ce
Add visual divider below Pipecat README image
2024-11-14 08:51:07 -05:00
Mark Backman
3b2c9de944
Merge pull request #713 from pipecat-ai/mb/update-readme
...
Update README
2024-11-14 08:45:28 -05:00
Mark Backman
27ff868e5a
Move CONTRIBUTING to top directory
2024-11-14 08:43:03 -05:00
Mark Backman
57ef525a8e
Update README
2024-11-14 08:43:03 -05:00
Aleix Conchillo Flaqué
d1db54d5fe
examples(playht): use a 2.0 engine
2024-11-13 17:19:23 +01:00
Aleix Conchillo Flaqué
4f88fc0eb8
services(tts): initialize language to the proper language code
2024-11-13 17:19:23 +01:00
Aleix Conchillo Flaqué
37d1f4c4e1
services(tts): some language to service language cleanup
2024-11-13 17:19:23 +01:00
Aleix Conchillo Flaqué
ef9e86d997
services(playht): make sure we only skip wav header no matter the size
2024-11-13 17:19:23 +01:00
Aleix Conchillo Flaqué
2d2ef5a417
services(playht): voice engine is Play3.0-mini
2024-11-13 17:19:23 +01:00
Aleix Conchillo Flaqué
c1fff00586
services(playht): fix language codes
2024-11-13 17:19:23 +01:00
Mark Backman
0af2196f50
Merge pull request #708 from pipecat-ai/mb/add-rime-ai
...
Add RimeTTSService
2024-11-12 18:29:53 -05:00
Mark Backman
cd42320788
Update changelog
2024-11-12 18:28:04 -05:00
Mark Backman
70fce52499
Merge pull request #710 from pipecat-ai/mb/update-readme-krisp
...
Update Krisp README instructions
2024-11-12 11:15:25 -05:00
Mark Backman
70b60c0593
Update Krisp README instructions
2024-11-12 10:26:12 -05:00
Jon Taylor
2d8aa03f31
Merge pull request #706 from pipecat-ai/jpt/modal-example
...
barebones modal.com deployment example
2024-11-12 11:41:00 +00:00
Kwindla Hultman Kramer
581ff26704
Merge pull request #707 from pipecat-ai/khk/clean-up
...
tiny PR to remove old comment lines
2024-11-11 21:14:16 -08:00
Kwindla Hultman Kramer
335178ff06
some gemini audio input examples
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
ee53535f41
gemini audio-in with no transcription
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
91ac40307e
small fix and more prompt examples
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
b6c2c1f730
anthropic natural conversation example using claude haiku
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
b56c789ae4
fixes for proposed judge pipeline
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
bd435d9e62
missing commit
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
55a81df84f
contributing to llm-as-judge phrase endpointing work
2024-11-11 21:04:50 -08:00
Kwindla Hultman Kramer
87434460f5
temp hacking
2024-11-11 21:04:50 -08:00
Mark Backman
958ec42e8d
Add Rime.ai TTS service
2024-11-11 21:58:09 -05:00
Jon Taylor
d1fff60d1d
barebones modal.com deployment example
2024-11-11 22:30:07 +00:00
Kwindla Hultman Kramer
1438e5654a
remove old comment
2024-11-10 16:08:10 -08:00
Aleix Conchillo Flaqué
1d4be0139a
Merge pull request #705 from pipecat-ai/aleix/prepare-0.0.48
...
update CHANGELOG for 0.0.48
2024-11-10 14:08:33 -08:00
Aleix Conchillo Flaqué
f58c3ee322
update CHANGELOG for 0.0.48
2024-11-10 23:01:03 +01:00
Aleix Conchillo Flaqué
379750df91
Merge pull request #704 from pipecat-ai/aleix/cartesia-tts-stopped-frame
...
services(cartesia): generated TTSStoppedFrame after no more audio
2024-11-10 05:17:36 -08:00
Aleix Conchillo Flaqué
d125a38737
services(cartesia): generated TTSStoppedFrame after no more audio
...
The TTSStoppedFrame should be generated when the TTS services stoped generating
audio not when the bot stops speaking.
2024-11-10 09:55:45 +01:00
Mark Backman
446bb0aeaf
Merge pull request #702 from pipecat-ai/mb/azure-websocket
...
Add an Azure TTS websocket service
2024-11-09 17:41:53 -05:00
Aleix Conchillo Flaqué
d839080834
Merge pull request #642 from pipecat-ai/aleix/input-queues-block-frames
...
introduce frame processor input queues block frames
2024-11-09 14:30:17 -08:00
Mark Backman
9b85d0642b
Add a changelog entry
2024-11-09 12:37:29 -05:00
Mark Backman
230b51a117
Add an Azure TTS websocket service
2024-11-09 12:37:29 -05:00
Mark Backman
3a965ca396
Merge pull request #701 from pipecat-ai/khk/anthropic-function-calling-fix
...
fixes for anthropic function calling
2024-11-09 06:39:34 -05:00
Kwindla Hultman Kramer
33fc5bf990
improved 20c-persistent-context-anthropic.py
2024-11-08 16:42:30 -08:00
Kwindla Hultman Kramer
a54ca08405
fixes for anthropic function calling
2024-11-08 16:33:02 -08:00
Filipi da Silva Fuchter
4379db43ed
Merge pull request #689 from pipecat-ai/filipi/krisp
...
Making pipecat work with Krisp
2024-11-08 16:22:52 -03:00
Filipi Fuchter
e915c676aa
Added support for Krisp audio filter
2024-11-08 16:18:10 -03:00
Mark Backman
e0a003afa1
Merge pull request #695 from pipecat-ai/mb/initialize-azure-lang
...
Initialize the speech_recognition_language for Azure TTS
2024-11-08 06:40:40 -05:00
James Hush
d5666727ce
feat: toggle looping with soundfile mixer ( #693 )
...
* feat: toggle looping with soundfile mixer
* Implement PR changes
2024-11-07 21:08:37 -08:00
Mark Backman
f6d7402530
Update changelog
2024-11-07 15:16:03 -05:00
Mark Backman
aefe190c9f
Initialize the speech_recognition_language for Azure TTS
2024-11-07 15:14:05 -05:00
Vanessa Pyne
29925a8f21
Merge pull request #551 from Allenmylath/patch-3
...
Frame types and short descriptionCreate Frames.md
2024-11-07 10:05:32 -06:00
Aleix Conchillo Flaqué
beb3271168
services(tts): make sure word timestamp is reset properly
2024-11-06 18:54:12 -08:00
Aleix Conchillo Flaqué
b959ac6e1e
Merge pull request #694 from pipecat-ai/aleix/daily-add-on-transcription-message
...
transports(daily): call on_transcription_message event handler
2024-11-06 15:21:17 -08:00
Aleix Conchillo Flaqué
17f4286942
transports(daily): call on_transcription_message event handler
2024-11-06 15:10:58 -08:00
Aleix Conchillo Flaqué
ce89bbb16e
tts(elevenlabs): support pausing and resuming frames while speaking
2024-11-06 14:38:33 -08:00
Aleix Conchillo Flaqué
865768039b
processors: remove block_on_frames and add pause_processing_frames() instead
2024-11-06 14:20:25 -08:00
Aleix Conchillo Flaqué
7071482583
try to use queue_frame() instead of process_frame()
2024-11-06 14:18:21 -08:00
Aleix Conchillo Flaqué
5353d13151
update CHANGELOG
2024-11-06 13:16:58 -08:00
Aleix Conchillo Flaqué
a9e565f355
processors: fix input queue interruptions
2024-11-06 13:12:24 -08:00
Aleix Conchillo Flaqué
b6f0c16591
examples: restore EndFrame() on 01 and 02 foundational
2024-11-06 13:05:03 -08:00
Aleix Conchillo Flaqué
49005d02f5
services(tts): use TTSSpeakFrame in say() method
2024-11-06 13:05:03 -08:00
Aleix Conchillo Flaqué
6d8b885071
transports(base_output): push bot started/stopped frames downstream
2024-11-06 13:04:37 -08:00
Aleix Conchillo Flaqué
2eccb33e73
processors: allow passing a callback when queued frame is processed
2024-11-06 13:04:37 -08:00
Aleix Conchillo Flaqué
22ca4c5a02
processors: cancel input task and empty queue with interruptions
2024-11-06 13:04:37 -08:00
Aleix Conchillo Flaqué
84f26ac1ca
processors: introduce input queues
...
Frame processors can now decide if they should continue processing frames or
not, and if so also decide when to continue processing frames. For example,
asynchronous TTS services will stop processing frames until they have generated
all the audio for an LLM response.
2024-11-06 12:13:49 -08:00
Aleix Conchillo Flaqué
74937411e6
Merge pull request #691 from pipecat-ai/aleix/rtvi-manual-bot-ready
...
rtvi: bot-ready message needs to be sent manual
2024-11-06 10:53:25 -08:00
Aleix Conchillo Flaqué
8aab068ffd
rtvi: bot-ready message needs to be sent manual
2024-11-05 10:52:54 -08:00
Aleix Conchillo Flaqué
bd50201ce4
transports(daily): just make it clear we subscribe to camera
2024-11-04 17:32:46 -08:00
Aleix Conchillo Flaqué
6082da284e
Merge pull request #611 from pipecat-ai/aleix/audio-filters
...
introduce audio filters
2024-11-04 16:34:47 -08:00
Aleix Conchillo Flaqué
358c458265
transports(base_input): handle filter contorl frames
2024-11-04 16:19:52 -08:00
Aleix Conchillo Flaqué
807dbbe326
audio(noisereduce): allow enabling/disabling filter
2024-11-04 16:13:29 -08:00
Aleix Conchillo Flaqué
3c116b291d
audio(mixers): some cosmetics
2024-11-04 15:37:08 -08:00
Aleix Conchillo Flaqué
0dd413ee90
audio(filters): add noisereduce filter
2024-11-04 15:37:08 -08:00
Aleix Conchillo Flaqué
abc8ede3d7
introduce audio filters
2024-11-04 15:37:08 -08:00
Aleix Conchillo Flaqué
126324ca1b
Merge pull request #687 from pipecat-ai/aleix/transport-audio-mixers
...
introduce transport audio mixers
2024-11-04 13:14:36 -08:00
Aleix Conchillo Flaqué
602915ae18
examples(websocket-server): allow interruptions
2024-11-04 13:05:02 -08:00
Aleix Conchillo Flaqué
0ac9e2dd3f
transports(network): synchronize with time before sending data
2024-11-04 13:04:18 -08:00
Aleix Conchillo Flaqué
a9ef5ca95d
examples: add bot background sound example
2024-11-03 11:13:02 -08:00
Aleix Conchillo Flaqué
81c476dd4c
introduce output transport audio mixers
2024-11-03 11:13:02 -08:00
Kwindla Hultman Kramer
151242d3a0
Merge pull request #666 from pipecat-ai/khk/realtime-pipecat-vad
...
Support using Pipecat turn detection instead of OpenAI Realtime API turn detection
2024-11-02 08:36:31 -07:00
Kwindla Hultman Kramer
93c6e5098c
added comment explaining config of TurnDetection
2024-11-02 08:24:54 -07:00
Aleix Conchillo Flaqué
4455b2a428
rtvi: create queues before tasks
2024-11-01 23:06:50 -07:00
Aleix Conchillo Flaqué
94062592ef
base_output: generate smaller audio frames of the same incoming type
2024-11-01 23:06:50 -07:00
Aleix Conchillo Flaqué
d2401a76c8
base_output: only generate bot speaking with TTS audio frames
2024-11-01 23:06:50 -07:00
Aleix Conchillo Flaqué
e2b1b56e86
examples: don't require room token if using an STT
2024-11-01 23:06:50 -07:00
Mark Backman
84bd767312
Merge pull request #685 from pipecat-ai/mb/add-recording-events
...
Add recording events and callbacks
2024-11-01 12:02:46 -04:00
Mark Backman
802c29e9e1
Add recording events and callbacks
2024-11-01 10:20:00 -04:00
Aleix Conchillo Flaqué
f83381860c
Merge pull request #677 from pipecat-ai/aleix/add-notifier-and-notifier-filters
...
add notifiers and more frame filters
2024-10-31 15:55:07 -07:00
Aleix Conchillo Flaqué
4dad1bfe49
examples: add foundational/22-natural-conversation.py
2024-10-31 12:10:33 -07:00
marcus-daily
9ee8896b64
Removing unnecessary ruff arguments from README
2024-10-31 18:02:29 +00:00
marcus-daily
5f7a2f66d4
Add .idea to .gitignore
2024-10-31 18:02:29 +00:00
marcus-daily
76e5f1e847
Remove unnecessary ruff params in CI
2024-10-31 15:07:28 +00:00
marcus-daily
6975340d6c
Set Ruff config for the project
2024-10-31 15:07:28 +00:00
marcus-daily
0f4cf56418
Load dotenv in simple chatbot server ( fixes #415 )
2024-10-31 12:08:30 +00:00
Aleix Conchillo Flaqué
018e51e8a3
add notifiers and more frame filters
2024-10-30 16:36:17 -07:00
Vanessa Pyne
b050143952
Merge pull request #676 from RonakAgarwalVani/fix/chunk-choices-delta-none
...
Fix uncaught exception when accessing 'tool_calls' in NoneType delta in response handling
2024-10-30 14:44:32 -05:00
Mark Backman
98ea1f0791
Merge pull request #675 from pipecat-ai/mb/playht-add-request-id
...
Add a request_id to each TTS sequence
2024-10-30 13:56:15 -04:00
Mark Backman
8272c35527
Use a request_id in TTS commands for the PlayHT websocket service
2024-10-30 13:54:18 -04:00
Mark Backman
e973e82e05
Merge pull request #672 from pipecat-ai/mb/fix-playht
...
Fix PlayHT TTFB metrics
2024-10-30 13:53:02 -04:00
RonakAgarwalVani
d1396bf618
Update openai.py
2024-10-30 14:26:49 +05:30
Vanessa Pyne
8186e423de
Merge pull request #637 from pipecat-ai/vp-issue-template
...
docs: add ISSUE_TEMPLATE.md
2024-10-29 15:08:42 -05:00
vipyne
3010addb8b
docs: add CONTRIBUTING.md
2024-10-29 15:03:07 -05:00
vipyne
029e0d391e
docs: add ISSUE_TEMPLATE.md
2024-10-29 15:03:07 -05:00
Vanessa Pyne
bf31223577
Merge pull request #671 from pipecat-ai/vp-issue-635
...
docs: small fix
2024-10-29 14:34:13 -05:00
vipyne
42cc79154f
docs: small fix
2024-10-29 14:33:57 -05:00
Mark Backman
05b857006a
Update changelog
2024-10-28 20:56:29 -04:00
Mark Backman
2e57d21b89
Fix ttfb metrics
2024-10-28 20:27:24 -04:00
Aleix Conchillo Flaqué
fa05ec46be
Merge pull request #667 from pipecat-ai/aleix/base-output-bot-speaking-detection
...
transports(base_output): use audio frames for bot speaking detection
2024-10-28 10:54:54 -07:00
Aleix Conchillo Flaqué
e3ce619284
transports(base_output): use audio frames for bot speaking detection
2024-10-28 10:07:37 -07:00
Vanessa Pyne
fb512dcd74
Merge pull request #630 from MoofSoup/update-readme
...
docs: simplify readme
2024-10-28 10:26:30 -05:00
Aleix Conchillo Flaqué
ca15d97383
Merge pull request #662 from pipecat-ai/aleix/daily-transport-async-functions
...
transports(daily): make functions async
2024-10-25 16:14:06 -07:00
Aleix Conchillo Flaqué
b32448e967
transports(daily): make functions async
2024-10-25 15:01:52 -07:00
Aleix Conchillo Flaqué
7e30da6183
Merge pull request #661 from pipecat-ai/aleix/allow-updating-subscritption-before
...
transports(daily): allow updating subscriptions before join
2024-10-25 15:00:34 -07:00
Aleix Conchillo Flaqué
a6dd2600d2
examples(tavus): await update_subscriptions
2024-10-25 14:56:56 -07:00
Aleix Conchillo Flaqué
b905b57dfc
transports(daily): allow updating subscriptions before join
2024-10-25 14:46:17 -07:00
Kwindla Hultman Kramer
e1a7edfb58
make it possible to use Pipecat turn detection instead of OpenAI turn detection
2024-10-25 15:59:48 -05:00
Aleix Conchillo Flaqué
1b30b1fc23
Merge pull request #665 from pipecat-ai/aleix/fix-bot-started-stopped-speaking
...
transports(base_output): fix constant bot started/stopped speaking fr…
2024-10-25 13:00:38 -07:00
Aleix Conchillo Flaqué
55026898f6
transports(base_output): use vad stop secs for bot stopped speaking
2024-10-25 12:59:15 -07:00
Aleix Conchillo Flaqué
4283557894
audio(vad): expose params property
2024-10-25 12:59:15 -07:00
Aleix Conchillo Flaqué
5ab00e01aa
transports(base_output): fix constant bot started/stopped speaking frames
2024-10-25 12:10:24 -07:00
Aleix Conchillo Flaqué
fcfc729e83
Merge pull request #664 from pipecat-ai/aleix/fix-aws-stuttering
...
services(aws): read stream and resample in a thread
2024-10-25 11:49:28 -07:00
Aleix Conchillo Flaqué
4eacb34fd8
services(aws): read stream and resample in a thread
2024-10-25 11:22:28 -07:00
Aleix Conchillo Flaqué
3a8aacccf7
Merge pull request #663 from pipecat-ai/aleix/audio-resampling-with-resampy
...
audio: use resamply for audio resampling
2024-10-25 10:16:20 -07:00
roey
54c0bf0c70
Adding TavusVideoService layer ( #617 )
...
Co-authored-by: roey <159067767+roey-tavus@users.noreply.github.com >
Co-authored-by: Mert Gerdan <mert@tavus.io >
Co-authored-by: Aleix Conchillo Flaqué <aleix@daily.co >
2024-10-25 09:46:25 -07:00
Aleix Conchillo Flaqué
778b05a252
audio: use resamply for audio resampling
2024-10-25 09:22:22 -07:00
Mark Backman
f16a416c2b
Merge pull request #660 from pipecat-ai/mb/add-gemini-inputs
...
Add input params to Google Gemini
2024-10-24 20:58:19 -04:00
Aleix Conchillo Flaqué
1be63bccb8
Merge pull request #647 from pipecat-ai/aleix/daily-transport-only-transcribe-users
...
transport(daily): only transcribe users
2024-10-24 17:40:34 -07:00
Mark Backman
37820ac0df
Add input params to Google Gemini
2024-10-24 20:12:41 -04:00
Aleix Conchillo Flaqué
8ea80d43f4
transports(daily): only transcribe user audio
2024-10-24 17:06:43 -07:00
Aleix Conchillo Flaqué
e117d70a00
update to daily-python 0.12.0
2024-10-24 16:49:19 -07:00
Aleix Conchillo Flaqué
2ba753272a
Merge pull request #658 from pipecat-ai/aleix/default-to-24000-sample-rate
...
update TTS and transport output sample rate to 24000
2024-10-24 16:48:41 -07:00
Aleix Conchillo Flaqué
60c8c2f6e9
examples(15a): use daily transcription instead of local whisper
2024-10-24 16:47:41 -07:00
Aleix Conchillo Flaqué
cfb48200c2
services(azure): support sample rates
2024-10-24 16:47:35 -07:00
Aleix Conchillo Flaqué
6d317c6e8e
audio: don't resample if same sample rate
2024-10-24 16:47:35 -07:00
Aleix Conchillo Flaqué
158d52856f
transports(livekit): fix VADAnalyzer import
2024-10-24 16:47:35 -07:00
Aleix Conchillo Flaqué
92a69e404f
update TTS and transport output sample rate to 24000
2024-10-24 16:47:35 -07:00
Aleix Conchillo Flaqué
d24c6185d8
Merge pull request #654 from pipecat-ai/aleix/daily-allow-completion-futures
...
transport(daily): allow completion futures
2024-10-24 14:28:53 -07:00
Mark Backman
1fd21578a6
Merge pull request #657 from pipecat-ai/mb/add-elevenlabs-output-format-type
...
Add ElevenLabs output format type
2024-10-24 17:07:04 -04:00
Mark Backman
700db87127
Merge pull request #656 from pipecat-ai/mb/add-gemini-metrics
...
Add Gemini token usage metrics
2024-10-24 17:04:56 -04:00
Mark Backman
6f1310569c
Add ElevenLabs output format type
2024-10-24 17:03:45 -04:00
Aleix Conchillo Flaqué
14cedb0be8
Merge pull request #655 from pipecat-ai/aleix/fix-together-params
...
services(together): fix together AI InputParams
2024-10-24 13:51:38 -07:00
Mark Backman
fae97f9051
Add Gemini token usage metrics
2024-10-24 16:37:21 -04:00
Aleix Conchillo Flaqué
d930a46e64
services(together): fix together AI InputParams
2024-10-24 13:08:35 -07:00
Aleix Conchillo Flaqué
2e6b5d1843
transports(daily): fix aiohttp timeout
2024-10-24 11:44:30 -07:00
Aleix Conchillo Flaqué
88362db034
transports(daily): no more need for an output message queue
2024-10-24 11:44:30 -07:00
Aleix Conchillo Flaqué
f7f0c44c32
transports(daily): don't block event handlers
2024-10-24 11:44:30 -07:00
Mark Backman
33553b71d4
Merge pull request #653 from pipecat-ai/mb/align-tts-constructors
...
Align TTSService constructors
2024-10-24 13:52:43 -04:00
Mark Backman
be8ca505cd
Merge pull request #652 from pipecat-ai/khk/more-gemini
...
Gemini new context manager and rewrite to use google data structures internally
2024-10-24 13:47:38 -04:00
Mark Backman
e957cce422
Align TTSService constructors
2024-10-24 13:42:06 -04:00
Mark Backman
418a13a4ec
Merge pull request #650 from pipecat-ai/mb/assembly-fix
...
AssemblyAI: don't disconnect on language change
2024-10-24 11:26:56 -04:00
Mark Backman
fc445c0a1f
Merge pull request #649 from pipecat-ai/mb/open-ai-max-tokens
...
Add max_tokens and max_completion_tokens inputs for OpenAI
2024-10-24 11:26:44 -04:00
Mark Backman
f0c65468ed
AssemblyAI: don't disconnect on language change
2024-10-24 08:30:48 -04:00
Mark Backman
ce6a2bdcf7
Add max tokens inputs to OpenAI
2024-10-24 07:03:45 -04:00
Mark Backman
673542e235
Merge pull request #646 from pipecat-ai/mb/grok-function-calling
...
Support function calling for Grok
2024-10-23 21:56:38 -04:00
Kwindla Hultman Kramer
e032b0b70a
gemini context aggregators
2024-10-23 18:44:09 -07:00
Mark Backman
e39f7e965b
Support function calling for Grok
2024-10-23 17:22:26 -04:00
Mattie Ruth
d26751e968
add missing PipelineParams to enable the metrics ( #645 )
2024-10-23 16:46:46 -04:00
Aleix Conchillo Flaqué
e0ca4a9c23
Merge pull request #643 from pipecat-ai/aleix/daily-update-subscriptions
...
transports(daily): add update_subscriptions()
2024-10-22 17:07:07 -07:00
Aleix Conchillo Flaqué
801e52c095
transports(daily): add update_subscriptions()
2024-10-22 15:02:55 -07:00
Aleix Conchillo Flaqué
a46eaa838b
Merge pull request #641 from pipecat-ai/aleix/prepare-0.0.47
...
prepare 0.0.47
2024-10-22 10:30:42 -07:00
Aleix Conchillo Flaqué
7c432499db
update CHANGELOG for 0.0.47
2024-10-22 10:02:50 -07:00
Aleix Conchillo Flaqué
8d75fcc9f0
use warnings package to report deprecated code
2024-10-22 10:02:21 -07:00
Aleix Conchillo Flaqué
61d73f81ae
Merge pull request #639 from pipecat-ai/aleix/daily-transcription-model
...
transport(daily): use "nova-2-general" for transcription
2024-10-22 09:40:43 -07:00
Aleix Conchillo Flaqué
951255def9
transport(daily): use "nova-2-general" for transcription
2024-10-22 09:40:03 -07:00
Moof Soup
bf5a7c3562
docs: Clarify README example and token usage
...
clarified readme example
2024-10-21 19:54:34 -07:00
Mark Backman
e556f34094
Merge pull request #638 from pipecat-ai/mb/fix-silero-vad-import
...
Fix Silero VAD import issue
2024-10-21 20:48:06 -04:00
Mark Backman
ccc3691620
Fix Silero VAD import issue
2024-10-21 20:39:20 -04:00
Vanessa Pyne
5321affda7
Merge pull request #588 from Allenmylath/patch-11
...
Update README.md
2024-10-21 11:20:05 -05:00
Mark Backman
e5ad8dc67b
Merge pull request #627 from pipecat-ai/mb/upgrade-gladia-to-v2-api
...
Update GladiaSTTService to use the Gladia V2 API
2024-10-21 12:01:20 -04:00
Mark Backman
46927805bc
Update GladiaSTTService to use the Gladia V2 API
2024-10-21 07:10:38 -04:00
Aleix Conchillo Flaqué
b6b1ef0a40
Merge pull request #589 from Allenmylath/patch-12
...
Update Dockerfile
2024-10-20 10:59:43 -07:00
Mark Backman
e62f762382
Merge pull request #625 from pipecat-ai/mb/add-assemblyai-stt
...
Add support for AssemblyAI STT
2024-10-20 13:59:33 -04:00
Aleix Conchillo Flaqué
dbfda14342
Merge pull request #587 from Allenmylath/patch-9
...
Update env.example
2024-10-20 10:58:50 -07:00
Aleix Conchillo Flaqué
fee85418cd
Merge pull request #620 from gregschwartz/main
...
Start agent/call/bot at localhost root
2024-10-20 10:14:10 -07:00
Mark Backman
015faa3dbd
Update CHANGELOG and README
2024-10-20 08:57:57 -04:00
Mark Backman
1dbf4ff27d
Add AssemblyAI STT service
2024-10-20 08:57:57 -04:00
Aleix Conchillo Flaqué
4f1b2dce9b
Merge pull request #624 from pvilchez/fix_enable_usage_metrics
...
Fixing `enable_usage_metrics` setting.
2024-10-20 01:00:12 -07:00
Paul Vilchez
5640bd9447
Fixing a config mismatch which caused usage stats to only report when enable_metrics was true.
2024-10-20 03:33:13 -04:00
Aleix Conchillo Flaqué
ee5ae0d631
Merge pull request #621 from pipecat-ai/aleix/prepare-0.0.46
...
update CHANGELOG for 0.0.46
2024-10-19 18:26:05 -07:00
Aleix Conchillo Flaqué
4b8a4b86fe
update CHANGELOG for 0.0.46
2024-10-19 18:25:29 -07:00
Aleix Conchillo Flaqué
3556c9ce0f
Merge pull request #618 from pipecat-ai/aleix/examples-switch-to-llm-context
...
examples: use OpenAILLMContext in all the examples
2024-10-19 18:24:39 -07:00
Aleix Conchillo Flaqué
f971dbe027
examples(audio-recording): record audio into a file
2024-10-19 18:24:00 -07:00
Aleix Conchillo Flaqué
3815e9dec3
examples: fix dialin-chatbot python arguments
2024-10-19 18:24:00 -07:00
Aleix Conchillo Flaqué
320f622255
examples: upgrade storytelling frontend packages
2024-10-19 18:24:00 -07:00
Aleix Conchillo Flaqué
be4bdabdf4
examples: use OpenAILLMContext in all the examples
2024-10-19 18:24:00 -07:00
Greg Schwartz
1fa52b62aa
Put start agent/call at localhost root. Before you had to read in the docs to go to /start, or /start_call or /start_bot. Which isn't mentioned in the console output, and is inconsistent, adding friction to learning the codebase
2024-10-19 16:18:43 -07:00
Aleix Conchillo Flaqué
4f66e5d55f
Merge pull request #619 from pipecat-ai/aleix/split-vad
...
move SileroVAD processor to processors package
2024-10-18 23:30:07 -07:00
Aleix Conchillo Flaqué
3502509d3e
move SileroVAD processor to processors package
2024-10-18 23:28:29 -07:00
Aleix Conchillo Flaqué
d71ea1c0e0
Merge pull request #615 from DamienDeepgram/patch-1
...
Update default Deepgram model
2024-10-18 22:47:30 -07:00
Kwindla Hultman Kramer
07712cdb16
gemini function calling and partial implementation of standard context stuff
2024-10-18 17:14:57 -07:00
DamienDeepgram
13f232bafc
Update default model
2024-10-18 15:33:50 -07:00
Aleix Conchillo Flaqué
9dd3354b89
Merge pull request #613 from pipecat-ai/aleix/examples-endframe
...
examples: use EndFrame() when the participant leaves
2024-10-18 11:18:26 -07:00
Aleix Conchillo Flaqué
8c006c24a3
README: update example
2024-10-18 11:18:03 -07:00
Aleix Conchillo Flaqué
4550545528
examples: use EndFrame() when the participant leaves
2024-10-18 11:18:03 -07:00
Aleix Conchillo Flaqué
020f371ecb
pyproject: update onnxruntime to support python 3.12
2024-10-18 10:20:28 -07:00
Aleix Conchillo Flaqué
f3c0767c81
Merge pull request #610 from pipecat-ai/aleix/stt-push-audio
...
allow STT services to passthrough audio frames
2024-10-17 21:02:30 -07:00
Aleix Conchillo Flaqué
c9318ecd5c
examples: minor fixes
2024-10-17 16:15:09 -07:00
Aleix Conchillo Flaqué
12eb9437c1
services(stt): allow STT service to passthrough audio
2024-10-17 16:15:09 -07:00
Aleix Conchillo Flaqué
71c8c0dcdb
Merge pull request #609 from pipecat-ai/aleix/livekit-force-specifying-vad
...
livekit force specifying vad
2024-10-17 14:08:55 -07:00
Aleix Conchillo Flaqué
8108423742
transport(livekit): force specifying a vad analyzer
...
Don't default to SileroVADAnalyzer(). Also, resample to input sample rate.
2024-10-17 14:06:43 -07:00
Aleix Conchillo Flaqué
d67e08be4d
Merge pull request #608 from pipecat-ai/aleix/add-audio-utils-and-resample
...
add audio utils and resample
2024-10-17 14:00:49 -07:00
Aleix Conchillo Flaqué
d3f4ac61b6
move utils.audio to audio.utils and add resample_audio()
2024-10-17 13:59:32 -07:00
Aleix Conchillo Flaqué
c6d28bb0db
Merge pull request #607 from pipecat-ai/aleix/pipecat-vad-deprecation
...
move vad package to audio.vad
2024-10-17 13:51:20 -07:00
Aleix Conchillo Flaqué
2a37b2459a
move vad package to audio.vad
2024-10-17 13:49:16 -07:00
Mark Backman
d1000f2fe4
Merge pull request #606 from pipecat-ai/mb/add-playht-options
...
PlayHT: Add websocket TTS service; rename existing service to PlayHTHttpTTSService, upgrade client, add input params
2024-10-17 16:46:59 -04:00
Mark Backman
e2d7af4b62
Update changelog
2024-10-17 16:16:29 -04:00
Mark Backman
da3810f1a2
Add websocket support for PlayHT
2024-10-17 15:41:33 -04:00
Aleix Conchillo Flaqué
eb21597d1a
Merge pull request #603 from pipecat-ai/aleix/silero-vad-processor-fixes
...
vad: add support for interruption to SileroVAD processor
2024-10-17 10:48:39 -07:00
Aleix Conchillo Flaqué
e3eea0c02f
vad: add support for interruption to SileroVAD processor
2024-10-17 10:48:25 -07:00
Mark Backman
45606e177c
Add input options to PlayHT, upgrade to latest PlayHT model
2024-10-17 11:56:12 -04:00
Aleix Conchillo Flaqué
197d7b3e2b
Merge pull request #604 from natestraub/patch-1
...
services(livekit) - Stop Sending EndFrame when Participant Disconnects
2024-10-17 08:48:57 -07:00
Nathan Straub
d4ec6827ce
services(livekit) - Stop Sending EndFrame when Participant Disconnects
...
How It Works Now:
A participant disconnecting triggers and EndFrame, invoking stop() on the input and output transports and causing the LiveKit room to disconnect.
Proposal:
Match the daily implementation, and just trigger the callbacks in the LiveKitTransport. Leave it up to the implementor to decide whether to send EndFrames when this happens.
2024-10-16 23:53:31 -07:00
Aleix Conchillo Flaqué
e31d1152db
Merge pull request #601 from pipecat-ai/aleix/openai-realtime-misc
...
services(openai): rename OpenAILLMServiceRealtimeBeta to OpenAIRealti…
2024-10-16 16:20:18 -07:00
Mark Backman
bb48a81103
Merge pull request #602 from pipecat-ai/mb/adjust-logger-levels
...
Adjust log levels for log messages
2024-10-16 18:00:35 -04:00
Mark Backman
55f1ae2564
Adjust log levels for log messages
2024-10-16 17:30:47 -04:00
Kwindla Hultman Kramer
280691b1b3
explanatory comment in 19-openai-realtime-beta.py
2024-10-16 14:27:48 -07:00
Kwindla Hultman Kramer
93c9e219ce
fix for message handling bug on initialization
2024-10-16 12:40:20 -07:00
Aleix Conchillo Flaqué
edd44cc181
services(openai): rename OpenAILLMServiceRealtimeBeta to OpenAIRealtimeBetaLLMService
2024-10-16 10:20:19 -07:00
Aleix Conchillo Flaqué
4075b19f7c
Merge pull request #600 from pipecat-ai/aleix/prepare-0.0.45
...
update CHANGELOG to 0.0.45
2024-10-16 09:18:37 -07:00
Aleix Conchillo Flaqué
bb14918a33
update CHANGELOG to 0.0.45
2024-10-16 09:17:33 -07:00
Mark Backman
2aee8a12f8
Merge pull request #599 from pipecat-ai/mb/remove-metrics-from-transport
...
Move metrics from transport to rtvi
2024-10-16 11:39:58 -04:00
Mark Backman
5760fadb44
Update changelog
2024-10-16 11:33:56 -04:00
Mark Backman
af5a7e9092
Move metrics from transport to rtvi
2024-10-16 11:33:56 -04:00
Mark Backman
8d9a7486d1
Merge pull request #598 from pipecat-ai/mb/add-daily-metrics-message-frame
...
Comply with RTVI format for sending metrics data via Daily transport
2024-10-16 10:14:44 -04:00
Mark Backman
00d0f9ae48
Comply with RTVI format for sending metrics data
2024-10-16 09:00:38 -04:00
Aleix Conchillo Flaqué
d255b7d1b2
Merge pull request #596 from pipecat-ai/aleix/prepare-0.0.44
...
prepare for pipecat 0.0.44
2024-10-15 18:13:07 -07:00
Aleix Conchillo Flaqué
4eb2c95b63
update CHANGELOG for 0.0.44
2024-10-15 17:51:01 -07:00
Aleix Conchillo Flaqué
3910aeb4de
transports(daily): don't send messages if not joined
2024-10-15 17:51:01 -07:00
Aleix Conchillo Flaqué
713dcb7a4d
transports(daily): cancel messages task when canceling
2024-10-15 17:51:01 -07:00
Aleix Conchillo Flaqué
04da51c7d8
transport(base_output): push EndFrame downstream at the right time
2024-10-15 17:51:01 -07:00
Aleix Conchillo Flaqué
e52d18e42d
processors(audiobuffer): make functions public
2024-10-15 15:31:59 -07:00
Aleix Conchillo Flaqué
0c4a513ca2
Merge pull request #595 from pipecat-ai/aleix/bot-speaking-system-frames
...
bot speaking system frames
2024-10-15 15:30:11 -07:00
Aleix Conchillo Flaqué
4a71eacac3
rtvi: reset bot transcription with interruptions
2024-10-15 14:58:21 -07:00
Aleix Conchillo Flaqué
f0d89e57ad
frames: some frames need to be SystemFrames
...
We want to process user and bot started/stopped speaking frames as fast as
possible. If we queue them they might be processed too late.
2024-10-15 14:37:56 -07:00
Mark Backman
79b52d4301
Merge pull request #594 from pipecat-ai/mb/more-text-filter-massaging
...
More edge case handling for text filtering
2024-10-15 14:51:43 -04:00
Mark Backman
bb00dbefbc
More edge case handling for text filtering
2024-10-15 14:08:27 -04:00
Aleix Conchillo Flaqué
0c250c0603
Merge pull request #583 from pipecat-ai/aleix/add-pts-to-llm-full-response-end-frame
...
add pts to llm full response end frame
2024-10-15 10:39:50 -07:00
Aleix Conchillo Flaqué
7bbaf4dfe9
rtvi: merge TTS/TTSText and LLM/LLMText processors
2024-10-15 10:24:43 -07:00
Aleix Conchillo Flaqué
3a3bf3fe34
services(cartesia): schedule TTSStoppedFrame after text
2024-10-15 10:06:28 -07:00
Aleix Conchillo Flaqué
616aa54f75
ruff formatting
2024-10-15 10:06:28 -07:00
Aleix Conchillo Flaqué
164f06415c
servcies(cartesia): no need to send LLMFullResponseEndFrame
...
Interruptions are already handled by context aggregators.
2024-10-15 10:06:28 -07:00
Aleix Conchillo Flaqué
51bc4839d1
transport(base_output): simplify code
2024-10-15 10:06:28 -07:00
Aleix Conchillo Flaqué
6d778e0491
services: add pts to LLMFullResponseEndFrame in WordTTSService
2024-10-15 10:06:28 -07:00
Aleix Conchillo Flaqué
fc4fa2faaa
Merge pull request #593 from pipecat-ai/aleix/bot-transcription-processor
...
rtvi: add RTVIBotTranscriptionProcessor to send `bot-transcription`
2024-10-15 10:03:39 -07:00
Aleix Conchillo Flaqué
90b7f65545
rtvi: add RTVIBotTranscriptionProcessor to send bot-transcription
2024-10-15 10:03:20 -07:00
Kwindla Hultman Kramer
f7b7f0d680
Merge pull request #541 from pipecat-ai/khk/openai-realtime-beta
...
openai realtime beta
2024-10-14 21:02:06 -07:00
Kwindla Hultman Kramer
5431c44e51
remove two debug lines
2024-10-14 21:01:20 -07:00
Kwindla Hultman Kramer
40b3e50815
fix system, consecutive same role, and empty message parsing for anthropic
2024-10-14 20:56:42 -07:00
allenmylath
ec98a13a08
Update Dockerfile
...
utils and assets not used in this example hence removed
2024-10-15 08:18:16 +05:30
allenmylath
b999b76f70
Update README.md
...
readme description still shows simple-chatbot definition hence made more accurate description
2024-10-15 08:14:43 +05:30
allenmylath
b64dbe7bb4
Update env.example
...
canonical api url is also used from env.
2024-10-15 08:10:07 +05:30
Kwindla Hultman Kramer
2f6232fac9
fix for initial-messages with single message, and hoisting system message into instructions
2024-10-14 18:14:35 -07:00
Aleix Conchillo Flaqué
b4f2525c76
Merge pull request #585 from pipecat-ai/aleix/daily-urgent-transport-message-hang
...
transports(daily): send transport messages in a task
2024-10-14 16:31:10 -07:00
Aleix Conchillo Flaqué
8e956a4e88
Merge pull request #584 from pipecat-ai/aleix/urgent-bot-tts-audio
...
rtvi: bot-tts-audio messages should also be urgent
2024-10-14 16:25:35 -07:00
Aleix Conchillo Flaqué
7b9712daad
transports(daily): send transport messages in a task
...
We queue transport messages and send them in a task to avoid potential hangs by
sending urgent transport messages from a transport event handler.
2024-10-14 16:19:53 -07:00
Kwindla Hultman Kramer
d4269acd67
user started/stopped speaking frames and interruption frames
2024-10-14 16:07:04 -07:00
Kwindla Hultman Kramer
d2ae82fb38
added back in missing LLMFullResponseStartFrame and LLMFullResponseEndFrame
2024-10-14 15:18:50 -07:00
Lewis Wolfgang
270949e6cd
Merge pull request #582 from pipecat-ai/lewis/update_readme_aboutsilerofirstrun
...
Minor README update about Silero VAD.
2024-10-14 16:26:28 -04:00
Aleix Conchillo Flaqué
cfada94c13
rtvi: bot-tts-audio messages should also be urgent
2024-10-14 12:46:11 -07:00
Lewis Wolfgang
68fd6f7c44
Minor README update about Silero VAD.
...
We no longer download the model during first run - it's part of the repo.
2024-10-14 13:11:16 -04:00
Mark Backman
96bfcc3dca
Merge pull request #571 from pipecat-ai/mb/add-code-filtering
...
Add code and table filtering option to MarkdownTextFilter
2024-10-14 12:54:16 -04:00
Mark Backman
b0890b1f75
Code review fixes
2024-10-14 12:52:16 -04:00
Aleix Conchillo Flaqué
802b3e42c4
Merge pull request #579 from Allenmylath/patch-16
...
Update Dockerfile
2024-10-14 08:58:02 -07:00
Aleix Conchillo Flaqué
bd134839ff
Merge pull request #578 from Allenmylath/patch-15
...
Create Dockerfile
2024-10-14 08:57:34 -07:00
Aleix Conchillo Flaqué
428ce63e17
Merge pull request #575 from Allenmylath/patch-12
...
Update README.md
2024-10-14 08:55:12 -07:00
Aleix Conchillo Flaqué
46d6cde383
Merge pull request #574 from Allenmylath/patch-11
...
Update requirements.txt
2024-10-14 08:54:44 -07:00
allenmylath
6de82b3c11
Create .env.example ( #562 )
...
* Create .env.example
.env.example file with required env variables not added hence adding
* Rename .env.example to env.example
file name corrected as directed
2024-10-14 08:52:46 -07:00
Mark Backman
ec0bc7a057
A few bug fixes
2024-10-14 09:44:20 -04:00
allenmylath
c62156a4c3
Update Dockerfile
...
assets and utils files not found hence removed
2024-10-14 12:00:29 +05:30
allenmylath
e8618a07d0
Create Dockerfile
...
there is Dockerfile in other examples. this docker file assumes that there is a .env file(i added env.example in another pull request)
2024-10-14 11:49:35 +05:30
allenmylath
0ba99514a9
Update README.md
...
env.example added hence addying copy command will be necessary
2024-10-14 11:22:56 +05:30
allenmylath
837c8dad27
Update requirements.txt
...
whisper not used but deepgram used hence changed
2024-10-14 11:20:12 +05:30
allenmylath
0e69625a01
Rename frames.md to frame.md
...
edited again to frame.md
2024-10-14 10:07:47 +05:30
allenmylath
4e0823fced
Rename Frames.md to frames.md
...
file name changed as requested
2024-10-14 10:05:26 +05:30
Kwindla Hultman Kramer
6f2a464451
conversation save/load for openai, openai-realtime, and anthropic
2024-10-13 18:12:03 -07:00
Kwindla Hultman Kramer
ac4c5ab369
response content item truncation when interrupted
2024-10-13 14:38:04 -07:00
Kwindla Hultman Kramer
9e95419301
much cleanup
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
f390ec9608
temp commit; debugging
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
ce8a83efba
tools frame support and wip message resetting/loading
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
e5a2bf9564
context management improvements
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
7838018686
fix default response properties getting appended to ResponseCreateEvent
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
31916ed9fd
turn on/off openai vad
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
3a2fbc2b19
send user started/stopped speaking event from openai realtime events
...
send user started/stopped speaking event from openai realtime events
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
43520b44da
add 'failed' case to Response event object
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
ab4a8d791a
RTVI processors should use TextFrame not TextFrame and all subclasses
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
40dc546b81
function call fix and user transcription frames
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
5426891feb
added input audio pause setting. no frame to update that state, yet.
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
1c5ccd3406
fixes for settings updates, context updates, and response creation
2024-10-12 21:58:11 -07:00
Mark Backman
3a745bfa3f
Handle self._context of None
2024-10-12 21:58:11 -07:00
Mark Backman
ac4e39991e
Update ai_services for OpenAI Realtime param inputs
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
c870832da6
types seem complete; some ws error handling
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
e782016c57
renamed a file
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
00badaf98e
more pydantic cleanup
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
7dfac0163b
bits of pydantic
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
09a3c2a82d
major functionality working (not configurable, occasional timing bugs maybe)
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
c32c65014b
definitely broke something in the pipeline
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
f082eb10a2
small cleanup
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
b8898e449e
lots of debugging statements. multiple function calls broken
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
d1f6d229ca
space exploration prompt
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
4fa0318005
configurability via constructor
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
93ebb9d541
working 19-openai-realtime-beta.py example
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
16101c79c5
beginning of realtime impl
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
c866b3f2c9
Merge pull request #572 from pipecat-ai/khk/fix-deepgram-settings
...
fix for Deepgram settings not merging properly
2024-10-12 20:07:04 -07:00
Mark Backman
c26a45721f
Set inputs as Optional
2024-10-12 21:52:56 -04:00
Mark Backman
d9c900f872
Satisfy minimal text requirements for Cartesia and OpenAI
2024-10-12 21:27:37 -04:00
chadbailey59
73becbad29
fixed parallel async function calls bug ( #569 )
2024-10-12 17:45:24 -05:00
Aleix Conchillo Flaqué
f1df3de263
Merge pull request #560 from Allenmylath/patch-7
...
Update requirements.txt aiohttp missing
2024-10-12 14:52:24 -07:00
Aleix Conchillo Flaqué
3bc5c8cda7
Merge pull request #557 from Allenmylath/patch-4
...
Update env.example wrong tts service in env
2024-10-12 14:51:54 -07:00
Aleix Conchillo Flaqué
7b3b1058b2
Merge pull request #559 from Allenmylath/patch-6
...
Update server.py
2024-10-12 14:51:24 -07:00
Aleix Conchillo Flaqué
87473f857f
Merge pull request #558 from Allenmylath/patch-5
...
Update env.example wrong tts
2024-10-12 14:50:52 -07:00
Aleix Conchillo Flaqué
a96209185c
Merge pull request #546 from Allenmylath/patch-2
...
Update README.md
2024-10-12 14:46:15 -07:00
Aleix Conchillo Flaqué
34cc2ed1a1
Merge pull request #532 from nmaswood/nmaswood/format-logs
...
Format and Support Unicode for LLM Message Debug Logs
2024-10-12 14:42:58 -07:00
Aleix Conchillo Flaqué
667aa0c25a
Merge pull request #542 from joachimchauvet/main
...
Update LiveKit audio transport for changes introduced in v0.0.42
2024-10-12 14:13:02 -07:00
Mark Backman
12707f4ff7
_settings needs to be Dict
2024-10-12 12:19:54 -04:00
Kwindla Hultman Kramer
53451899a7
fix for Deepgram settings not merging
2024-10-11 21:07:39 -07:00
Aleix Conchillo Flaqué
dc73b20c0b
Merge pull request #451 from Canonical-AI-Inc/recording
...
Audio recording FrameProcessor
2024-10-11 13:48:19 -07:00
Adrian Cowham
4330374ba4
passing kwargs and forcing keyword-only arguments
2024-10-11 12:01:51 -07:00
Adrian Cowham
79c8aa2c4a
ruff formatting
2024-10-11 11:35:02 -07:00
Adrian Cowham
083d221dd2
PR feedback
2024-10-11 11:29:01 -07:00
Mark Backman
74d47b725f
Add table filtering
2024-10-11 14:10:47 -04:00
Adrian Cowham
917e482876
Merge branch 'main' into recording
2024-10-11 10:36:04 -07:00
Adrian Cowham
522d931950
better interruption handling by moving the processors after the transport output
2024-10-11 10:33:12 -07:00
Mark Backman
d10c7ac7ce
Add Changelog entry
2024-10-11 13:28:34 -04:00
Mark Backman
84705427c5
Add code filtering option to MarkdownTextFilter
2024-10-11 11:11:58 -04:00
Aleix Conchillo Flaqué
66a76af341
Merge pull request #567 from pipecat-ai/aleix/prepare-0.0.43
...
update CHANGELOG for 0.0.43
2024-10-10 14:09:18 -07:00
Aleix Conchillo Flaqué
d402d91c2f
update CHANGELOG for 0.0.43
2024-10-10 14:06:18 -07:00
Mark Backman
b05130a089
Merge pull request #566 from pipecat-ai/mb/make-markdown-modifiable
...
Mark the Markdown processor a util, and allow it to take inputs
2024-10-10 17:00:19 -04:00
Mark Backman
b3cc0779f0
Update the changelog
2024-10-10 16:49:20 -04:00
Mark Backman
cbecae40a9
Mark the Markdown processor a util, and allow it to take inputs
2024-10-10 16:43:48 -04:00
Mark Backman
5b8753c8b6
Add speak_code input param
2024-10-10 13:17:37 -04:00
Mark Backman
3c5f9457f1
More edge case improvements
2024-10-10 12:07:00 -04:00
Mark Backman
e32e56d0bc
Merge pull request #565 from pipecat-ai/mb/add-markdown-remover
...
Add a new processor which removes markdown and special chars from TTS text
2024-10-10 07:16:42 -04:00
Mark Backman
788aec665b
Add a new processor which removes markdown and special chars from TTS text
2024-10-10 07:11:31 -04:00
Mark Backman
3cada03a92
Merge pull request #564 from pipecat-ai/mb/bot-tts-text-urgent
...
Make bot-tts-text messages urgent
2024-10-08 19:26:46 -04:00
Mark Backman
e21fb520f9
Make bot-tts-text messages urgent
2024-10-08 17:07:08 -04:00
allenmylath
864f4d385f
Update requirements.txt aiohttp missing
...
aiohttp is not included but uded in code
2024-10-08 16:39:25 +05:30
allenmylath
26ac2878ae
Update server.py
...
desccription of fastapi sagrgumentparser wrongly shown as stroy teller instead of patient-intake
2024-10-08 15:18:26 +05:30
allenmylath
cac63f5565
Update env.example wrong tts
...
cartesian used in code but elevenlabs in .env example
2024-10-08 14:24:23 +05:30
allenmylath
aadffd6199
Update env.example wrong tts service in env
...
cartesian used in code but env got elevenlabs
2024-10-08 14:15:54 +05:30
Aleix Conchillo Flaqué
3403197a90
Merge pull request #552 from pipecat-ai/aleix/rtvi-user-llm-text
...
rtvi: add RTVIUserLLMTextProcessor
2024-10-07 08:33:29 -07:00
Aleix Conchillo Flaqué
8cdb9ab1ad
rtvi: internal transport message should be urgent
2024-10-07 08:04:14 -07:00
Mark Backman
5dbf26d283
Handle cases where text is either a list or a string
2024-10-07 07:21:32 -04:00
Mark Backman
8001bab9b0
Remove another instance of urgent=true
2024-10-07 06:58:32 -04:00
Aleix Conchillo Flaqué
12d0686adc
rtvi: rename bot-audio to bot-tts-audio
2024-10-06 16:50:55 -07:00
Aleix Conchillo Flaqué
a28a5e954a
add TransportMessageSystemFrame
2024-10-06 16:50:12 -07:00
Aleix Conchillo Flaqué
bb966a89d2
rtvi: add RTVIUserLLMTextProcessor
2024-10-06 01:05:58 -07:00
Aleix Conchillo Flaqué
4a74eb3321
use isinstance tuples
2024-10-06 00:45:27 -07:00
Aleix Conchillo Flaqué
1f54ee6991
pyproject: update deepgram to 3.7.3
2024-10-06 00:40:47 -07:00
Allenmylath
40af3571f0
Create Frames.md
...
Made asmall explanation for diffrent types of frames in pipcat
2024-10-05 22:04:03 +05:30
joachimchauvet
86143f79a1
use new InputAudioRawFrame and OutputAudioRawFrame
2024-10-05 14:17:27 +03:00
joachimchauvet
b373bc82b5
match behavior of Daily's on_first_participant_joined
2024-10-05 14:17:27 +03:00
Mark Backman
ea2a05a04b
Merge pull request #545 from pipecat-ai/mb/fix-language-handling
...
Improve language string handling for TTS services
2024-10-04 10:03:06 -04:00
Mark Backman
5692ca586c
Merge pull request #547 from pipecat-ai/mb/update-test-requirements
...
Update fastapi version in test-requirements.txt
2024-10-04 08:28:05 -04:00
Mark Backman
a11ad81f02
Update fastapi version in test-requirements.txt
2024-10-04 07:35:48 -04:00
Allenmylath
805efdb144
Update README.md
...
the description provided is that of simple chatbot and also the video of simple chatbot hence changed
2024-10-04 10:19:38 +05:30
Mark Backman
c49b31e6ad
Add CHANGELOG entry
2024-10-03 23:13:59 -04:00
Mark Backman
7796a272ce
Improve language handling for TTS services
2024-10-03 23:09:27 -04:00
Adrian Cowham
678e87fd31
comment back in some code
2024-10-03 14:12:23 -07:00
Adrian Cowham
4d81a2ebfe
nuked the code that marks user audio in favor for InputAudioRawFrame. also moving to stereo instead of mono with the human and bot on their own channel.
2024-10-03 14:10:03 -07:00
Adrian Cowham
2d82702e04
merge from main
2024-10-03 09:42:06 -07:00
Mark Backman
27dcf83f37
Merge pull request #543 from pipecat-ai/mb/fix-deepgram-stt-language
...
Deepgram: disconnect and reconnect on language change
2024-10-03 12:40:27 -04:00
Mark Backman
72db83528d
Update changelog
2024-10-03 12:37:26 -04:00
Mark Backman
45c7d36b2e
Deepgram: disconnect and reconnect on language change
2024-10-03 12:31:42 -04:00
Aleix Conchillo Flaqué
65eeb0f1f6
Merge pull request #540 from pipecat-ai/cb/interruption-fix
...
Fixed RTVI `tts:interrupt` action not interrupting
2024-10-02 13:46:52 -07:00
Aleix Conchillo Flaqué
1d7d0bb1ea
Merge pull request #539 from pipecat-ai/aleix/pipecat-0.0.42-fixes
...
pipecat 0.0.42 fixes
2024-10-02 13:34:28 -07:00
Aleix Conchillo Flaqué
598936bc53
services: apply service language code before using service
2024-10-02 13:30:01 -07:00
Chad Bailey
b1bf6f7733
fixed botinterruptionframe
2024-10-02 19:43:51 +00:00
Aleix Conchillo Flaqué
75d27aeb9f
examples(storytelling): update packages
2024-10-02 12:00:00 -07:00
Aleix Conchillo Flaqué
0a37caf4b4
openai: fix image json logging
2024-10-02 11:57:50 -07:00
Aleix Conchillo Flaqué
6db65f4335
cartesia: use model_name instead of model_id
2024-10-02 11:57:36 -07:00
Aleix Conchillo Flaqué
3648874301
gladia: fix languages
2024-10-02 11:57:25 -07:00
Aleix Conchillo Flaqué
8bcb5d7fd2
services: async generators should yield frames
2024-10-02 11:57:08 -07:00
Aleix Conchillo Flaqué
8c01a900cd
google: allow using GOOGLE_APPLICATION_CREDENTIALS
2024-10-02 11:56:01 -07:00
Mark Backman
d378e699d2
Merge pull request #538 from Allenmylath/patch-2
...
Update env.example for wrong tts
2024-10-02 12:53:50 -04:00
Mark Backman
c25c375c41
Merge pull request #537 from pipecat-ai/mb/fix-nested-strings
...
Fix nested strings issue
2024-10-02 12:39:00 -04:00
Allenmylath
70c3ff31fd
Update env.example
...
elevenlabs is not used in code instead cartesian is used hence changed
2024-10-02 21:59:51 +05:30
Mark Backman
cd2e29f285
Fix nested strings issue
2024-10-02 12:26:30 -04:00
Aleix Conchillo Flaqué
6d4d7d763d
Merge pull request #534 from pipecat-ai/aleix/prepare-0.0.42
...
update CHANGELOG for 0.0.42
2024-10-02 08:36:32 -07:00
Aleix Conchillo Flaqué
6c1851eef8
update CHANGELOG for 0.0.42
2024-10-02 08:36:17 -07:00
Mark Backman
096a15eef6
Merge pull request #527 from pipecat-ai/mb/google-tts-inputs
...
Further consolidate service update settings into a single ServiceUpdateSettingsFrame class
2024-10-02 11:13:25 -04:00
Mark Backman
3d642df2b0
Revert aligning voice_id name in TTS service constructor
2024-10-02 11:07:48 -04:00
Mark Backman
d75a02dc51
Use Language enum and set languages accordingly
2024-10-01 21:03:01 -04:00
Mark Backman
28643b453d
Update to use LLM, STT, TTS subclasses and remove setter methods
2024-10-01 20:30:27 -04:00
Nasr Maswood
d5635de5f6
add new lines and unicode to JSON debug logs
2024-10-01 13:31:58 -04:00
Mark Backman
88cca7bf68
Consolidate service UpdateSettingsFrame into a single ServiceUpdateSettingsFrame
2024-10-01 11:01:04 -04:00
Mark Backman
a397b859fe
Add support for gender and google_style inputs to Google TTS
2024-10-01 10:39:45 -04:00
Kwindla Hultman Kramer
8aae4e9856
Merge pull request #531 from pipecat-ai/khk/function-calling-improvements
2024-10-01 07:23:38 -07:00
Kwindla Hultman Kramer
92d8b37229
implement vision for openai
2024-09-30 21:49:29 -07:00
Kwindla Hultman Kramer
0801fc578b
Merge pull request #530 from pipecat-ai/khk/tts-say-fix
...
fix for multi-sentence tts say utterances
2024-09-30 20:59:53 -07:00
Kwindla Hultman Kramer
0d5cb84531
function calling testing and improvements
2024-09-30 20:59:28 -07:00
Kwindla Hultman Kramer
47b943a117
Merge pull request #522 from pipecat-ai/rebase-openai-multi-function-call
...
Handle parallel function calls for OpenAI LLMs
2024-09-30 16:23:37 -07:00
Kwindla Hultman Kramer
128355add5
fix for multi-sentence tts say utterances
2024-09-30 16:19:31 -07:00
Kwindla Hultman Kramer
0499fe41e4
get rid of some debug log lines used during development
2024-09-30 16:08:33 -07:00
Kwindla Hultman Kramer
6ad3437fd2
throw error if the llm tries to call a function that's not registered
2024-09-30 16:08:33 -07:00
Kwindla Hultman Kramer
a5c73ec829
handle openai multiple function calls
2024-09-30 16:08:30 -07:00
JeevanReddy
def04ac0ce
openai can give multiple tool calls, current implementation assumes only one function call at a time. Fixed this to handle multiple function calls.
2024-09-30 16:07:56 -07:00
Kwindla Hultman Kramer
5d63615b1b
Merge pull request #528 from pipecat-ai/khk/sentence-splits
...
TTS sentence aggregation fix
2024-09-30 16:07:21 -07:00
Kwindla Hultman Kramer
90ee284fe0
Merge pull request #520 from pipecat-ai/khk/context-frame-push
...
pushing context frames from assistant aggregators
2024-09-30 16:06:54 -07:00
Kwindla Hultman Kramer
539e0b66fb
small fix as per aleix
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
fef393dcac
assistant aggregator switch for space padding or not
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
ed607d5c4b
typo fix
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
37da7e44cd
whitespace fix
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
69c7edd60c
pushing context frames from assistant aggregators
2024-09-30 16:05:28 -07:00
Aleix Conchillo Flaqué
392f210371
Merge pull request #524 from pipecat-ai/aleix/everything-is-async
...
all frame processors are asynchrnous
2024-09-30 15:59:03 -07:00
Mark Backman
9a63df1ea1
Merge pull request #529 from pipecat-ai/mb/daily-python-0-11-0
...
Update daily-python to 0.11.0
2024-09-30 18:29:27 -04:00
Mark Backman
f8a75cede9
Update daily-python to 0.11.0
2024-09-30 18:22:38 -04:00
Aleix Conchillo Flaqué
4d1e370e02
pipeline(task): since everything is async tasks should wait for EndFrame
2024-09-30 15:11:21 -07:00
Aleix Conchillo Flaqué
d080a31a5c
tests: fix langchanin tests
2024-09-30 15:11:21 -07:00
Aleix Conchillo Flaqué
a90ebdfe7c
syncparallelpipeline: fix now that all frames are asynchronous
2024-09-30 15:11:21 -07:00
Aleix Conchillo Flaqué
c8995b82e5
all frame processors are asynchrnous
...
In this commit we make all frame processors asynchronous, that is, they have an
internal queue and they push frames using a task from that queue.
2024-09-30 15:11:21 -07:00
Kwindla Hultman Kramer
6b7f924af6
tts sentence aggregation fix
2024-09-30 14:33:08 -07:00
Mark Backman
51580e5349
Merge pull request #526 from pipecat-ai/mb/google-tts-lang-update
...
Set Google TTS default language to en-US
2024-09-30 15:32:43 -04:00
Mark Backman
ed49cebf2c
Set Google TTS default language to en-US
2024-09-30 15:16:46 -04:00
Mark Backman
46ac76701e
Merge pull request #517 from pipecat-ai/mb/update-settings-frame
...
Consolidate update frames classes into a single UpdateSettingsFrame class
2024-09-30 12:56:45 -04:00
Mark Backman
1f77863aef
Code review feedback
2024-09-30 12:50:40 -04:00
Mark Backman
d7555609fd
Add TTS update settings options
2024-09-30 12:50:40 -04:00
Mark Backman
7fe118ce63
Align use of language param across TTS services
2024-09-30 12:50:40 -04:00
Mark Backman
44a349386c
Consolidate update frames classes into a single UpdateSettingsFrame class
2024-09-30 12:50:39 -04:00
Mark Backman
97cba92fa5
Merge pull request #516 from pipecat-ai/mb/google-tts
...
Add Google TTS
2024-09-30 12:25:16 -04:00
Aleix Conchillo Flaqué
d9b16d4f73
services: import cosmetics
2024-09-27 13:32:27 -07:00
Aleix Conchillo Flaqué
50b6580fbb
livekit: add license notice
2024-09-27 13:28:33 -07:00
Mark Backman
e7548f9494
Code review feedback
2024-09-27 08:02:44 -04:00
Mark Backman
830d2df671
Add Google TTS
2024-09-27 07:36:20 -04:00
Aleix Conchillo Flaqué
13b50a07db
Merge pull request #515 from pipecat-ai/aleix/rtvi-frame-processors
...
RTVI frame processors
2024-09-27 00:48:09 -07:00
Aleix Conchillo Flaqué
4501dca133
Merge pull request #467 from joachimchauvet/main
...
Add LiveKit audio transport
2024-09-26 22:58:25 -07:00
Aleix Conchillo Flaqué
2c8e566507
rtvi: update version to 0.2
2024-09-26 22:42:36 -07:00
Aleix Conchillo Flaqué
6e8a202107
rtvi: fix handling transport messages
2024-09-26 22:42:19 -07:00
Aleix Conchillo Flaqué
2a05cd35b0
rtvi: add multiple RTVI frame processors
2024-09-26 22:42:08 -07:00
Mark Backman
55a70cde8f
Merge pull request #514 from pipecat-ai/mb/aws-polly-tts
...
Add AWS Polly TTS support
2024-09-26 22:20:13 -04:00
Mark Backman
706c00d897
Code review feedback
2024-09-26 22:13:37 -04:00
Aleix Conchillo Flaqué
d323ea9e95
async_generator: keep pushing frames downstream
2024-09-26 16:44:49 -07:00
Aleix Conchillo Flaqué
b8ece84c6e
services: super should be super()
2024-09-26 10:39:26 -07:00
Mark Backman
a018112a13
Merge pull request #510 from pipecat-ai/mb/deepgram-tts-http
...
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-26 13:38:42 -04:00
Mark Backman
d3a477902b
Add changelog entry
2024-09-26 13:35:59 -04:00
Mark Backman
298b151486
Add setter methods
2024-09-26 13:35:59 -04:00
Mark Backman
6a6ea251ae
Add AWS Polly TTS support
2024-09-26 13:35:59 -04:00
Aleix Conchillo Flaqué
c7c709a0a7
github: cache venv when running tests
2024-09-26 10:32:22 -07:00
Aleix Conchillo Flaqué
6ac57b4854
Merge pull request #494 from badbye/full-width-punctuations
...
add full-width punctuations as end of the sentence
2024-09-26 10:17:10 -07:00
Aleix Conchillo Flaqué
f5e0b946c7
services(cartesia): fix string formatting
2024-09-26 09:08:37 -07:00
Mark Backman
b1818cc370
Merge pull request #435 from golbin/main
...
Add speed and emotion options for Cartesia.
2024-09-26 07:14:59 -04:00
Jin Kim
d05717a1bd
Apply Ruff formater
2024-09-26 19:52:25 +09:00
Aleix Conchillo Flaqué
d11daee31a
Merge pull request #509 from pipecat-ai/aleix/frameprocessor-event-handlers
...
frame processor event handlers
2024-09-25 19:50:30 -07:00
Mark Backman
73da8c1910
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-25 22:43:10 -04:00
Aleix Conchillo Flaqué
f06aa300d0
rtvi: add on_bot_ready event
2024-09-25 16:52:18 -07:00
Aleix Conchillo Flaqué
c4e94e280e
processors: add support for event handlers
2024-09-25 16:35:33 -07:00
Kwindla Hultman Kramer
8f2941c575
Merge pull request #492 from pipecat-ai/khk/flush-more-audio
...
add calls to flush_audio for say() and rtvi action
2024-09-25 12:35:50 -07:00
joachimchauvet
447baad5c3
update send_metrics() to support changes introduced in #474
2024-09-25 21:38:55 +03:00
Mark Backman
2703813e8a
Merge pull request #496 from pipecat-ai/mb/azure-tts-inputs
...
Add Azure TTS input params
2024-09-25 14:38:01 -04:00
Mark Backman
521e152150
Merge pull request #495 from pipecat-ai/mb/elevenlabs-input-lang
...
Add language_code support for ElevenLabs TTS
2024-09-25 14:37:44 -04:00
Kwindla Hultman Kramer
3d43ad0f4d
actually save the file
2024-09-25 10:59:00 -07:00
Kwindla Hultman Kramer
3621fceae2
fixes as noted by aleix
2024-09-25 09:19:58 -07:00
Aleix Conchillo Flaqué
e123f33c03
Merge pull request #506 from pipecat-ai/aleix/async-generator-processor
...
processors: add AsyncGeneratorProcessor
2024-09-25 00:04:09 -07:00
Aleix Conchillo Flaqué
b8713666c2
processors: add AsyncGeneratorProcessor
2024-09-25 00:01:04 -07:00
Aleix Conchillo Flaqué
cf0ab85e2c
Merge pull request #505 from pipecat-ai/aleix/init-task-variables
...
initialize task variables and add minor description
2024-09-24 23:59:38 -07:00
Aleix Conchillo Flaqué
8502c7c801
Merge pull request #504 from pipecat-ai/aleix/rtvi-handle-frame
...
rtvi: add RTVIProcessor.handle_message()
2024-09-24 23:59:26 -07:00
Aleix Conchillo Flaqué
e89814dc6b
Merge pull request #503 from pipecat-ai/aleix/end-cancel-task-frames
...
frames: add EndTaskFrame and CancelTaskFrame
2024-09-24 23:59:10 -07:00
Aleix Conchillo Flaqué
9461bacf0d
pyproject: update fastapi to 0.115.0
2024-09-24 19:24:37 -07:00
Aleix Conchillo Flaqué
e276dcbab7
initialize task variables and add minor description
2024-09-24 19:19:00 -07:00
Aleix Conchillo Flaqué
1a3de0e819
rtvi: add RTVIProcessor.handle_message()
2024-09-24 19:12:06 -07:00
Aleix Conchillo Flaqué
ee3786fe15
frames: add EndTaskFrame and CancelTaskFrame
2024-09-24 19:10:22 -07:00
Aleix Conchillo Flaqué
31b5667cee
frames: log text with [] so we can distinguish spaces better
2024-09-24 13:10:40 -07:00
Aleix Conchillo Flaqué
a483f1a083
rtvi: handle all actions from the action task
2024-09-24 10:48:15 -07:00
Aleix Conchillo Flaqué
2ecec1c9f8
Merge pull request #500 from pipecat-ai/aleix/rtvi-action-frames-task
...
RTVI action frames task
2024-09-24 10:13:43 -07:00
Aleix Conchillo Flaqué
08ac311971
rtvi: use task to process incoming action frames
2024-09-24 09:36:53 -07:00
Aleix Conchillo Flaqué
cb49b6a0d6
rtvi: add llm-text and tts-text server messages
2024-09-24 09:36:43 -07:00
Aleix Conchillo Flaqué
016da177db
Merge pull request #499 from mercuryyy/main
...
Fix syntax error in deepgram.py
2024-09-24 09:10:05 -07:00
joachimchauvet
ec5998bc36
remove _internal_push_frame from LiveKitInputTransport
2024-09-24 14:54:37 +03:00
mercuryyy
b1e17ee347
Fix syntax error in deepgram.py
2024-09-24 07:45:29 -04:00
joachimchauvet
b6e1d6e6ae
format with ruff
2024-09-24 10:21:02 +03:00
joachimchauvet
fa609f1afc
adjust output sample rate and create user token
2024-09-24 10:16:54 +03:00
joachimchauvet
470b5eafe7
move tenacity imports inside try block
2024-09-24 10:16:54 +03:00
joachimchauvet
2e5b0c1d6b
add tenacity dependency
2024-09-24 10:16:54 +03:00
joachimchauvet
a9390d96a1
add LiveKit audio transport
2024-09-24 10:16:54 +03:00
Mark Backman
8ee9621d66
Add setter functions
2024-09-23 21:12:01 -04:00
Jin Kim
49f2123893
Apply and Fix upstream changes for Cartesia
2024-09-24 07:59:26 +09:00
Jin Kim
cf72129852
Merge remote-tracking branch 'upstream/main'
2024-09-24 07:18:22 +09:00
Mark Backman
8edee8155d
Add input params to Azure TTS
2024-09-23 17:52:23 -04:00
chadbailey59
c262b272fa
Added RTVIActionFrame ( #464 )
...
* added RTVIActionFrame
* server-sent events
* reverted log changes
* fixup
2024-09-23 14:51:17 -05:00
Aleix Conchillo Flaqué
9ef9c1c58a
Merge pull request #497 from pipecat-ai/aleix/ruff-formater
...
introduce Ruff formatting
2024-09-23 10:42:54 -07:00
Aleix Conchillo Flaqué
c7ff79a652
processors: fix formatting string
2024-09-23 09:53:37 -07:00
Aleix Conchillo Flaqué
da81df5284
github: install dev-requirements when running tests
2024-09-23 09:53:37 -07:00
Aleix Conchillo Flaqué
a4420dc88b
README: add vscode and emacs ruff instructions
2024-09-23 09:53:37 -07:00
Aleix Conchillo Flaqué
eeb8338dce
introduce Ruff formatting
2024-09-23 09:53:37 -07:00
Cyril S.
dfa4ac81fd
Implement Sentry instrumentation for performance and error tracking ( #470 )
...
* feat: Add Sentry support in FrameProcessor
This update add optional Sentry integration for performance tracking and error monitoring.
Key changes include:
- Add conditional Sentry import and initialization check
- Implement Sentry spans in FrameProcessorMetrics to measure TTFB (Time To First Byte) and processing time when Sentry is available
- Maintain existing metrics functionality with MetricsFrame regardless of Sentry availability
* feat: Enable metrics in DeepgramSTTService for Sentry
This commit enhances the DeepgramSTTService class to enable metrics generation for use with Sentry.
Key changes include:
1. Enable general metrics generation:
- Implement `can_generate_metrics` method, returning True when VAD is enabled
- This allows metrics to be collected and used by both Sentry and the metrics system in frame_processor.py
2. Integrate Sentry-compatible performance tracking:
- Add start_ttfb_metrics and start_processing_metrics calls in the VAD speech detection handler
- Implement stop_ttfb_metrics call when receiving transcripts
- Add stop_processing_metrics for final transcripts
3. Enhance VAD support for metrics:
- Add `vad_enabled` property to check VAD event availability
- Implement VAD-based speech detection handler for precise metric timing
These changes enable detailed performance tracking via both Sentry and the general metrics system when VAD is active. This allows for better monitoring and analysis of the speech-to-text process, providing valuable insights through Sentry and any other metrics consumers in the pipeline.
* Update frame_processor.py
* Refactor to support flexible metrics implementation
- Modified the __init__ method to accept a metrics parameter that is either FrameProcessorMetrics or one of its subclasses
- Updated the metrics initialization to create an instance with the processor's name
- Moved all FrameProcessorMetrics-related logic to a new processors\metrics\base.py file
* Implement flexible metrics system with Sentry integration
1. Created a new metrics module in processors/metrics/
2. Implemented FrameProcessorMetrics base class in base.py:
3. Implemented SentryMetrics class in sentry.py:
- Inherits from FrameProcessorMetrics
- Integrates with Sentry SDK for advanced metrics tracking
- Implements Sentry-specific span creation and management for TTFB and processing metrics
- Handles cases where Sentry is not available or initialized
2024-09-23 08:44:14 -07:00
Lewis Wolfgang
ea16dca8aa
Merge pull request #469 from pipecat-ai/lewis/remove_torch_dependency
...
Remove torch dependency for using silero_vad
2024-09-23 09:59:40 -04:00
Mark Backman
306632b29a
Add language_code support for ElevenLabs TTS
2024-09-23 09:01:02 -04:00
duyalei
4533ed014f
add full-width punctuations as end of the sentence
2024-09-23 16:35:00 +08:00
Jin Kim
68cc4186ad
Merge remote-tracking branch 'upstream/main'
2024-09-23 16:34:31 +09:00
Mark Backman
9a4e749c7c
Merge pull request #491 from pipecat-ai/mb/elevenlabs-inputs
...
Add voice_settings and optimize_streaming_latency to ElevenLabs
2024-09-22 21:54:21 -04:00
Mark Backman
55c645c614
Add voice_settings and optimize_streaming_latency to ElevenLabs
2024-09-22 13:58:50 -04:00
Mark Backman
a1024bb365
Merge pull request #490 from pipecat-ai/mb/llm-rtvi-service-option
...
Add control frames for LLM param updates
2024-09-21 20:10:17 -04:00
Mark Backman
dfc82c3ba4
Merge pull request #486 from pipecat-ai/mb/llm-extra-params
...
Add extra input param to LLMs
2024-09-21 18:25:47 -04:00
Mark Backman
9e27a8aad0
Add control frames for LLM param updates
2024-09-21 00:02:58 -04:00
Mark Backman
c73111afea
Add extra input param to LLMs
2024-09-21 00:01:25 -04:00
Kwindla Hultman Kramer
26a64afd8d
Merge pull request #485 from pipecat-ai/khk/metrics-model-exclude-none
...
fixup for serialization issue
2024-09-20 18:24:19 -07:00
Kwindla Hultman Kramer
78a3f081de
fixup for serialization issue
2024-09-20 18:21:06 -07:00
Mark Backman
e8f8a49646
Merge pull request #484 from pipecat-ai/mb/llm-input-params
...
Add input params for OpenAI, Anthropic, Together AI LLMs
2024-09-20 20:35:49 -04:00
Mark Backman
219304c5ee
Added Changelog entries
2024-09-20 20:31:42 -04:00
Mark Backman
f3fd312b83
Add Together AI interruptible example
2024-09-20 20:21:19 -04:00
Mark Backman
357e66d64d
Input params for Together AI LLM
2024-09-20 20:21:19 -04:00
Mark Backman
4fa1ea8c4b
Input params for Anthropic LLM
2024-09-20 20:21:19 -04:00
Mark Backman
3b81cd462d
Input params to OpenAI LLM
2024-09-20 20:21:19 -04:00
Aleix Conchillo Flaqué
14acf05a26
Merge pull request #480 from pipecat-ai/aleix/input-output-frames
...
introduce input/output audio and image frames
2024-09-20 14:44:37 -07:00
Mattie Ruth
58d9c84bc9
Merge pull request #474 from pipecat-ai/ruthless/improve-metrics-types-2
...
Ruthless/improve metrics types 2
2024-09-20 09:47:24 -04:00
Aleix Conchillo Flaqué
7e39d9ad3d
introduce input/output audio and image frames
...
We now distinguish between input and output audio and image frames. We introduce
`InputAudioRawFrame`, `OutputAudioRawFrame`, `InputImageRawFrame` and
`OutputImageRawFrame` (and other subclasses of those). The input frames usually
come from an input transport and are meant to be processed inside the pipeline
to generate new frames. However, the input frames will not be sent through an
output transport. The output frames can also be processed by any frame processor
in the pipeline and they are allowed to be sent by the output transport.
2024-09-19 23:11:03 -07:00
mattie ruth backman
a4edb3dab1
Cleanup on aisle METRICS. Note: See below, this is a breaking change
...
1. Fleshed out MetricsFrames and broke it into a proper set of types
2. Add model_name as a property to the AIService so that it can be
automatically included in metrics and also remove that
overhead from all the various services themselves
Breaking change!
Because of the types improvements, the MetricsFrame type has
changed. Each frame will have a list of metrics simlilar to before
except each item in the list will only contain one type of metric:
"ttfb", "tokens", "characters", or "processing". Previously these
fields would be in every entry but set to None if they didn't apply.
While this changes internal handling of the MetricsFrame, it does NOT
break the RTVI/daily messaging of metrics. That format remains the same.
Also. Remember to use model_name for accessing a service's current
model and set_model_name for setting it.
2024-09-19 21:30:34 -04:00
Mattie Ruth
ed409d0460
Merge pull request #478 from pipecat-ai/ruthless/get-tests-running
...
Ruthless/get tests running
2024-09-19 21:01:27 -04:00
mattie ruth backman
50b45ac2da
get the test infrastructure running again
...
disable broken tests for now
2024-09-19 20:58:17 -04:00
Kwindla Hultman Kramer
29bcbc68c5
Merge pull request #479 from pipecat-ai/khk/small-fixes
...
fix small issues that crept into main
2024-09-19 17:25:27 -07:00
Kwindla Hultman Kramer
affbe9ac7d
fix small issues that crept into main
2024-09-19 17:17:33 -07:00
Aleix Conchillo Flaqué
1790fa452f
Merge pull request #436 from pipecat-ai/aleix/frameprocessor-single-task
...
introduce synchronous and asynchronous frame processors
2024-09-19 11:22:56 -07:00
Aleix Conchillo Flaqué
607a246572
updated CHANGELOG with sync/async frame processors
2024-09-19 01:32:17 -07:00
Aleix Conchillo Flaqué
4f1b06e6b2
pipeline: renamed ParallelTask to SyncParallelPipeline
2024-09-19 01:32:17 -07:00
Aleix Conchillo Flaqué
62e9a33a70
examples: use CartesiaHttpTTSService to synchronize frames
2024-09-19 01:32:17 -07:00
Aleix Conchillo Flaqué
3298f935ef
services(fal,moondream): add missing **kwargs
2024-09-19 01:32:17 -07:00
Aleix Conchillo Flaqué
0e8f56c752
services: move TTSService push_stop_frames to AsyncTTSService
2024-09-19 01:32:15 -07:00
Aleix Conchillo Flaqué
8224538372
services(cartesia): added CartesiaHttpTTSService
2024-09-19 01:31:12 -07:00
Aleix Conchillo Flaqué
fbf6eef68f
transports(base_output): wait for sink tasks before canceling audio/video tasks
2024-09-19 01:31:12 -07:00
Aleix Conchillo Flaqué
f078d156de
frames: StartFrame is now a SystemFrame
2024-09-19 01:31:12 -07:00
Aleix Conchillo Flaqué
23d6eed5ea
transports: input()/output() return subclass instead of base class
2024-09-19 01:31:12 -07:00
Aleix Conchillo Flaqué
0ed3d118d6
services(moondream); update revision to 2024-08-26
2024-09-19 01:31:12 -07:00
Aleix Conchillo Flaqué
337f048864
introduce synchronous and asynchronous frame processors
...
Pipecat has a pipeline-based architecture. The pipeline consists of frame
processors linked to each other. The elements travelling across the pipeline are
called frames.
To have a deterministic behavior the frames travelling through the pipeline
should always be ordered, except system frames which are out-of-band frames. To
achieve that, each frame processor should only output frames from a single task.
There are synchronous and asynchronous frame processors. The synchronous
processors push output frames from the same task that they receive input frames,
and therefore only pushing frames from one task. Asynchrnous frame processors
can have internal tasks to perform things asynchrnously (e.g. receiving data
from a websocket) but they also have a single task where they push frames from.
2024-09-19 01:31:10 -07:00
Mark Backman
6f3c421621
Merge pull request #475 from pipecat-ai/mb/tts-sample-rate
...
Add sample_rate setting to TTS services
2024-09-18 14:59:09 -04:00
Mark Backman
eadd68d40b
Add sample_rate setting to TTS services
2024-09-18 14:50:20 -04:00
Lewis Wolfgang
71202e3cd5
Remove torch dependency for using silero_vad
2024-09-17 16:48:52 -04:00
Jin Kim
75008d8f11
Add speed and emotion setting method to Cartesia TTS service
2024-09-18 00:51:45 +09:00
Jin Kim
2da0ecbe3c
Revert "model_id" as a main argument
2024-09-18 00:38:12 +09:00
Jin Kim
c7f814b2dc
Merge remote-tracking branch 'upstream/main'
2024-09-18 00:33:29 +09:00
Aleix Conchillo Flaqué
13a4a05388
Merge pull request #466 from pipecat-ai/aleix/elevenlabs-cartesia-close-websocket-first
...
services(cartesia,elevenlabs): close websocket before the receiving task
2024-09-16 23:55:28 -07:00
Aleix Conchillo Flaqué
20c019ae16
services(cartesia,elevenlabs): close websocket before the receiving task
2024-09-16 23:54:21 -07:00
Adrian Cowham
387a36dd8a
missed a debug print statement
2024-09-16 17:43:42 -07:00
Aleix Conchillo Flaqué
d9d6571c73
Merge pull request #465 from kunal-cai/ks--fix-ws
...
[Cartesia] Fix streaming truncation bug with Twilio Fast API WS
2024-09-16 17:17:13 -07:00
Kunal Shah
540cad4844
Undo sorting
2024-09-16 16:07:19 -07:00
Kunal Shah
0a26b650c0
Undo sorting
2024-09-16 16:06:25 -07:00
Kunal Shah
adaac003e5
[Cartesia] Fix streaming truncation bug with Twilio Fast API WS
2024-09-16 15:59:06 -07:00
Adrian Cowham
2e02ab740d
PR feedback
2024-09-15 20:59:17 -07:00
Aleix Conchillo Flaqué
3d4f125071
Merge pull request #454 from pipecat-ai/aleix/initial-pipeline-clock-support
...
initial pipeline clock support
2024-09-13 13:51:04 -07:00
Aleix Conchillo Flaqué
bce87f8717
update CHANGELOG.md
2024-09-13 13:50:03 -07:00
Aleix Conchillo Flaqué
1fe940bd6b
servceis(cartesia,elevenlabs): use word start times instead
2024-09-13 13:10:44 -07:00
Aleix Conchillo Flaqué
cb36a71381
fix some linting
2024-09-13 09:56:12 -07:00
Aleix Conchillo Flaqué
5acc4928fe
examples: add 07d-interruptible-elevenlabs.py
2024-09-13 09:43:18 -07:00
Aleix Conchillo Flaqué
434493b8aa
services(elevenlabs): implement word-by-word support through websockets
2024-09-13 09:31:35 -07:00
Aleix Conchillo Flaqué
f08b25dbb2
examples: assistant aggregator should always goes after transport
2024-09-12 00:37:34 -07:00
Aleix Conchillo Flaqué
3665734972
transports(output): initial sink clock synchronization
2024-09-12 00:37:34 -07:00
Aleix Conchillo Flaqué
a98d78cdea
services(lmnt): change to subclass of AsyncTTSService
2024-09-12 00:37:34 -07:00
Aleix Conchillo Flaqué
80f6d74e80
services(cartesia): change to subclass of AsyncWordTTSService
2024-09-12 00:37:34 -07:00
Aleix Conchillo Flaqué
02d926e9bd
services: create AsyncTTSService and AsyncWordTTSService
2024-09-12 00:31:48 -07:00
Aleix Conchillo Flaqué
7749692f72
processors: get pipeline clock from StartFrame
2024-09-12 00:31:48 -07:00
Aleix Conchillo Flaqué
7807cbeb39
pipeline(task): add a clock to the pipeline task
2024-09-12 00:31:48 -07:00
Aleix Conchillo Flaqué
72f231b327
frames: add a presentation timestamp (pts) to each frame
2024-09-12 00:31:48 -07:00
Aleix Conchillo Flaqué
3cbe97d346
clocks: added new BaseClock and SystemClock
2024-09-12 00:31:48 -07:00
Adrian Cowham
b4eff2028f
Merge branch 'main' into recording
2024-09-10 10:18:57 -07:00
Adrian Cowham
f411bf33fd
adding a frame processor with the ability to save a conversation to a buffer and another frame processor to upload audio to Canonical for evaluation and metrics collection. Examples included
2024-09-10 10:15:48 -07:00
Kwindla Hultman Kramer
b880e1a60e
Merge pull request #448 from pipecat-ai/khk/aggregation-leading-space
...
fix for leading space in context aggregator strings
2024-09-10 09:57:35 -07:00
Aleix Conchillo Flaqué
886046e696
Merge pull request #445 from dleybz/patch-1
...
Update requirements.txt
2024-09-09 17:54:33 -07:00
Aleix Conchillo Flaqué
9106a5f8ae
Merge pull request #449 from pipecat-ai/aleix/audio-out-bitrate
...
transports(daily): allow setting audio output bitrate (default 96kpbs)
2024-09-09 08:39:06 -07:00
Aleix Conchillo Flaqué
98286336bf
transports(daily): allow setting audio output bitrate (default 96kpbs)
...
Fixes #388
2024-09-08 19:39:17 -07:00
Jin Kim
fa0deededa
Add voice options and make to use InputParams for Cartesia.
2024-09-09 10:53:23 +09:00
Kwindla Hultman Kramer
081b001c8b
fix for leading space in context aggregator strings
2024-09-07 16:42:52 -07:00
Danny D. Leybzon
c92531a02f
Update requirements.txt
...
request.form() throws an error if you don't have python-multipart installed
2024-09-06 20:22:18 +02:00
Aleix Conchillo Flaqué
748a7af602
update CHANGELOG.md
2024-09-05 19:05:29 -07:00
Aleix Conchillo Flaqué
f4a0de6327
Merge pull request #444 from pipecat-ai/aleix/elevenlabs-streaming
...
services(elevenlabs): add elevenlabs package and use streaming
2024-09-05 11:24:12 -07:00
Aleix Conchillo Flaqué
e405d7af9f
services(elevenlabs): add elevenlabs package and use streaming
2024-09-05 11:20:01 -07:00
Aashraya
51cd7fd285
twiliohandle interruption ( #422 )
...
* add interuption handler in twilio serializer
* fix autopep8
* revert ruff autoformatting
* address pr comments
* change interruption frame to user started frame in serializer
* remove overrrident handle interrupt
* remove unused import
* change userstarted to interuption frame
2024-09-02 11:06:38 -07:00
Aleix Conchillo Flaqué
aba5f89174
Merge pull request #437 from soof-golan/soof-obj-id-generation
...
Generate ids with itertools.count
2024-09-02 10:53:48 -07:00
Soof Golan
5c0f5a1613
Generate ids with itertools.count
...
Avoids the critical section with threading.Lock in favor of itertools.count.
`count` objects are threadsafe, and their critical section is implemented in C and provide better performance that Python level locking.
2024-09-02 15:39:58 +02:00
Aleix Conchillo Flaqué
7c342f7ba2
Merge pull request #433 from pipecat-ai/aleix/process-all-startframes
...
StartFrame should be the first frame every processor receives
2024-08-30 14:17:38 -07:00
Aleix Conchillo Flaqué
37e2388758
StartFrame should be the first frame every processor receives
...
Fixes #427
2024-08-29 22:43:44 -07:00
Aleix Conchillo Flaqué
05f0492a8d
Merge pull request #421 from pipecat-ai/aleix/improve-multi-lingual-support
...
improve multi lingual support
2024-08-29 13:19:40 -07:00
Aleix Conchillo Flaqué
c0ac5c6ae8
services(lmnt): fix example and update README and CHANGELOG
2024-08-29 11:11:24 -07:00
Aleix Conchillo Flaqué
be923687fb
processors(rtvi): user decices if bot interrupts on update config
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
5f32fb125d
updated CHANGELOG.md
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
ae6fbb3146
services: just set model, voice, language independently
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
864768635a
services: add voice and language to set_model()
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
d7c9679977
services: allow TTSModelUpdateFrame to also update language and voice
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
fedfc366f6
services(deepgram): fix strenum values
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
b3b39626e1
services: allow switching STT language and mdoel at the same time
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
4e0ece17b6
services: added support for setting STT model and language
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
fd3fdacdee
transcriptions: added more languages
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
a253606d50
services(daily): on_joined now returns all data not only participant
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
568d9dc0a3
services(whisper): inherit from SegmentedSTTService
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
6629b853c5
services(deepgram): inherit from STTService instead of AsyncAIService
2024-08-29 11:00:03 -07:00
Aleix Conchillo Flaqué
3931cb3235
services(cartesia): allow setting language and language voices
2024-08-29 11:00:01 -07:00
Aleix Conchillo Flaqué
38cd86ad52
services: added language to transcription frames
2024-08-29 10:59:02 -07:00
Aleix Conchillo Flaqué
c0cdabf61d
frames: adde TTSLanguageUpdateFrame and TTSLanguageVoicesUpdateFrame
2024-08-29 10:59:02 -07:00
Aleix Conchillo Flaqué
51270a96c5
frames: add language to transcription frames
2024-08-29 10:59:02 -07:00
Kwindla Hultman Kramer
84d72c0d5c
Merge pull request #425 from pipecat-ai/khk/rtvi-together-function-calling
...
fixup type mismatches between rtvi data structures and together.py
2024-08-28 13:11:52 -07:00
Aleix Conchillo Flaqué
79aca8169a
Merge pull request #391 from sharvil/pr/add-lmnt
...
LMNT TTS
2024-08-27 21:40:46 -07:00
Kwindla Hultman Kramer
b9d362bd62
fixup type mismatches between rtvi data structures and together.py
2024-08-27 17:39:21 -07:00
Sharvil Nanavati
87c4a1bee1
Move stop frame task creation into TTSService.start
2024-08-27 04:45:21 +00:00
Sharvil Nanavati
c979762b70
Handle cancellation, stopping, and restarting
2024-08-27 01:24:00 +00:00
Sharvil Nanavati
1d92fc3199
Merge branch 'main' into pr/add-lmnt
2024-08-24 10:07:52 -07:00
Sharvil Nanavati
8ac7fb1a67
Use a single long-lived Task to push TTSStoppedFrame
2024-08-24 16:18:07 +00:00
Sharvil Nanavati
60c3d33def
Default LMNT to 24kHz, add example
2024-08-24 15:40:29 +00:00
Sharvil Nanavati
8a39d3f4eb
services: add a generic mechanism to produce TTSStoppedFrames
2024-08-24 15:40:12 +00:00
Aleix Conchillo Flaqué
e038767b6f
Merge pull request #413 from pipecat-ai/aleix/pipecat-0.0.41
...
prepare pipecat 0.0.41
2024-08-22 17:01:43 -07:00
Aleix Conchillo Flaqué
0c46b3e481
prepare pipecat 0.0.41
2024-08-22 11:50:20 -07:00
Aleix Conchillo Flaqué
d42f072ff5
examples: fix studypal errors and update requirements
2024-08-22 11:50:05 -07:00
Aleix Conchillo Flaqué
9b6f29c24a
Merge pull request #414 from pipecat-ai/aleix/add-livekit-dependency
...
added livekit dependency
2024-08-22 10:55:43 -07:00
Aleix Conchillo Flaqué
873d5dc23f
added livekit dependency
2024-08-22 10:54:18 -07:00
Aleix Conchillo Flaqué
6d141fd47f
Merge pull request #396 from nulyang/feat/livekit-serializers
...
Add livekit audio serializers
2024-08-22 10:44:24 -07:00
Aleix Conchillo Flaqué
c6f6cb2947
Merge pull request #412 from pipecat-ai/aleix/fastapi-variable-clash
...
transports(fastapi): fix variable name clash
2024-08-22 09:50:23 -07:00
Aleix Conchillo Flaqué
0eb189ce7f
transports(fastapi): fix variable name clash
2024-08-22 08:50:03 -07:00
Sharvil Nanavati
f4fd7b7028
LMNT TTS
2024-08-22 00:47:41 +00:00
Aleix Conchillo Flaqué
21de8e0a35
transport(out): log bot started/stopped speaking
2024-08-21 17:23:44 -07:00
Aleix Conchillo Flaqué
6f55d494bd
frames: use VADParams type in VADParamsUpdateFrame
2024-08-21 17:23:12 -07:00
Aleix Conchillo Flaqué
d216edc567
Merge pull request #409 from aashsach/anthropic-empty-tool-argument
...
handle empty parameters for anthropic function calling
2024-08-21 16:14:51 -07:00
Aashraya
ec6063ecc4
system is not a list, it is handled and assisgned as string
2024-08-21 16:31:50 +05:30
Aashraya
40fe4ce6fb
handle empty parameters for anthropic function calling
2024-08-21 15:49:36 +05:30
Aleix Conchillo Flaqué
31d87a4048
update CHANGELOG.md for 0.0.40
2024-08-20 11:48:40 -07:00
Aleix Conchillo Flaqué
ac8b171fa9
Merge pull request #406 from pipecat-ai/hush/cartesiaDocs
...
Hush/cartesia docs
2024-08-20 11:17:52 -07:00
James Hush
1f06d78213
github: remove *requirements.txt from tests.yaml
2024-08-20 11:16:25 -07:00
James Hush
28eba17df8
docs: update Cartesia references
2024-08-20 11:13:13 -07:00
Aleix Conchillo Flaqué
dfc2e62339
Merge pull request #405 from pipecat-ai/aleix/revert-dailysettings-aliases
...
Revert "transports(daily): use aliases in DailyDialinSettings"
2024-08-20 08:53:31 -07:00
Aleix Conchillo Flaqué
80c89a39c9
processors(rtvi): add support for client-ready message (fix)
2024-08-20 07:54:11 -07:00
Aleix Conchillo Flaqué
9d1c16e996
Revert "transports(daily): use aliases in DailyDialinSettings"
...
This reverts commit 47d375309d .
2024-08-20 07:52:35 -07:00
Aleix Conchillo Flaqué
86604c2353
examples(studypal): use aiohttp instead of requests
2024-08-19 18:11:30 -07:00
Aleix Conchillo Flaqué
8f31a02938
Merge pull request #403 from yashn35/studypal-demo
...
Add studypal
2024-08-19 17:39:19 -07:00
Aleix Conchillo Flaqué
47d375309d
transports(daily): use aliases in DailyDialinSettings
2024-08-19 17:27:43 -07:00
Yash Narayan
980265ca97
Add studypal
2024-08-19 16:58:29 -07:00
Aleix Conchillo Flaqué
90479fff95
processors(rtvi): add set_client_ready()
2024-08-19 16:41:43 -07:00
Aleix Conchillo Flaqué
1ce1fcb0ce
Merge pull request #401 from pipecat-ai/aleix/use-cartesia-in-more-examples
...
examples: use Cartesia TTS in most examples
2024-08-19 16:07:35 -07:00
Aleix Conchillo Flaqué
1a662376fc
examples: use Cartesia TTS in most examples
2024-08-19 15:31:34 -07:00
Aleix Conchillo Flaqué
1d24f926ec
Merge pull request #400 from pipecat-ai/aleix/rtvi-client-ready
...
processors(rtvi): add support for client-ready message
2024-08-19 10:53:49 -07:00
Aleix Conchillo Flaqué
4f2c37c940
processors(rtvi): add support for client-ready message
2024-08-19 10:33:18 -07:00
Aleix Conchillo Flaqué
042115a6bb
processors(rtvi): update initial config when sending bot-ready message
2024-08-19 09:32:27 -07:00
Aleix Conchillo Flaqué
c9f1469b41
transports(daily/helpers): add server error message to the logs
2024-08-19 08:44:05 -07:00
Aleix Conchillo Flaqué
54c9f604c9
updated CHANGELOG with VADParamsUpdateFrame
2024-08-18 21:20:40 -07:00
Kwindla Hultman Kramer
56fbcd6562
Merge pull request #397 from pipecat-ai/khk/rtvi-vad-params
...
VADParamsUpdateFrame and handling thereof
2024-08-18 21:14:58 -07:00
Kwindla Hultman Kramer
e6b0500568
make VADAnalyzer:set_params() public
2024-08-18 21:11:18 -07:00
Aleix Conchillo Flaqué
41038b6673
Merge pull request #394 from pipecat-ai/aleix/fix-function-calling-examples
...
fix function calling examples
2024-08-18 20:55:29 -07:00
Aleix Conchillo Flaqué
26d03f26c9
services(openai, anthropic): a None result should not run inference
2024-08-18 20:48:43 -07:00
Aleix Conchillo Flaqué
f3a4e54996
function calling: start callback should have function name first
2024-08-18 20:48:20 -07:00
Kwindla Hultman Kramer
925e80bb20
VADParamsUpdateFrame and handling thereof
2024-08-18 13:34:46 -07:00
nulyang
9bda09b1a8
serializers(livekit): Add audio serializers
2024-08-18 23:40:32 +08:00
Aleix Conchillo Flaqué
ef0d0531fa
services: moved request_image_frame() to LLMService
2024-08-17 23:59:38 -07:00
Aleix Conchillo Flaqué
6520f20ffe
fix function calling examples
2024-08-17 23:32:39 -07:00
Aleix Conchillo Flaqué
ebc4e0924b
Merge pull request #387 from pipecat-ai/aleix/update-reqs-081624
...
update pyproject.toml and remove requirements files
2024-08-17 23:29:47 -07:00
Aleix Conchillo Flaqué
9e7c0e6033
Merge pull request #390 from sharvil/pr/websocket-fix
...
transports(websocket): fix `_audio_buffer` being accidentally overwritten
2024-08-17 23:26:35 -07:00
Aleix Conchillo Flaqué
cf5720f316
update CHANGELOG.md
2024-08-17 21:00:32 -07:00
Kwindla Hultman Kramer
655b468269
Merge pull request #393 from pipecat-ai/khk/anthropic-tools-ordering
...
fix for out-of-order image messages in anthropic context
2024-08-17 15:07:27 -07:00
Kwindla Hultman Kramer
17f8c93e44
fix for out-of-order image messages in anthropic context
2024-08-17 14:47:29 -07:00
Aleix Conchillo Flaqué
5b4061b0d5
processors(rtvi): fix send_error()
2024-08-16 23:46:57 -07:00
Aleix Conchillo Flaqué
6ce0227e98
processors(rtvi): error-response should always include and error
2024-08-16 23:23:55 -07:00
Aleix Conchillo Flaqué
a583a28850
processors(rtvi): error message should use error field
2024-08-16 23:22:27 -07:00
Aleix Conchillo Flaqué
32daf65adc
processors(rtvi): send to the client if errors are fatal
2024-08-16 23:17:55 -07:00
Aleix Conchillo Flaqué
e22c80610e
frames: add new FatalErrorFrame
2024-08-16 23:17:31 -07:00
Sharvil Nanavati
374f1e7e01
transports(websocket): fix _audio_buffer being accidentally overwritten
...
`BaseOutputTransport` declares an `_audio_buffer` instance variable.
`WebsocketServerOutputTransport` accidentally reuses that variable
internally assuming it's class-local and not inherited.
This PR renames the variable in `WebsocketServerOutputTransport`
to avoid the name collision.
2024-08-17 05:28:05 +00:00
Aleix Conchillo Flaqué
d2dfa93bf1
processors(rtvi): send bot-ready when participant joins
2024-08-16 13:58:21 -07:00
Aleix Conchillo Flaqué
fa8c6712c6
transports(daily): fix multiple DailyTransport initialization
2024-08-16 13:32:34 -07:00
Aleix Conchillo Flaqué
4c2b84cb4d
update pyproject.toml and remove requirements files
2024-08-16 09:28:46 -07:00
Aleix Conchillo Flaqué
b57c9d569b
Merge pull request #352 from pipecat-ai/aleix/rtvi-0.1
...
processors(rtvi): rtvi 0.1 message protocol
2024-08-15 17:35:50 -07:00
Aleix Conchillo Flaqué
f0e50ba000
Merge pull request #336 from nulyang/fix/azure-transcriptionframe
...
services(azure): fix TranscriptionFrame parameter type
2024-08-15 17:08:56 -07:00
Mattie Ruth
4a6638f749
Merge pull request #385 from pipecat-ai/mrkb/anthropic-beta-caching
...
Mrkb/anthropic beta caching
2024-08-15 18:26:51 -04:00
Aleix Conchillo Flaqué
31577252f3
processors(rtvi): handle ErrorFrames
2024-08-15 15:23:31 -07:00
Aleix Conchillo Flaqué
5d71c50080
transports(daily): make sure audio_in_task exists before canceling
2024-08-15 15:23:07 -07:00
Aleix Conchillo Flaqué
981269d594
pipeline(task): process ErrorFrame in same task and stop pipeline task
2024-08-15 15:22:40 -07:00
mattie ruth backman
848db985fc
bump anthropic in 3.10 requirements
2024-08-15 16:51:48 -04:00
mattie ruth backman
d5d8e31447
add cache tokens to metrics event
2024-08-15 16:51:48 -04:00
Aleix Conchillo Flaqué
66670a2370
Merge pull request #384 from pipecat-ai/aleix/enable-prompt-caching-frames
...
services(anthropic): allow setting enable prompt caching via frame
2024-08-15 13:26:39 -07:00
Aleix Conchillo Flaqué
5637f349c6
services(anthropic): allow setting enable prompt caching via frame
2024-08-15 12:43:29 -07:00
Aleix Conchillo Flaqué
93248e1d00
Merge pull request #382 from pipecat-ai/khk/anthropic-beta-caching
...
Support for Anthropic prompt caching beta
2024-08-15 12:34:54 -07:00
Kwindla Hultman Kramer
187769357f
update version number of anthropic dependency
2024-08-15 12:28:41 -07:00
Aleix Conchillo Flaqué
5be6422cc8
Revert "processors(rtvi): process options in the order they are defined"
...
This reverts commit 61ac83e2d9 .
2024-08-15 11:51:00 -07:00
Aleix Conchillo Flaqué
8670b2d994
utils: add match_endofsentence and use it in processors
2024-08-15 11:26:25 -07:00
Aleix Conchillo Flaqué
0bc6db428d
processors(rtvi): implement bot-started-speaking and bot-stopped-speaking
2024-08-15 11:05:10 -07:00
Aleix Conchillo Flaqué
67d565930e
services: send TTSStartFrame/TTSStopFrame when really needed
2024-08-15 11:05:10 -07:00
Aleix Conchillo Flaqué
b2a7ff6fd3
processors(rtvi): all transport messages should be urgent
2024-08-15 11:05:10 -07:00
Aleix Conchillo Flaqué
425a730d7c
transports(base_output): send urgent transport messages immediately
2024-08-15 11:05:10 -07:00
Aleix Conchillo Flaqué
84c5709722
frames: add urgent field to TransportMessageFrame
2024-08-15 11:05:10 -07:00
Kwindla Hultman Kramer
94deec01c9
okay, both files now
2024-08-15 00:57:10 -07:00
Kwindla Hultman Kramer
6e0dd4a779
Anthropic beta prompt caching
2024-08-15 00:54:43 -07:00
Kwindla Hultman Kramer
14bde340dd
Merge pull request #381 from pipecat-ai/khk/anthropic-fixup-0814.2
...
Fixup anthropic context set_messages
2024-08-14 23:34:31 -07:00
Kwindla Hultman Kramer
253765c611
and fixing anthropic demos
2024-08-14 23:14:20 -07:00
Kwindla Hultman Kramer
2b26d7182f
replaces 379
2024-08-14 22:40:09 -07:00
Aleix Conchillo Flaqué
61ac83e2d9
processors(rtvi): process options in the order they are defined
2024-08-14 22:26:49 -07:00
Aleix Conchillo Flaqué
d5c7b28cad
Merge pull request #380 from pipecat-ai/aleix/rtvi-0.1-context-aggregators-updates
...
processors(aggregators): multiple LLM aggregators updates
2024-08-14 20:43:50 -07:00
Aleix Conchillo Flaqué
959580a708
processors(logger): fix linting
2024-08-14 20:39:24 -07:00
Aleix Conchillo Flaqué
3a5cd17ea3
processors(aggregators): multiple LLM aggregators updates
2024-08-14 20:23:18 -07:00
Kwindla Hultman Kramer
b78981bb9d
Merge pull request #374 from pipecat-ai/khk/together
...
Together.ai service implementation with Llama 3.1 function calling
2024-08-14 17:29:07 -07:00
Kwindla Hultman Kramer
a6d90b0a00
linting fixes to anthropic.py
2024-08-14 17:27:00 -07:00
Aleix Conchillo Flaqué
67016492f2
transports(daily/helpers): add delete_room_from_url()
2024-08-14 17:14:02 -07:00
Aleix Conchillo Flaqué
2c38089527
processors(rtvi): handle incoming messages in a separate task
2024-08-14 15:34:02 -07:00
Kwindla Hultman Kramer
48f68ba6dc
Service for together.ai, including Llama 3.1 function calling support
2024-08-13 15:01:54 -07:00
Aleix Conchillo Flaqué
574df4ba3d
processors(rtvi): make sure to send bot-ready when transport is joined
2024-08-13 13:25:15 -07:00
Aleix Conchillo Flaqué
49ca16d125
pipeline(task): only send initial metrics frames if metrics enabled
2024-08-13 12:22:37 -07:00
Aleix Conchillo Flaqué
87525b085e
processors(rtvi): linting and make send_error() public
2024-08-13 11:21:51 -07:00
Aleix Conchillo Flaqué
6b53c6add3
transports(daily): DailyTransport default DailyParams
2024-08-13 11:13:18 -07:00
Kwindla Hultman Kramer
29ca1b7855
Anthropic tool use core Pipecat pieces refactored ( #369 )
...
* processors(rtvi): rtvi 0.1 message protocol
* added a single function call handler
* wip - function calling
* fixup
* fixup
* fixup
* processors(rtvi): no need for configure_on_start()
* processors(rtvi): add new option values if they haven't been set yet
* Add the model name to the LLM usage metrics
* wip - anthropic tool calling
* still wip - anthropic tool use and vision
* anthropic tools and vision working
* anthropic tool calling and vision
* Cartesia error handling
* Anthropic tool use core Pipecat pieces refactored as per plan
* aleix has good ideas
* Usage metrics for Anthropic LLMs
* fix function call result state not getting cleared bug
* Pass **kwargs through from AnthropicLLMService constructor
* about to tinker with anthropic
* added openai function calling
* openai function calling
* fixup
---------
Co-authored-by: Aleix Conchillo Flaqué <aleix@daily.co >
Co-authored-by: Chad Bailey <chadbailey@gmail.com >
Co-authored-by: mattie ruth backman <mattieruth@gmail.com >
Co-authored-by: chadbailey59 <chadbailey59@users.noreply.github.com >
2024-08-13 13:01:24 -05:00
Aleix Conchillo Flaqué
a42d0c9907
processors(rtvi): add interrupt_bot()
2024-08-13 09:22:43 -07:00
marcus-daily
8bc6ceaa3d
Fixing pep8
2024-08-13 15:32:23 +01:00
marcus-daily
0b8a1ab5d1
Handle describe-actions message
2024-08-13 15:32:23 +01:00
Brian Hill
358c287db2
chore: Enable build without git
2024-08-12 11:38:41 -04:00
Brian Hill
2e68453655
Merge pull request #371 from pipecat-ai/cbrianhill/allow-build-without-git
...
chore: Enable build without git
2024-08-12 10:15:55 -04:00
Brian Hill
89b8a9de7d
chore: Enable build without git
2024-08-12 09:36:25 -04:00
Aleix Conchillo Flaqué
c4c2058df9
processors(rtvi): handle frames pushed from outside in order
2024-08-11 23:09:11 -07:00
Aleix Conchillo Flaqué
0d85c0085f
processors(rtvi): interrupt the bot if a new config is received
2024-08-11 23:09:11 -07:00
Mattie Ruth
6fa8a8f84f
Merge pull request #365 from pipecat-ai/ruthless/metrics
2024-08-11 20:35:05 -04:00
mattie ruth backman
a97775bff3
Add the model name to the LLM usage metrics
2024-08-11 12:08:46 -04:00
Aleix Conchillo Flaqué
32640e054d
processors(rtvi): add new option values if they haven't been set yet
2024-08-10 21:25:39 -07:00
Aleix Conchillo Flaqué
aa42da5658
processors(rtvi): no need for configure_on_start()
2024-08-10 21:25:21 -07:00
nulyang
900a94a825
services(azure): fix TranscriptionFrame parameter type
2024-08-10 13:00:03 +08:00
Aleix Conchillo Flaqué
c37552de70
processors(rtvi): add support for action responses
2024-08-09 18:12:37 -07:00
Aleix Conchillo Flaqué
916b37926c
processors(rtvi): rtvi 0.1 message protocol
2024-08-09 17:24:38 -07:00
Aleix Conchillo Flaqué
2b76c3c15a
update macos-py3.10-requirements
2024-08-09 17:18:30 -07:00
Aleix Conchillo Flaqué
cedd7dde18
update linux-py3.10-requirements.txt
2024-08-09 17:14:46 -07:00
Lewis Wolfgang
d088608d8e
Merge pull request #340 from pipecat-ai/lewis/silero-vad-via-pip
...
Install Silero VAD via pip
2024-08-09 13:27:29 -04:00
Aleix Conchillo Flaqué
06ee29bb8b
Merge pull request #359 from pipecat-ai/aleix/twilio-elevenlabs-sample-rates
...
twilio and elevenlabs sample rates
2024-08-09 09:38:35 -07:00
Aleix Conchillo Flaqué
d255e954d6
services(elevenlabs): allow specifying output_format
2024-08-09 09:38:20 -07:00
Aleix Conchillo Flaqué
6a7ab6b8ac
serializers(twilio): allow specifying input and output sample rates
2024-08-09 09:37:51 -07:00
Aleix Conchillo Flaqué
45b18cc0b1
Merge pull request #358 from pipecat-ai/aleix/daily-create-room-exp-fixes
...
transports(daily): fixed create_room expirations
2024-08-09 09:37:01 -07:00
Aleix Conchillo Flaqué
0479431f0a
Merge pull request #357 from pipecat-ai/aleix/daily-on-participant-updated
...
transports(daily): added on_participant_updated event
2024-08-09 09:36:46 -07:00
Aleix Conchillo Flaqué
ec58dbd791
transports(daily): added on_participant_updated event
...
Fixes #353
2024-08-09 09:36:24 -07:00
Aleix Conchillo Flaqué
91de68aab3
Merge pull request #355 from pipecat-ai/aleix/usage-metrics-update
...
processors(base): add start_llm_usage_metrics and start_tts_usage_met…
2024-08-09 09:35:36 -07:00
Aleix Conchillo Flaqué
85efc30145
Merge pull request #356 from pipecat-ai/aleix/eleven_turbo_v2_5
...
services(elevenlabs): update default model to eleven_turbo_v2_5
2024-08-09 09:34:47 -07:00
Aleix Conchillo Flaqué
0032594f21
transports(daily): fixed create_room expirations
...
Fixes #348
2024-08-08 22:04:22 -07:00
Aleix Conchillo Flaqué
829fdc5679
services(elevenlabs): update default model to eleven_turbo_v2_5
...
Fixes #349
2024-08-08 21:38:18 -07:00
Aleix Conchillo Flaqué
22e176e329
processors(base): add start_llm_usage_metrics and start_tts_usage_metrics
2024-08-08 16:46:56 -07:00
Lewis Wolfgang
826a70a137
Merge pull request #354 from pipecat-ai/lewis/delete_room_by_name
...
Add delete_room_by_name to DailyRESTHelper
2024-08-08 17:09:21 -04:00
Lewis Wolfgang
dd0ea674af
Treat 404 (room not found) as a success for deletion
2024-08-08 16:57:58 -04:00
Lewis Wolfgang
a4761b8921
Add delete_room_by_name to DailyRESTHelper
2024-08-08 16:31:01 -04:00
chadbailey59
3958bb7903
Additional LLM and TTS metrics ( #343 )
...
* added llm and tts usage metrics
* Metrics debug logging
* cleanup
2024-08-07 08:55:51 -05:00
Aleix Conchillo Flaqué
83a037a7ce
Merge pull request #345 from pipecat-ai/aleix/base-output-render-time-fixes
...
transports(base_output): improve render sleep computation
2024-08-06 17:30:47 -07:00
Aleix Conchillo Flaqué
a3eb8337a6
Merge pull request #342 from pipecat-ai/aleix/base-output-transport-push-audio
...
transport(base_output): push audio downstream
2024-08-06 17:30:32 -07:00
Aleix Conchillo Flaqué
541072f8e0
transports(base_output): improve render sleep computation
2024-08-06 17:20:41 -07:00
Aleix Conchillo Flaqué
881248cbd6
transport(base_output): push audio downstream
2024-08-05 14:00:09 -07:00
Aleix Conchillo Flaqué
d4979f5e64
Merge pull request #337 from pipecat-ai/aleix/audio-video-sync-and-gstreamer
...
audio/video sync and gstreamer
2024-08-05 09:28:11 -07:00
Aleix Conchillo Flaqué
4133cd03bb
processors(gstreamer): add clock_sync property
2024-08-05 09:23:25 -07:00
Lewis Wolfgang
9f07c3ca27
Fly.io example: remove step to cache silero models.
...
No longer necessary.
2024-08-05 10:12:35 -04:00
Lewis Wolfgang
b20bacb9ed
Remove no longer needed code
2024-08-05 10:10:39 -04:00
Lewis Wolfgang
97cfbfee1d
Install silero via pip
2024-08-05 10:01:27 -04:00
Aleix Conchillo Flaqué
fa7c941792
examples(gstreamer): add new GStreamer examples
2024-08-04 12:29:36 -07:00
Aleix Conchillo Flaqué
4738879f32
processors(gstreamer): add new GStreamerPipelineSource
2024-08-04 12:29:34 -07:00
Aleix Conchillo Flaqué
d5d88f756a
transport(output): improve audio and image handling for video use cases
2024-08-04 12:29:08 -07:00
Aleix Conchillo Flaqué
65b136bf15
Merge pull request #334 from pipecat-ai/aleix/cleanup-examples-remove-requests
...
cleanup examples and remove requests
2024-08-01 22:05:01 -07:00
Aleix Conchillo Flaqué
bee0b238e4
examples(storytelling-chatbot): include package-lock.json
2024-08-01 18:23:30 -07:00
Aleix Conchillo Flaqué
c891168ffb
services: revert optional aiohttp.ClientSession
2024-08-01 18:22:56 -07:00
Aleix Conchillo Flaqué
6376c2f6aa
transport(websocket): fix cancel
2024-08-01 18:09:16 -07:00
Aleix Conchillo Flaqué
4d9b7cdd61
DailyRESTHelper now receives an aiohttp client session
2024-08-01 18:08:57 -07:00
Aleix Conchillo Flaqué
8263d1dd6f
update CHANGELOG with latest changes
2024-07-31 23:44:07 -07:00
Aleix Conchillo Flaqué
faf41c0b36
services: ignore yielded None values
2024-07-31 23:41:03 -07:00
Aleix Conchillo Flaqué
27a09c0b2c
cleanup examples and remove requests library
2024-07-31 23:39:51 -07:00
Aleix Conchillo Flaqué
3db7f6a284
Merge pull request #333 from pipecat-ai/aleix/allow-internal-http-sessions-rebased
...
services: allow internal http sessions if none is given
2024-07-31 21:57:00 -07:00
Aleix Conchillo Flaqué
3bfeb5b5ef
services: allow internal http sessions if none is given
2024-07-31 21:56:19 -07:00
Aleix Conchillo Flaqué
62a7a555b5
Merge pull request #330 from pipecat-ai/aleix/stop-and-cancel-are-different
...
EndFrame tries to end gracefully CancelFrame cancels tasks
2024-07-31 15:51:29 -07:00
Aleix Conchillo Flaqué
d60e99a043
examples(06a-image-sync): make sure frames go downstream
2024-07-30 11:41:58 -07:00
Aleix Conchillo Flaqué
77723b34c7
EndFrame tries to end gracefully CancelFrame cancels tasks
2024-07-30 11:41:19 -07:00
Aleix Conchillo Flaqué
c466d34a06
Merge pull request #328 from pipecat-ai/aleix/rtvi-towards-custom-pipelines
...
processors(rtvi): refactor to allow future custom pipelines
2024-07-29 15:07:57 -07:00
Aleix Conchillo Flaqué
f816897833
Merge pull request #327 from pipecat-ai/aleix/bot-start-stop-speaking-frames
...
bot start stop speaking frames
2024-07-27 17:21:23 -07:00
Aleix Conchillo Flaqué
c1e8a5e522
processors(rtvi): refactor to allow future custom pipelines
2024-07-26 10:26:36 -07:00
Aleix Conchillo Flaqué
76aca32f2e
transport(output): emit new bot start|stop speaking frames
2024-07-25 14:50:33 -07:00
Aleix Conchillo Flaqué
7e31b2a795
processors(user_idle): use user speaking instead of interruption frames
2024-07-25 14:47:56 -07:00
Aleix Conchillo Flaqué
028e38a86b
Merge pull request #326 from pipecat-ai/aleix/rtvi-bot-ready-fixes
...
rtvi: send bot-ready when pipeline is ready and first participant joins
2024-07-25 11:39:14 -07:00
Aleix Conchillo Flaqué
8cf7649855
processors(rtvi): send bot-ready when pipeline AND first participant joins
2024-07-25 11:25:51 -07:00
Aleix Conchillo Flaqué
64f5119b08
transports(base): allow registering event handlers without decorators
2024-07-25 11:24:24 -07:00
Aleix Conchillo Flaqué
4d606aefb3
update CHANGELOG
2024-07-25 09:57:01 -07:00
Ankur Duggal
4bafdaa04d
Deepgram Adjustments ( #313 )
2024-07-25 09:51:51 -07:00
Aleix Conchillo Flaqué
5afe1abf82
Merge pull request #323 from pipecat-ai/aleix/base-input-handle-incoming-interruptions
...
transports(inputs): handle start/stop interruption frames
2024-07-24 15:16:18 -07:00
Aleix Conchillo Flaqué
f066d50b98
transports(inputs): handle start/stop interruption frames
2024-07-24 15:15:09 -07:00
Aleix Conchillo Flaqué
91103e21cc
github(publish_test): download tags and depth to 100
2024-07-24 14:49:09 -07:00
Aleix Conchillo Flaqué
f44dabcd65
Merge pull request #322 from pipecat-ai/aleix/base-input-transport-system-frames-fix
...
transports(inputs): don't queue incoming system frames
2024-07-24 14:44:18 -07:00
Aleix Conchillo Flaqué
0fd2fca231
frames: StartFrame is now a control frame
2024-07-24 14:42:59 -07:00
Aleix Conchillo Flaqué
5bb64098e7
transports(inputs): don't queue incoming system frames
2024-07-24 14:35:00 -07:00
Aleix Conchillo Flaqué
3fc85e75e0
Merge pull request #320 from pipecat-ai/aleix/req-updates-072324
...
update project requirements and dependencies
2024-07-23 17:45:18 -07:00
Aleix Conchillo Flaqué
3f61ea16b7
update project requirements and dependencies
2024-07-23 17:35:47 -07:00
Aleix Conchillo Flaqué
4b393092b5
Merge pull request #319 from pipecat-ai/aleix/daily-completion-callbacks-0.0.39-fix
...
transports(daily): fix completion callbacks handling
2024-07-23 15:27:26 -07:00
Aleix Conchillo Flaqué
b583f5162b
transports(daily): fix completion callbacks handling
2024-07-23 15:25:59 -07:00
Aleix Conchillo Flaqué
060a22f395
github: only run publish_test manually
...
We need to run this manually to avoid test.pypi.org project size limits.
2024-07-23 14:19:24 -07:00
Aleix Conchillo Flaqué
d3e85355f1
Merge pull request #318 from pipecat-ai/aleix/prepare-0.0.38
...
update CHANGELOG for 0.0.38
2024-07-23 14:12:01 -07:00
Aleix Conchillo Flaqué
83e730b768
update CHANGELOG for 0.0.38
2024-07-23 14:10:10 -07:00