vipyne
63a65627a2
Riva Service: add magpie-tts-multilingual model
2025-05-02 15:33:10 -05:00
Mark Backman
02c07755b0
Add Changelog entry for PR 1707
2025-05-02 15:33:10 -05:00
Matt Kim
15cbd18acc
[Rime] Add phonemizeBetweenBrackets and pauseBetweenBrackets to RimeTTSService (ws)
...
There is a fix incoming in
2025-05-02 15:33:10 -05:00
Kwindla Hultman Kramer
93c40b87dc
small groq updates
2025-05-02 15:33:10 -05:00
Filipi da Silva Fuchter
94c4169407
Merge pull request #1717 from pipecat-ai/local_smart_turn_torch
...
Local smart turn torch
2025-05-02 15:53:30 -03:00
Filipi Fuchter
cb8a551db8
Mentioning the new LocalSmartTurnAnalyzer in the changelog.
2025-05-02 14:32:18 -03:00
Filipi Fuchter
779f09af70
Fixing lint.
2025-05-02 14:22:38 -03:00
Filipi Fuchter
19dc0f2bfb
New example using the local smart turn
2025-05-02 14:21:42 -03:00
Filipi Fuchter
f0709e22ba
Creating a local smart turn using torch.
2025-05-02 14:21:29 -03:00
Mark Backman
8250736f5e
Merge pull request #1708 from pipecat-ai/mb/gemini-user-context
...
Push GeminiMultimodalLiveLLMService TranscriptionFrame Upstream, remo…
2025-05-02 13:10:27 -04:00
Mark Backman
83348a9f93
Merge pull request #1714 from pipecat-ai/mb/fix-gemini-text-modality
...
Restore TEXT modalities support to GeminiMultimodalLiveLLMService
2025-05-02 10:41:05 -04:00
Mark Backman
96d40903a9
Only send TTSStoppedFrame from Gemini when in AUDIO mode, only send one LLMFullResponseEndFrame
2025-05-02 10:18:53 -04:00
Aleix Conchillo Flaqué
2560811805
Merge pull request #1697 from pipecat-ai/aleix/daily-custom-audio-tracks
...
add support for multiple transport destinations
2025-05-02 06:34:09 -07:00
Mark Backman
2b8c44c008
Merge pull request #1710 from pipecat-ai/mb/openai-context-aggregation
...
fix: OpenAIRealtimeBetaLLMService writes two assistant messages to th…
2025-05-02 07:43:35 -04:00
Mark Backman
38e2d37674
Restore TEXT modalities support to GeminiMultimodalLiveLLMService
2025-05-02 07:36:12 -04:00
Vanessa Pyne
6278561f88
Merge pull request #1709 from pipecat-ai/vp-fix-fastpitch-params-update
...
Riva TTS: update FastPitch params
2025-05-01 21:23:10 -05:00
Aleix Conchillo Flaqué
750e79c1ce
DailyParams: rename to camera/microphone_out_enabled
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
71eb2963c5
examples: added daily-custom-tracks
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
f44e2c86ea
BaseOutputTransport: compute sample_rate and audio_chunk_size in main class
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
afe1f0df8c
DailyTransport: make sure we can write audio frames to destination
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
458fddfb48
update CHANGELOG with new Daily and Transport features
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
8d915c5ccb
DailyParams: allow enabling/disabling camera/microphone tracks
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
304153dd03
TTSService: set transport destination to all TTS frames
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
a6781b7352
rename destination to transport_destination
2025-05-01 19:17:14 -07:00
Aleix Conchillo Flaqué
5ad0058303
update CHANGELOG with frame source/destination support
2025-05-01 19:11:13 -07:00
Aleix Conchillo Flaqué
75c039de33
examples: add daily-multi-translation
2025-05-01 19:11:13 -07:00
Aleix Conchillo Flaqué
74e3c3677e
DailyTransport: fix audio/video renderers registration
2025-05-01 18:58:44 -07:00
Aleix Conchillo Flaqué
dc20327f10
DailyTransport: register audio destination and use custom tracks
2025-05-01 18:58:44 -07:00
Aleix Conchillo Flaqué
e738affd29
BaseOutputTransport: allow sending audio/video to multiple destinations
2025-05-01 18:58:44 -07:00
Aleix Conchillo Flaqué
ef3d732607
DailyTransport: allow capturing multiple simultaneous audio/video sources
2025-05-01 18:58:44 -07:00
Aleix Conchillo Flaqué
6d63cff1bf
DailyTransport: custom audio tracks support
2025-05-01 18:58:44 -07:00
Aleix Conchillo Flaqué
12f42605a1
pyproject: update daily-python to 0.18.0
2025-05-01 18:58:44 -07:00
Mark Backman
76d198151c
Push GeminiMultimodalLiveLLMService TranscriptionFrame Upstream, remove direct context addition
2025-05-01 15:41:04 -04:00
Mark Backman
6a907058de
fix: OpenAIRealtimeBetaLLMService writes two assistant messages to the context
2025-05-01 15:37:39 -04:00
vipyne
6e1f531f64
Riva TTS: update FastPitch params
...
91138c3f66 (diff-ece228577b1d233ce600a948243f90cece53e3a9b89554a0b27a48bc4d6e0fdfR45)
2025-05-01 11:14:41 -05:00
Mark Backman
a6a4d3d71f
Merge pull request #1706 from rimelabs/matt/rime/update_url
...
[Rime] - Update url for Websockets API
2025-04-30 19:14:04 -04:00
Mark Backman
c52de0f5de
Merge pull request #1696 from pipecat-ai/mb/fix-gemini-live-context
...
Fix: GeminiMultimodalLiveLLMService was appending tokens to the context
2025-04-30 19:12:06 -04:00
Mark Backman
a1e1255f16
Strip newlines from generated user transcript
2025-04-30 18:27:46 -04:00
Mark Backman
c4f758725e
Ignore TranscriptionFrames too
2025-04-30 18:22:43 -04:00
Aleix Conchillo Flaqué
7bc9a78ce6
update CHANGELOG with RTVIObserverParams
2025-04-30 15:13:14 -07:00
Aleix Conchillo Flaqué
f8be71b32c
Merge pull request #1688 from pipecat-ai/aleix/add-rtvi-observer-params
...
RTVIObserver: add RTVIObserverParams to configure what to send
2025-04-30 15:11:18 -07:00
Aleix Conchillo Flaqué
957fa5546d
RTVIObserver: add RTVIObserverParams to configure what to send
2025-04-30 15:09:02 -07:00
Aleix Conchillo Flaqué
039cb8fcae
Merge pull request #1690 from pipecat-ai/aleix/rtvi-function-call-single-param
...
RTVIProcessor: use single FunctionCallParams
2025-04-30 15:04:05 -07:00
Mark Backman
8e05f2f1a1
Merge pull request #1702 from pipecat-ai/mb/stt-mute-transcription-frames
...
Add InterimTranscriptionFrame and TranscriptionFrame to STTMuteFilter…
2025-04-30 17:54:24 -04:00
Matt Kim
8467aa1ed3
[Rime] - Update url for Websockets API
...
Rime has migrated its WebSockets API to the base URL `user.rime.ai`, along with all other TTS endpoints.
See the [docs](https://docs.rime.ai/api-reference/endpoint/websockets).
`users-ws.rime.ai` is deprecated and will not reflect upgrades to the Rime WebSockets API.
2025-04-30 14:20:13 -07:00
Mark Backman
9c5878af3d
OpenAI Realtime and Gemini Live push LLMTextFrame again, overwrite the assistant context aggregator for LLMTextFrame
2025-04-30 17:18:20 -04:00
Mark Backman
ef29800fe9
Update the changelog
2025-04-30 16:28:17 -04:00
Mark Backman
7e09933070
OpenAI Realtime should push TTSTextFrame only
2025-04-30 16:28:17 -04:00
Mark Backman
82a9d7f992
Gemini Multimodal Live to push TTSTextFrame only
2025-04-30 16:28:17 -04:00
Mark Backman
facbebb15f
Transcribe user audio in 26b
2025-04-30 16:28:16 -04:00