Aleix Conchillo Flaqué
95e69597f3
update copyright keeping original year (2024)
2025-01-12 11:34:00 -08:00
Mark Backman
4d0c11fcab
Update examples to align with latest best practices
2025-01-10 15:07:06 -05:00
Mark Backman
86516d2415
PlayHTHttpTTSService fixes
2025-01-10 13:21:27 -05:00
Vanessa Pyne
5cd9dab14b
Merge pull request #949 from imsakg/main
...
fix(examples): correct TTS service import and setup
2025-01-10 10:58:50 -06:00
Mark Backman
cb22de0d13
Update the Tavus example and comment about using the PERSONA_ID
2025-01-10 08:01:00 -05:00
Mert Sefa AKGUN
67af4e619b
style(examples): fix ruff formatting in Gemini text example
...
Refactor `CartesiaTTSService` instantiation to comply with line
length requirements from the ruff linter.
2025-01-10 12:32:53 +03:00
Mert Sefa AKGUN
21c274944e
Update examples/foundational/26d-gemini-multimodal-live-text.py
...
Co-authored-by: Vanessa Pyne <vipyne@gmail.com >
2025-01-10 12:28:13 +03:00
Mert Sefa AKGUN
6664c492ac
feat(gemini): enable audio transcription in live text example
...
Add options to transcribe both user and model audio during the GeminiMultimodalLiveLLMService setup in the 26d-gemini-multimodal-live-text.py example.
2025-01-09 15:38:33 +03:00
Mert Sefa AKGUN
7634058f97
fix(examples): correct TTS service import and setup
...
- Update import to use CartesiaTTSService instead of CartesiaMultiLingualTTSService.
- Adjust GeminiMultimodalLiveLLMService setup to use set_model_modalities with TEXT modality.
2025-01-09 02:19:08 +03:00
Mert Sefa AKGUN
40e9ee6d63
fix(examples): correct import order in Gemini example
...
- Move `CartesiaMultiLingualTTSService` import to maintain proper order.
- Reorganize `enum` import to adhere to styling standards.
2025-01-08 21:14:29 +03:00
Mert Sefa AKGUN
cdb909958c
feat(examples): add Gemini multimodal live text example
...
Introduce a new example `26d-gemini-multimodal-live-text.py` to
demonstrate the use of GeminiMultimodalLiveLLMService with text-only
responses. This example sets up a pipeline for audio input via DailyTransport,
processing with Gemini, and output via Cartesia TTS.
2025-01-08 19:29:35 +03:00
Mark Backman
3e1ec4a8ee
Added support for Google Journey TTS voices
2025-01-06 14:54:34 -05:00
Vaibhav159
b3b7a5f023
adding 2025 license
2025-01-06 22:10:46 +05:30
Vaibhav159
5138017b57
ruff changes
2025-01-06 22:07:59 +05:30
Vaibhav159
87670067d7
adding changelog
2025-01-06 22:03:11 +05:30
Vaibhav159
656cd2859e
Merge branch 'main' into vl_add_audio_and_chat_livekit_example
2025-01-06 21:57:43 +05:30
Mark Backman
4667624b60
Update copyright to 2025
2025-01-06 10:19:37 -05:00
Kwindla Hultman Kramer
ab3bcde5f7
Merge pull request #907 from pipecat-ai/khk/gemini-20241221
...
Gemini unary API fixes and natural conversation demo
2024-12-23 17:34:57 -08:00
Kwindla Hultman Kramer
1368d3db5c
revert elevenlabs example changes
2024-12-23 17:33:59 -08:00
Kwindla Hultman Kramer
ab5df1a236
feature complete gemini audio, transcription, and phrase endpointing demo
2024-12-22 11:19:02 -08:00
Kwindla Hultman Kramer
f5f0de00e4
still some cleanup to do
2024-12-21 23:04:00 -08:00
Kwindla Hultman Kramer
f3dd35bfd9
working but needs cleanup
2024-12-21 22:18:56 -08:00
Kwindla Hultman Kramer
53a5e63990
function calling dead-end
2024-12-21 18:10:25 -08:00
Kwindla Hultman Kramer
d435a6a6d6
fixes to audio buffer
2024-12-21 16:22:53 -08:00
Kwindla Hultman Kramer
59240c7b96
delay gemini multimodal live websocket connect
2024-12-21 14:36:37 -08:00
Mark Backman
dac4468ca1
Add Fish Audio TTS service
2024-12-21 12:42:56 -05:00
Aleix Conchillo Flaqué
4547609ffb
examples(01a): remove unused import
2024-12-19 17:49:27 -08:00
Mark Backman
6e0d3aef32
Merge pull request #860 from pipecat-ai/mb/transcription
...
Add a TranscriptProcessor and new frames
2024-12-19 08:15:53 -05:00
Mark Backman
4f093f11db
Add CerebrasLLMService and foundational example
2024-12-19 08:10:31 -05:00
Mark Backman
1117c21483
Refactor TranscriptProcessor into user and assistant processors
2024-12-17 22:34:22 -05:00
Mark Backman
1f8a217cd1
Code review changes
2024-12-17 22:34:02 -05:00
Mark Backman
b5bd662fe1
Add changelog and rename examples
2024-12-17 22:33:39 -05:00
Mark Backman
dd2703317a
Add timestamp frames and include timestamps in the transcription event and frame
2024-12-17 22:31:15 -05:00
Mark Backman
55879bf365
Add TranscriptionProcessor
2024-12-17 22:31:15 -05:00
Aleix Conchillo Flaqué
17162258a2
fix ruff linter import organization
2024-12-17 11:28:58 -08:00
Mark Backman
ca086a856f
Add custom assistant context aggregator for Grok due to content requirement in function calling
2024-12-17 09:11:21 -05:00
Aleix Conchillo Flaqué
6d11911d83
Revert "no longer necessary to call super().process_frame(frame, direction)"
2024-12-12 17:03:40 -08:00
Aleix Conchillo Flaqué
3c3fd67d96
no longer necessary to call super().process_frame(frame, direction)
2024-12-12 13:03:41 -08:00
Vaibhav159
62fc95300b
adding livekit audio and chat version
2024-12-13 01:09:47 +05:30
Aleix Conchillo Flaqué
133e1aff6c
polly: renamed AWSTTSService to PollyTTSService
2024-12-11 17:56:43 -08:00
Mark Backman
027e360436
Fix demo numbering and prompt the bot to say hi in 26b
2024-12-11 11:36:38 -05:00
Kwindla Hultman Kramer
c219172266
Gemini Multimodal Live function calling example
2024-12-11 08:29:09 -08:00
Mark Backman
0d74531f36
Minor changes to demos
2024-12-11 11:23:59 -05:00
Mark Backman
8086a94e49
Renumber foundational demos
2024-12-11 10:56:51 -05:00
Kwindla Hultman Kramer
81895f4a5c
Gemini Multimodal Live API service
2024-12-11 07:38:23 -08:00
Aleix Conchillo Flaqué
b85072637f
examples(26-simli-layer): use room returned by configure()
2024-12-10 18:42:12 -08:00
Aleix Conchillo Flaqué
ffe1e023e7
Merge pull request #819 from pipecat-ai/aleix/fix-openaillmcontext-from-image-frame
...
fix OpenAILLMContext from image frame
2024-12-10 18:39:55 -08:00
Aleix Conchillo Flaqué
c7ca0eea0f
Merge pull request #823 from pipecat-ai/aleix/fix-15a-switch-languages
...
examples: fix 15a-switch-languages pipeline
2024-12-10 18:34:13 -08:00
Aleix Conchillo Flaqué
67e8252d76
examples: fix 15a-switch-languages pipeline
2024-12-10 18:27:49 -08:00
Aleix Conchillo Flaqué
775aa9493e
examples: fix 11-sound-effects
2024-12-10 18:25:43 -08:00