pipecat

Author	SHA1	Message	Date
Aleix Conchillo Flaqué	95e69597f3	update copyright keeping original year (2024)	2025-01-12 11:34:00 -08:00
Mark Backman	4d0c11fcab	Update examples to align with latest best practices	2025-01-10 15:07:06 -05:00
Mark Backman	86516d2415	PlayHTHttpTTSService fixes	2025-01-10 13:21:27 -05:00
Vanessa Pyne	5cd9dab14b	Merge pull request #949 from imsakg/main fix(examples): correct TTS service import and setup	2025-01-10 10:58:50 -06:00
Mark Backman	cb22de0d13	Update the Tavus example and comment about using the PERSONA_ID	2025-01-10 08:01:00 -05:00
Mert Sefa AKGUN	67af4e619b	style(examples): fix ruff formatting in Gemini text example Refactor `CartesiaTTSService` instantiation to comply with line length requirements from the ruff linter.	2025-01-10 12:32:53 +03:00
Mert Sefa AKGUN	21c274944e	Update examples/foundational/26d-gemini-multimodal-live-text.py Co-authored-by: Vanessa Pyne <vipyne@gmail.com>	2025-01-10 12:28:13 +03:00
Mert Sefa AKGUN	6664c492ac	feat(gemini): enable audio transcription in live text example Add options to transcribe both user and model audio during the GeminiMultimodalLiveLLMService setup in the 26d-gemini-multimodal-live-text.py example.	2025-01-09 15:38:33 +03:00
Mert Sefa AKGUN	7634058f97	fix(examples): correct TTS service import and setup - Update import to use CartesiaTTSService instead of CartesiaMultiLingualTTSService. - Adjust GeminiMultimodalLiveLLMService setup to use set_model_modalities with TEXT modality.	2025-01-09 02:19:08 +03:00
Mert Sefa AKGUN	40e9ee6d63	fix(examples): correct import order in Gemini example - Move `CartesiaMultiLingualTTSService` import to maintain proper order. - Reorganize `enum` import to adhere to styling standards.	2025-01-08 21:14:29 +03:00
Mert Sefa AKGUN	cdb909958c	feat(examples): add Gemini multimodal live text example Introduce a new example `26d-gemini-multimodal-live-text.py` to demonstrate the use of GeminiMultimodalLiveLLMService with text-only responses. This example sets up a pipeline for audio input via DailyTransport, processing with Gemini, and output via Cartesia TTS.	2025-01-08 19:29:35 +03:00
Mark Backman	3e1ec4a8ee	Added support for Google Journey TTS voices	2025-01-06 14:54:34 -05:00
Vaibhav159	b3b7a5f023	adding 2025 license	2025-01-06 22:10:46 +05:30
Vaibhav159	5138017b57	ruff changes	2025-01-06 22:07:59 +05:30
Vaibhav159	87670067d7	adding changelog	2025-01-06 22:03:11 +05:30
Vaibhav159	656cd2859e	Merge branch 'main' into vl_add_audio_and_chat_livekit_example	2025-01-06 21:57:43 +05:30
Mark Backman	4667624b60	Update copyright to 2025	2025-01-06 10:19:37 -05:00
Kwindla Hultman Kramer	ab3bcde5f7	Merge pull request #907 from pipecat-ai/khk/gemini-20241221 Gemini unary API fixes and natural conversation demo	2024-12-23 17:34:57 -08:00
Kwindla Hultman Kramer	1368d3db5c	revert elevenlabs example changes	2024-12-23 17:33:59 -08:00
Kwindla Hultman Kramer	ab5df1a236	feature complete gemini audio, transcription, and phrase endpointing demo	2024-12-22 11:19:02 -08:00
Kwindla Hultman Kramer	f5f0de00e4	still some cleanup to do	2024-12-21 23:04:00 -08:00
Kwindla Hultman Kramer	f3dd35bfd9	working but needs cleanup	2024-12-21 22:18:56 -08:00
Kwindla Hultman Kramer	53a5e63990	function calling dead-end	2024-12-21 18:10:25 -08:00
Kwindla Hultman Kramer	d435a6a6d6	fixes to audio buffer	2024-12-21 16:22:53 -08:00
Kwindla Hultman Kramer	59240c7b96	delay gemini multimodal live websocket connect	2024-12-21 14:36:37 -08:00
Mark Backman	dac4468ca1	Add Fish Audio TTS service	2024-12-21 12:42:56 -05:00
Aleix Conchillo Flaqué	4547609ffb	examples(01a): remove unused import	2024-12-19 17:49:27 -08:00
Mark Backman	6e0d3aef32	Merge pull request #860 from pipecat-ai/mb/transcription Add a TranscriptProcessor and new frames	2024-12-19 08:15:53 -05:00
Mark Backman	4f093f11db	Add CerebrasLLMService and foundational example	2024-12-19 08:10:31 -05:00
Mark Backman	1117c21483	Refactor TranscriptProcessor into user and assistant processors	2024-12-17 22:34:22 -05:00
Mark Backman	1f8a217cd1	Code review changes	2024-12-17 22:34:02 -05:00
Mark Backman	b5bd662fe1	Add changelog and rename examples	2024-12-17 22:33:39 -05:00
Mark Backman	dd2703317a	Add timestamp frames and include timestamps in the transcription event and frame	2024-12-17 22:31:15 -05:00
Mark Backman	55879bf365	Add TranscriptionProcessor	2024-12-17 22:31:15 -05:00
Aleix Conchillo Flaqué	17162258a2	fix ruff linter import organization	2024-12-17 11:28:58 -08:00
Mark Backman	ca086a856f	Add custom assistant context aggregator for Grok due to content requirement in function calling	2024-12-17 09:11:21 -05:00
Aleix Conchillo Flaqué	6d11911d83	Revert "no longer necessary to call super().process_frame(frame, direction)"	2024-12-12 17:03:40 -08:00
Aleix Conchillo Flaqué	3c3fd67d96	no longer necessary to call super().process_frame(frame, direction)	2024-12-12 13:03:41 -08:00
Vaibhav159	62fc95300b	adding livekit audio and chat version	2024-12-13 01:09:47 +05:30
Aleix Conchillo Flaqué	133e1aff6c	polly: renamed AWSTTSService to PollyTTSService	2024-12-11 17:56:43 -08:00
Mark Backman	027e360436	Fix demo numbering and prompt the bot to say hi in 26b	2024-12-11 11:36:38 -05:00
Kwindla Hultman Kramer	c219172266	Gemini Multimodal Live function calling example	2024-12-11 08:29:09 -08:00
Mark Backman	0d74531f36	Minor changes to demos	2024-12-11 11:23:59 -05:00
Mark Backman	8086a94e49	Renumber foundational demos	2024-12-11 10:56:51 -05:00
Kwindla Hultman Kramer	81895f4a5c	Gemini Multimodal Live API service	2024-12-11 07:38:23 -08:00
Aleix Conchillo Flaqué	b85072637f	examples(26-simli-layer): use room returned by configure()	2024-12-10 18:42:12 -08:00
Aleix Conchillo Flaqué	ffe1e023e7	Merge pull request #819 from pipecat-ai/aleix/fix-openaillmcontext-from-image-frame fix OpenAILLMContext from image frame	2024-12-10 18:39:55 -08:00
Aleix Conchillo Flaqué	c7ca0eea0f	Merge pull request #823 from pipecat-ai/aleix/fix-15a-switch-languages examples: fix 15a-switch-languages pipeline	2024-12-10 18:34:13 -08:00
Aleix Conchillo Flaqué	67e8252d76	examples: fix 15a-switch-languages pipeline	2024-12-10 18:27:49 -08:00
Aleix Conchillo Flaqué	775aa9493e	examples: fix 11-sound-effects	2024-12-10 18:25:43 -08:00

1 2 3 4 5 ...

324 Commits