pipecat

Author	SHA1	Message	Date
Mark Backman	21a729ae5d	Merge pull request #4146 from pipecat-ai/mb/gemini-live-local-vad	2026-03-26 17:48:21 -04:00
Mark Backman	fe0633ecd1	Add 14s to release evals	2026-03-26 12:27:27 -04:00
Mark Backman	503e5e9106	Fix Gemini Live local VAD by sending correct activity events to server When Gemini Live was configured with local VAD (server-side VAD disabled), the service was listening for the wrong frame types and not sending ActivityStart/ActivityEnd events to the server. Now it listens for VADUserStartedSpeakingFrame/VADUserStoppedSpeakingFrame and sends the appropriate activity signals when local VAD is in use. Also removes the unnecessary local SileroVADAnalyzer from server-side VAD examples and adds a new 26a example demonstrating local VAD configuration.	2026-03-25 18:00:13 -04:00
Mark Backman	adc003d6c7	Code review cleanup	2026-03-25 10:53:07 -04:00
Paul Kompfner	e0bc9c73c6	Add Anthropic interruptible example (07e) and register in release evals	2026-03-24 16:02:42 -04:00
Mark Backman	6eb988b729	Merge pull request #4092 from harshitajain165/harshita/smallest-tts-only Add Smallest AI TTS service integration	2026-03-24 11:54:34 -04:00
Mark Backman	51d28b4a9f	Code review fixes	2026-03-24 11:21:04 -04:00
kompfner	cf083b8411	Merge pull request #4078 from pipecat-ai/cb/gemini-updates Updates for Gemini Live	2026-03-24 11:18:00 -04:00
Mark Backman	aa0b49d69f	Code review fixes	2026-03-24 09:22:08 -04:00
dhruvladia-sarvam	349b8645f3	Merge branch 'main' into feat/sarvam-llm-integration	2026-03-24 16:34:12 +05:30
dhruvladia-sarvam	696196e30c	alignment with pr 4081	2026-03-24 16:29:58 +05:30
Mark Backman	d314e2831a	Simplify 26 name, update evals	2026-03-23 15:46:13 -04:00
Paul Kompfner	b1a8588209	feat: add 12- and 14d- image/video examples for OpenAI Responses	2026-03-18 15:39:06 -04:00
Paul Kompfner	45186cc4ce	feat: add OpenAI Responses API LLM service Add OpenAIResponsesLLMService using the Responses API, with a dedicated adapter that converts LLMContext messages to Responses API input items (system→developer, tool_calls→function_call, tool→function_call_output, multimodal content conversion, and tools schema flattening). - New adapter: open_ai_responses_adapter.py - New service: openai/responses/llm.py - Examples: 07-interruptible and 14-function-calling variants - 19 unit tests for adapter conversion logic - Eval entries for both examples	2026-03-18 11:45:23 -04:00
Mark Backman	671e9a6846	TTS service and example updates	2026-03-06 20:53:22 -05:00
Mark Backman	eeb8ed8588	Remove Hathora service integration Hathora is shutting down on March 5, 2026. Remove the STT/TTS services, examples, and related references.	2026-03-04 22:10:06 -05:00
Mark Backman	65f563ad34	Add debug logging to KrispVivaTurn analyze_end_of_turn and update example Move speech detection tracking outside the per-frame loop in append_audio since is_speech applies to the whole buffer. Add debug log in analyze_end_of_turn to show state and probability at decision time. Update the Krisp VIVA example to use Cartesia TTS and turn analyzer strategy.	2026-02-23 21:35:35 -05:00
Mark Backman	8b9da632d1	Add OpenAIRealtimeSTTService	2026-02-05 15:48:00 -05:00
Aleix Conchillo Flaqué	95689cc81c	KokoroTTSService: use kokoro-onnx instead of kokoro	2026-01-31 17:20:27 -08:00
Aleix Conchillo Flaqué	fee633cb92	scripts(evals): disable kokoro for now	2026-01-30 21:23:42 -08:00
Mark Backman	c92ec1552e	Add 22 foundational to release evals	2026-01-30 15:12:52 -05:00
Aleix Conchillo Flaqué	72ab329513	services(tss): add new KokoroTTSService	2026-01-30 09:39:01 -08:00
Aleix Conchillo Flaqué	875614ff7a	tts: add support for local PiperTTSService	2026-01-29 00:16:39 -08:00
Gökmen Görgen	45b7ec4e2c	re-enable `07zd-interruptible-aicoustics.py` in release evals.	2026-01-27 16:18:56 +01:00
Mark Backman	0b93c3f900	Add Camb TTS to release evals	2026-01-17 16:27:16 -05:00
Mike Seese	dc8ea615d9	add hathora to run-release-evals.py	2026-01-17 10:33:58 -08:00
Mark Backman	efd4432cfb	Renumber the 07 foundational examples	2026-01-15 10:26:17 -05:00
Aleix Conchillo Flaqué	248dac3a9d	Merge pull request #3420 from pipecat-ai/pk/fix-gemini-3-parallel-function-calls Fix parallel function calling with Gemini 3.	2026-01-13 14:40:33 -08:00
Mark Backman	41eef5efc4	Add 07j Gladia VAD foundational example, add to release evals	2026-01-13 11:36:15 -05:00
Paul Kompfner	6668712f7b	Add evals for parallel function calling	2026-01-13 11:03:38 -05:00
Aleix Conchillo Flaqué	5da1f86575	scripts: add 53-concurrent-llm-evaluation.py to release evals	2026-01-09 09:26:38 -08:00
Mark Backman	4d61c5d7b2	Deprecate support for vad_events in DeepgramSTTService	2026-01-08 20:32:30 -05:00
Mark Backman	3a7b489208	Add foundational 19c and add to evals	2026-01-08 13:00:45 -05:00
Mark Backman	98f70b775f	Update copyright date range to 2024-2026	2026-01-07 16:58:13 -05:00
Mark Backman	31907b90f0	Add 07 example variants to release evals	2025-12-31 09:11:00 -05:00
Mark Backman	845b4ad20e	Add 51 foundational to evals	2025-12-20 08:07:25 -05:00
Mark Backman	56c58f7302	Move Ultravox foundational example to 50, add to release evals	2025-12-18 13:38:12 -05:00
Aleix Conchillo Flaqué	d07b37b288	scripts(evals): more eval prompts improvements	2025-12-17 09:55:12 -08:00
Aleix Conchillo Flaqué	5b30f1b1ef	scripts(evals): improve prompts	2025-12-16 17:26:50 -08:00
Mark Backman	bd3bf9a00e	Inworld TTS services: Add websocket TTS class, add word-timestamp alignment	2025-12-16 13:47:24 -05:00
Aleix Conchillo Flaqué	21e346abe2	scripts(evals): improve eval prompts	2025-12-15 13:21:40 -08:00
Aleix Conchillo Flaqué	4f848e9631	Merge pull request #3227 from fixie-ai/mike/upstream Add Ultravox service	2025-12-13 18:29:02 -08:00
Mike Depinet	4b81be7acf	Add Ultravox service (#1 ) Adds support for using Ultravox Realtime as a speech-to-speech service. Also removes the deprecated Ultravox speech-to-text vllm model integration to avoid confusion.	2025-12-12 10:16:15 -08:00
Paul Kompfner	12979293ad	Add thinking examples to eval suite	2025-12-11 15:58:48 -05:00
vipyne	acba544e6f	pr notes for nvidia service name change	2025-12-01 22:41:17 -06:00
Aleix Conchillo Flaqué	51ba245e10	scripts(evals): fix EVAL_CONVERSATION/EVAL_WEATHER eval	2025-11-18 21:14:27 -08:00
Aleix Conchillo Flaqué	38aac44a1e	scripts(evals): 26c should be a camera eval	2025-11-07 11:30:41 -08:00
Mark Backman	7eb880c5e8	Add DeepgramHttpTTSService	2025-10-31 11:39:32 -04:00
Aleix Conchillo Flaqué	19f046a338	examples(foundational): add 12d-describe-image-moondream	2025-10-30 14:02:17 -07:00
Aleix Conchillo Flaqué	74fb6e7676	scripts(evals): improve eval prompting	2025-10-30 13:08:15 -07:00

1 2

82 Commits