Mark Backman
21a729ae5d
Merge pull request #4146 from pipecat-ai/mb/gemini-live-local-vad
2026-03-26 17:48:21 -04:00
Mark Backman
fe0633ecd1
Add 14s to release evals
2026-03-26 12:27:27 -04:00
Mark Backman
503e5e9106
Fix Gemini Live local VAD by sending correct activity events to server
...
When Gemini Live was configured with local VAD (server-side VAD disabled),
the service was listening for the wrong frame types and not sending
ActivityStart/ActivityEnd events to the server. Now it listens for
VADUserStartedSpeakingFrame/VADUserStoppedSpeakingFrame and sends the
appropriate activity signals when local VAD is in use.
Also removes the unnecessary local SileroVADAnalyzer from server-side VAD
examples and adds a new 26a example demonstrating local VAD configuration.
2026-03-25 18:00:13 -04:00
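Note on 503e5e9106: the behavior described in the commit body above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the service's actual code: the two frame classes are local stand-ins that only borrow the names from the commit message, and `activity_signal`, its `local_vad` flag, and the returned strings are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


# Stand-in frame types; they mirror the names in the commit message only.
@dataclass
class VADUserStartedSpeakingFrame:
    pass


@dataclass
class VADUserStoppedSpeakingFrame:
    pass


def activity_signal(frame: object, local_vad: bool) -> Optional[str]:
    """Return the activity signal to send for a frame, or None.

    Hypothetical helper: signals are only produced when local VAD is in
    use, i.e. when server-side VAD is disabled.
    """
    if not local_vad:
        # With server-side VAD the server detects activity on its own.
        return None
    if isinstance(frame, VADUserStartedSpeakingFrame):
        return "activity_start"
    if isinstance(frame, VADUserStoppedSpeakingFrame):
        return "activity_end"
    # Any other frame type carries no activity information.
    return None
```

The point of the sketch is the gating: the same VAD frames are ignored when the server handles VAD and translated into start/end signals when it does not.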
Mark Backman
adc003d6c7
Code review cleanup
2026-03-25 10:53:07 -04:00
Paul Kompfner
e0bc9c73c6
Add Anthropic interruptible example (07e) and register in release evals
2026-03-24 16:02:42 -04:00
Mark Backman
6eb988b729
Merge pull request #4092 from harshitajain165/harshita/smallest-tts-only
...
Add Smallest AI TTS service integration
2026-03-24 11:54:34 -04:00
Mark Backman
51d28b4a9f
Code review fixes
2026-03-24 11:21:04 -04:00
kompfner
cf083b8411
Merge pull request #4078 from pipecat-ai/cb/gemini-updates
...
Updates for Gemini Live
2026-03-24 11:18:00 -04:00
Mark Backman
aa0b49d69f
Code review fixes
2026-03-24 09:22:08 -04:00
dhruvladia-sarvam
349b8645f3
Merge branch 'main' into feat/sarvam-llm-integration
2026-03-24 16:34:12 +05:30
dhruvladia-sarvam
696196e30c
alignment with pr 4081
2026-03-24 16:29:58 +05:30
Mark Backman
d314e2831a
Simplify 26 name, update evals
2026-03-23 15:46:13 -04:00
Paul Kompfner
b1a8588209
feat: add 12- and 14d- image/video examples for OpenAI Responses
2026-03-18 15:39:06 -04:00
Paul Kompfner
45186cc4ce
feat: add OpenAI Responses API LLM service
...
Add OpenAIResponsesLLMService using the Responses API, with a dedicated
adapter that converts LLMContext messages to Responses API input items
(system→developer, tool_calls→function_call, tool→function_call_output,
multimodal content conversion, and tools schema flattening).
- New adapter: open_ai_responses_adapter.py
- New service: openai/responses/llm.py
- Examples: 07-interruptible and 14-function-calling variants
- 19 unit tests for adapter conversion logic
- Eval entries for both examples
2026-03-18 11:45:23 -04:00
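Note on 45186cc4ce: the conversions listed in the commit body above (system→developer, tool_calls→function_call, tool→function_call_output, tools schema flattening) might look something like the sketch below. It assumes Chat Completions-style message dicts as input; `to_responses_input` and `flatten_tool` are hypothetical names, and this is not the code in open_ai_responses_adapter.py. Multimodal content conversion is left out.

```python
from typing import Any


def to_responses_input(messages: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Convert chat-style messages into Responses API input items (sketch)."""
    items: list[dict[str, Any]] = []
    for msg in messages:
        role = msg.get("role")
        if role == "tool":
            # tool -> function_call_output, linked to the call by call_id
            items.append(
                {
                    "type": "function_call_output",
                    "call_id": msg["tool_call_id"],
                    "output": str(msg.get("content", "")),
                }
            )
            continue
        content = msg.get("content")
        if content:
            # system -> developer; other roles pass through unchanged
            items.append(
                {"role": "developer" if role == "system" else role, "content": content}
            )
        for call in msg.get("tool_calls") or []:
            # tool_calls -> one function_call item per call
            fn = call["function"]
            items.append(
                {
                    "type": "function_call",
                    "call_id": call["id"],
                    "name": fn["name"],
                    "arguments": fn["arguments"],
                }
            )
    return items


def flatten_tool(tool: dict[str, Any]) -> dict[str, Any]:
    """Lift the nested "function" object to the top level of the tool schema."""
    return {"type": "function", **tool["function"]}
```

A single assistant message that carries tool calls expands into one item per call, which is why the function builds a flat list instead of mapping messages one to one.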
Mark Backman
786279f143
Remove unused imports, 2026-03-07
2026-03-09 12:44:47 -04:00
Mark Backman
cd28c82de3
Update examples to use the class Settings alias
2026-03-07 09:15:24 -05:00
Mark Backman
671e9a6846
TTS service and example updates
2026-03-06 20:53:22 -05:00
Aleix Conchillo Flaqué
593b75bc8b
Update foundational examples to use "user" role
...
Use system_instruction on LLM service constructors instead of adding
system messages to LLMContext. Messages added to context now use
"user" role.
2026-03-06 09:53:33 -08:00
Mark Backman
ab37185208
Update run_eval_pipeline with the latest settings, system_instruction patterns
2026-03-06 08:32:59 -05:00
Mark Backman
62554a2390
Update examples
2026-03-06 08:30:00 -05:00
Aleix Conchillo Flaqué
3199168d3e
scripts(evals): use context.add_message()
2026-03-05 19:14:06 -08:00
Aleix Conchillo Flaqué
1221e2dd76
Fix Daily transport log level and eval script import
...
Change participant_updated log from debug to trace (too noisy).
Fix deepgram LiveOptions import in eval script.
2026-03-05 16:37:02 -08:00
Mark Backman
eeb8ed8588
Remove Hathora service integration
...
Hathora is shutting down on March 5, 2026. Remove the STT/TTS services,
examples, and related references.
2026-03-04 22:10:06 -05:00
Mark Backman
65f563ad34
Add debug logging to KrispVivaTurn analyze_end_of_turn and update example
...
Move speech detection tracking outside the per-frame loop in append_audio
since is_speech applies to the whole buffer. Add debug log in
analyze_end_of_turn to show state and probability at decision time. Update
the Krisp VIVA example to use Cartesia TTS and turn analyzer strategy.
2026-02-23 21:35:35 -05:00
Mark Backman
8b9da632d1
Add OpenAIRealtimeSTTService
2026-02-05 15:48:00 -05:00
Aleix Conchillo Flaqué
95689cc81c
KokoroTTSService: use kokoro-onnx instead of kokoro
2026-01-31 17:20:27 -08:00
Aleix Conchillo Flaqué
fee633cb92
scripts(evals): disable kokoro for now
2026-01-30 21:23:42 -08:00
Mark Backman
c92ec1552e
Add 22 foundational to release evals
2026-01-30 15:12:52 -05:00
Aleix Conchillo Flaqué
c9310789dc
scripts(evals): use new vad_analyzer from LLMUserAggregator
2026-01-30 10:57:17 -08:00
Aleix Conchillo Flaqué
b93e12d701
scripts(evals): disable RTVI
2026-01-30 10:52:38 -08:00
Aleix Conchillo Flaqué
72ab329513
services(tts): add new KokoroTTSService
2026-01-30 09:39:01 -08:00
Aleix Conchillo Flaqué
875614ff7a
tts: add support for local PiperTTSService
2026-01-29 00:16:39 -08:00
Aleix Conchillo Flaqué
f3ef488925
rename DAILY_SAMPLE_ROOM_URL to DAILY_ROOM_URL
2026-01-28 10:05:27 -08:00
Gökmen Görgen
45b7ec4e2c
re-enable 07zd-interruptible-aicoustics.py in release evals.
2026-01-27 16:18:56 +01:00
Aleix Conchillo Flaqué
aed44c863a
scripts(eval): give examples to numerical word answers
...
Some models need extra help.
2026-01-19 14:37:00 -08:00
Mark Backman
cddd6d5b0a
Merge pull request #3492 from pipecat-ai/mb/remove-unused-imports
...
Remove unused imports
2026-01-19 14:07:16 -05:00
Mark Backman
11cf891ac8
Manual updates for unused imports
2026-01-19 14:03:22 -05:00
Mark Backman
0b93c3f900
Add Camb TTS to release evals
2026-01-17 16:27:16 -05:00
Mike Seese
dc8ea615d9
add hathora to run-release-evals.py
2026-01-17 10:33:58 -08:00
Mark Backman
efd4432cfb
Renumber the 07 foundational examples
2026-01-15 10:26:17 -05:00
Aleix Conchillo Flaqué
248dac3a9d
Merge pull request #3420 from pipecat-ai/pk/fix-gemini-3-parallel-function-calls
...
Fix parallel function calling with Gemini 3.
2026-01-13 14:40:33 -08:00
Mark Backman
41eef5efc4
Add 07j Gladia VAD foundational example, add to release evals
2026-01-13 11:36:15 -05:00
Paul Kompfner
6668712f7b
Add evals for parallel function calling
2026-01-13 11:03:38 -05:00
Aleix Conchillo Flaqué
5da1f86575
scripts: add 53-concurrent-llm-evaluation.py to release evals
2026-01-09 09:26:38 -08:00
Garegin Harutyunyan
16819a5caa
Krisp VIVA SDK Filter and Turn support. ( #3261 )
...
* Krisp VIVA SDK Filter and Turn support.
* Reverted the krisp_filter.py as it's already deprecated.
* enabled test with krisp_audio mock.
* More review comment fixes.
reverted the state logic in viva filter to be similar to the existing impl on main branch.
Fixed tests, ruff, etc.
* More review comments for Turn detection.
removed integration tests.
* Moved the SDK init/deinit into start/stop
2026-01-09 08:15:08 -05:00
Mark Backman
4d61c5d7b2
Deprecate support for vad_events in DeepgramSTTService
2026-01-08 20:32:30 -05:00
Mark Backman
3a7b489208
Add foundational 19c and add to evals
2026-01-08 13:00:45 -05:00
Mark Backman
98f70b775f
Update copyright date range to 2024-2026
2026-01-07 16:58:13 -05:00
Mark Backman
31907b90f0
Add 07 example variants to release evals
2025-12-31 09:11:00 -05:00
Mark Backman
845b4ad20e
Add 51 foundational to evals
2025-12-20 08:07:25 -05:00