Files

Paul Kompfner 217f03b9cc Add additional functionality related to "thinking", for Google and Anthropic LLMs.

Thinking, sometimes called "extended thinking" or "reasoning", is an LLM process where the model takes some additional time before giving an answer. It's useful for complex tasks that may require some level of planning and structured, step-by-step reasoning. The model can output its thoughts (or thought summaries, depending on the model) in addition to the answer. The thoughts are usually pretty granular and not really suitable for being spoken out loud in a conversation, but can be useful for logging or prompt debugging.

Here's what's added:

1. New typed input parameters for Google and Anthropic LLMs that control the models' thinking behavior (like how much thinking to do, and whether to output thoughts or thought summaries).
2. New frames for representing thoughts output by LLMs.
3. A generic mechanism for associating extra LLM-specific data with a function call in context, used specifically to support Google's function-call-related "thought signatures", which are necessary to ensure thinking continuity between function calls in a chain (where the model thinks, makes a function call, thinks some more, etc.)
4. A generic mechanism for recording LLM thoughts to context, used specifically to support Anthropic, whose thought signatures are expected to appear alongside the text of the thoughts within assistant context messages.
5. An expansion of `TranscriptProcessor` to process LLM thoughts in addition to user and assistant utterances.

2025-12-08 09:29:01 -05:00

assets

examples(foundational): re-add 12-* but load image from file

2025-10-30 13:08:15 -07:00

01-say-one-thing-piper.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

01-say-one-thing-rime.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

01-say-one-thing.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

01a-local-audio.py

examples: use new services packages

2025-03-30 16:21:00 -07:00

01b-livekit-audio.py

examples(01b): use TTSSpeakFrame instead of TextFrame

2025-09-16 17:18:45 -04:00

01c-nvidia-riva-tts.py

examples: rename nvidia foundational examples

2025-12-01 22:41:17 -06:00

02-llm-say-one-thing.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

03-still-frame.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

03a-local-still-frame.py

examples: update camera_* with video_*

2025-04-24 17:14:18 -07:00

03b-still-frame-imagen.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

04-transports-small-webrtc.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

04a-transports-daily.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

04b-transports-livekit.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

05-sync-speech-and-image.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

05a-local-sync-speech-and-image.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

06-listen-and-respond.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

06a-image-sync.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07-interruptible-cartesia-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07-interruptible.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07a-interruptible-speechmatics-vad.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07a-interruptible-speechmatics.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07aa-interruptible-soniox.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07ab-interruptible-inworld-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07ac-interruptible-asyncai-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07ac-interruptible-asyncai.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07ad-interruptible-aicoustics.py

Merge pull request #3050 from ai-coustics/aic-vad-analyzer

2025-11-14 08:11:15 -05:00

07ae-interruptible-hume.py

rm TranscriptProcessor 2

2025-11-18 20:41:10 +01:00

07af-interruptible-gradium.py

Add the example.

2025-12-05 10:51:22 +01:00

07b-interruptible-langchain.py

Update comment in example 07b to reference LLMContext rather than OpenAILLMContext

2025-09-24 12:49:34 -04:00

07c-interruptible-deepgram-flux.py

Introduced a minimum confidence parameter in DeepgramFluxSTTService to avoid generating transcriptions below a defined threshold.

2025-11-17 09:54:30 -03:00

07c-interruptible-deepgram-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07c-interruptible-deepgram-sagemaker.py

Add 07c Deepgram SageMaker example

2025-11-24 16:41:01 -05:00

07c-interruptible-deepgram-vad.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07c-interruptible-deepgram.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07d-interruptible-elevenlabs-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07d-interruptible-elevenlabs.py

Merge pull request #3042 from pipecat-ai/pk/follow-up-inter-frame-spaces

2025-11-13 11:03:06 -05:00

07e-interruptible-playht-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07e-interruptible-playht.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07f-interruptible-azure-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07f-interruptible-azure.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07g-interruptible-openai.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07h-interruptible-openpipe.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07i-interruptible-xtts.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07j-interruptible-gladia.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07k-interruptible-lmnt.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07l-interruptible-groq.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07m-interruptible-aws-strands.py

examples: update Strands Agents with universal context and add evals

2025-09-23 11:37:57 -07:00

07m-interruptible-aws.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07n-interruptible-gemini-image.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07n-interruptible-gemini.py

Merge pull request #3042 from pipecat-ai/pk/follow-up-inter-frame-spaces

2025-11-13 11:03:06 -05:00

07n-interruptible-google-http.py

Add additional functionality related to "thinking", for Google and Anthropic LLMs.

2025-12-08 09:29:01 -05:00

07n-interruptible-google.py

Add additional functionality related to "thinking", for Google and Anthropic LLMs.

2025-12-08 09:29:01 -05:00

07o-interruptible-assemblyai.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07p-interruptible-krisp-viva.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07p-interruptible-krisp.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07q-interruptible-rime-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07q-interruptible-rime.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07r-interruptible-nvidia.py

examples: rename nvidia foundational examples

2025-12-01 22:41:17 -06:00

07s-interruptible-google-audio-in.py

Add additional functionality related to "thinking", for Google and Anthropic LLMs.

2025-12-08 09:29:01 -05:00

07t-interruptible-fish.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07u-interruptible-ultravox.py

Update quickstart and foundational examples to use smart-turn v3

2025-09-18 23:54:18 -04:00

07v-interruptible-neuphonic-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07v-interruptible-neuphonic.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07w-interruptible-fal.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07x-interruptible-local.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07y-interruptible-minimax.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07z-interruptible-sarvam-http.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

07z-interruptible-sarvam.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

08-custom-frame-processor.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

09-mirror.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

09a-local-mirror.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

10-wake-phrase.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

11-sound-effects.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

12-describe-image-openai.py

LLMContext: create_image_message/create_audio_message are now async

2025-11-18 09:04:40 -08:00

12a-describe-image-anthropic.py

LLMContext: create_image_message/create_audio_message are now async

2025-11-18 09:04:40 -08:00

12b-describe-image-aws.py

LLMContext: create_image_message/create_audio_message are now async

2025-11-18 09:04:40 -08:00

12c-describe-image-gemini-flash.py

LLMContext: create_image_message/create_audio_message are now async

2025-11-18 09:04:40 -08:00

12d-describe-image-moondream.py

examples(foundational): add 12d-describe-image-moondream

2025-10-30 14:02:17 -07:00

13-whisper-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13a-whisper-local.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13b-deepgram-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13c-gladia-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13c-gladia-translation.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13d-assemblyai-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13e-whisper-mlx.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13f-cartesia-transcription.py

examples(foundational/07): use CartesiaSTTService

2025-10-15 09:46:57 -07:00

13g-sambanova-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13h-speechmatics-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13i-soniox-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13j-azure-transcription.py

fix: 13 foundational examples now push frames from TranscriptionLogger

2025-09-10 10:40:10 -04:00

13k-elevenlabs-transcription.py

Add ElevenLabsSTTService

2025-09-23 10:27:31 -04:00

14-function-calling.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14a-function-calling-anthropic.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

14c-function-calling-together.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14d-function-calling-anthropic-video.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14d-function-calling-aws-video.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14d-function-calling-gemini-flash-video.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14d-function-calling-moondream-video.py

Update Moondream example so that Moondream service output makes it into the context, even if the TTS service is disabled

2025-11-17 15:16:19 -05:00

14d-function-calling-openai-video.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14e-function-calling-google.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

14f-function-calling-groq.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14g-function-calling-grok.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14h-function-calling-azure.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14i-function-calling-fireworks.py

examples(foundational): update 14i-fireworks with new serverless model

2025-12-05 15:31:29 -08:00

14j-function-calling-nvidia.py

examples: rename nvidia foundational examples

2025-12-01 22:41:17 -06:00

14k-function-calling-cerebras.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

14l-function-calling-deepseek.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

14m-function-calling-openrouter.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14n-function-calling-perplexity.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14o-function-calling-gemini-openai-format.py

Update quickstart and foundational examples to use smart-turn v3

2025-09-18 23:54:18 -04:00

14p-function-calling-gemini-vertex-ai.py

location should not be optional when using Google Vertex.

2025-10-10 12:58:45 -04:00

14q-function-calling-qwen.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14r-function-calling-aws.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14s-function-calling-sambanova.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14t-function-calling-direct.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14u-function-calling-ollama.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14v-function-calling-openai.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14w-function-calling-mistral.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

14x-function-calling-openpipe.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

15-switch-voices.py

Update more examples to use universal LLMContext. Specifically, update examples we didn't update before because they weren't using ToolsSchema for their tool definitions, which is a requirement for using LLMContext.

2025-09-23 12:41:35 -04:00

15a-switch-languages.py

2025-09-23 12:41:35 -04:00

16-gpu-container-local-bot.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

17-detect-user-idle.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

18-gstreamer-filesrc.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

18a-gstreamer-videotestsrc.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

19-openai-realtime-beta.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

19-openai-realtime.py

examples(19): linting

2025-12-01 18:30:34 -08:00

19a-azure-realtime-beta.py

transports: reorganize module

2025-09-02 17:31:39 -07:00

19a-azure-realtime.py

examples(19): linting

2025-12-01 18:30:34 -08:00

19b-openai-realtime-beta-text.py

examples(foundational): fix 19b-openai-realtime-beta-text

2025-09-12 11:03:32 -07:00

19b-openai-realtime-text.py

Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).

2025-10-29 15:43:51 -04:00

20a-persistent-context-openai.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

20b-persistent-context-openai-realtime-beta.py

Add OpenAIRealtimeLLMService, AzureRealtimeLLMService (#2596 )

2025-09-07 09:09:57 -04:00

20b-persistent-context-openai-realtime.py

Update OpenAIRealtimeLLMService to work with LLMContext and LLMContextAggregatorPair (cont'd).

2025-10-29 15:43:51 -04:00

20c-persistent-context-anthropic.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

20d-persistent-context-gemini.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

20e-persistent-context-aws-nova-sonic.py

Get rid of LLMContext.get_messages_for_persistent_storage().

2025-10-20 09:49:05 -04:00

21-tavus-transport.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

21a-tavus-video-service.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

22-natural-conversation.py

deprecate pipecat.sync package

2025-12-03 18:44:41 -08:00

22b-natural-conversation-proposal.py

deprecate pipecat.sync package

2025-12-03 18:44:41 -08:00

22c-natural-conversation-mixed-llms.py

deprecate pipecat.sync package

2025-12-03 18:44:41 -08:00

22d-natural-conversation-gemini-audio.py

deprecate pipecat.sync package

2025-12-03 18:44:41 -08:00

23-bot-background-sound.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

24-stt-mute-filter.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

25-google-audio-in.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

26-gemini-live.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

26a-gemini-live-transcription.py

Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.

2025-10-30 17:22:47 -04:00

26b-gemini-live-function-calling.py

GeminiLiveLLMService supports context-provided system instruction and tools

2025-11-03 10:30:46 -05:00

26c-gemini-live-video.py

Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.

2025-10-30 17:22:47 -04:00

26d-gemini-live-text.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

26e-gemini-live-google-search.py

Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.

2025-10-30 17:22:47 -04:00

26f-gemini-live-files-api.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

26g-gemini-live-groundingMetadata.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

26h-gemini-live-vertex-function-calling.py

Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.

2025-10-30 17:22:47 -04:00

26i-gemini-live-graceful-end.py

Deprecate expect_stripped_words option from LLMAssistantAggregatorParams, when used with the newer LLMAssistantAggregator, which now handles word spacing automatically.

2025-10-30 17:22:47 -04:00

27-simli-layer.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

28-transcription-processor.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

29-turn-tracking-observer.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

30-observer.py

Fix foundational 30 example to output TTSTextFrames synced to audio

2025-11-18 13:29:06 -05:00

31-heartbeats.py

Updating foundation examples to use SmallWebRTCTransport and pipecat-ai-small-webrtc-prebuilt (#1534 )

2025-04-11 19:44:16 -04:00

32-gemini-grounding-metadata.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

33-gemini-rag.py

2025-09-23 12:41:35 -04:00

34-audio-recording.py

Update examples, wherever possible, to use LLMContext and associated machinery instead of OpenAILLMContext and associated machinery.

2025-09-22 16:21:35 -04:00

35-pattern-pair-voice-switching.py

Augmented PatternPairAggregator so that matched patterns can...

2025-11-21 17:16:10 -05:00

36-user-email-gathering.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

37-mem0.py

Add support for universal LLMContext to Mem0MemoryService

2025-09-23 10:35:54 -04:00

38-smart-turn-fal.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

38a-smart-turn-local-coreml.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

38b-smart-turn-local.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

39-mcp-stdio.py

add mcp filter example and changelog

2025-12-05 10:56:59 -06:00

39a-mcp-streamable-http.py

rename MCP foundational examples

2025-11-19 10:34:13 -06:00

39b-mcp-streamable-http-gemini-live.py

rename MCP foundational examples

2025-11-19 10:34:13 -06:00

39c-multiple-mcp.py

add mcp filter example and changelog

2025-12-05 10:56:59 -06:00

40-aws-nova-sonic.py

Update AWSNovaSonicLLMService to work with LLMContext and LLMContextAggregatorPair

2025-10-20 09:49:00 -04:00

41a-text-only-webrtc.py

update imports to avoid deprecated module

2025-09-23 15:58:09 +00:00

41b-text-and-audio-webrtc.py

update imports to avoid deprecated module

2025-09-23 15:58:09 +00:00

42-interruption-config.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

43-heygen-transport.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

43a-heygen-video-service.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

44-voicemail-detection.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

45-before-and-after-events.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

46-video-processing.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

47-sentry-metrics.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

48-service-switcher.py

Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.

2025-11-13 10:02:33 -05:00

49-thinking-functions.py

Add additional functionality related to "thinking", for Google and Anthropic LLMs.

2025-12-08 09:29:01 -05:00

49-thinking.py

Add additional functionality related to "thinking", for Google and Anthropic LLMs.

2025-12-08 09:29:01 -05:00

README.md

Update release evals for OpenAI Realtime, Gemini Live Vertex; shorten 26 foundational names

2025-10-10 12:26:23 -04:00

README.md

Pipecat Foundational Examples

This directory contains examples showing how to build voice and multimodal agents with Pipecat. Each example demonstrates specific features, progressing from basic to advanced concepts.

Setup

Follow the README steps to get your local environment configured.

Run from root directory: Make sure you are running the steps from the root directory.

Using local audio?: The LocalAudioTransport requires a system dependency for portaudio. Install the dependency to use the transport.
Copy the env.example file and add API keys for services you plan to use:
```
cp env.example .env
# Edit .env with your API keys
```
Navigate to the examples directory if you aren't already there:
```
cd examples/foundational
```
Run any example:
```
uv run python 01-say-one-thing.py
```
Open the web interface at http://localhost:7860/client/ and click "Connect"

Running examples with other transports

Most examples support running with other transports, like Twilio or Daily.

Daily

You need to create a Daily account at https://dashboard.daily.co/u/signup. Once signed up, you can create your own room from the dashboard and set the environment variables DAILY_SAMPLE_ROOM_URL and DAILY_API_KEY. Alternatively, you can let the example create a room for you (still needs DAILY_API_KEY environment variable). Then, start any example with -t daily:

uv run 07-interruptible.py -t daily

Twilio

It is also possible to run the example through a Twilio phone number. You will need to setup a few things:

Install and run ngrok.

ngrok http 7860

Configure your Twilio phone number. One way is to setup a TwiML app and set the request URL to the ngrok URL from step (1). Then, set your phone number to use the new TwiML app.

Then, run the example with:

uv run 07-interruptible.py -t twilio -x NGROK_HOST_NAME

Examples by Feature

Basics

01-say-one-thing.py: Most basic bot that says one phrase and exits (Transport, TTS, Event handlers)
02-llm-say-one-thing.py: Bot generates a response with an LLM (LLM initialization)
03-still-frame.py: Displays a static image (Video transport, Image service)
04-transport.py: Different transport options (WebRTC, Daily, Livekit)

Conversational AI

07-interruptible.py: Basic voice assistant bot (STT, TTS, LLM, Interruptible speech)
10-wake-phrase.py: Bot activated by wake phrase (WakeCheckFilter)
22-natural-conversation.py: Smart turn detection (Multiple LLMs, Turn management)
38-smart-turn-fal.py: ML-based turn detection (Fal service, Local models)

Common Utilities

17-detect-user-idle.py: Handle inactive users (UserIdleProcessor)
24-stt-mute-filter.py: Selectively mute user input (STTMuteFilter)
28-transcription-processor.py: Record conversation text (TranscriptProcessor)
30-observer.py: Access frame data (Custom observers)
31-heartbeats.py: Detect idle pipelines (Pipeline monitoring)
34-audio-recording.py: Record conversation audio (Composite and track-level recording)

Advanced LLM Features

14-function-calling.py: Bot with tool usage (Function schemas, Tool registration)
20a-persistent-context-openai.py: Persistent conversation context (Memory management)
32-gemini-grounding-metadata.py: Web search capabilities (Google search integration)
33-gemini-rag.py: Retrieval-augmented generation (Data sources, Grounding)
37-mem0.py: Long-term agent memory (Mem0 service integration)

Media Handling

05-sync-speech-and-images.py: Synchronized narration with images (Custom processors, SyncParallelPipeline)
06a-image-sync.py: Dynamic image updates while speaking (Synchronized A/V pipelines)
09-mirror.py: Mirror user's audio and video (Custom frame processors)
11-sound-effects.py: Add sounds when bot speaks (Sound playback, Event synchronization)
23-bot-background-sound.py: Play background audio (SoundfileMixer)

Vision & Multimodal

12a-describe-video-gemini-flash.py: Bot describes user's video (Video input, Multimodal LLMs)
26c-gemini-live-video.py: Gemini with video input (Streaming video, Function calls)

Voice & Language

13-transcription.py: Speech transcription demo (STT providers, Real-time transcription)
15-switch-voices.py: Dynamic voice/language changing (ParallelPipelines, FunctionFilters)
25-google-audio-in.py: Gemini for speech recognition (Alternative transcription)
35-pattern-pair-voice-switching.py: Dynamic TTS voice switching (XML parsing, PatternPairAggregator)
36-user-email-gathering.py: Spelling mode for TTS (Confirmation patterns, XML tags)

Integration Examples

18-gstreamer-filesrc.py: GStreamer video streaming (Video processing)
19-openai-realtime-beta.py: OpenAI Speech-to-Speech (Direct S2S, Function calls)
21-tavus-layer-tavus-transport.py: Tavus digital twin (Avatar integration)
27-simli-layer.py: Simli avatar integration (Video synchronization)

Performance & Optimization

16-gpu-container-local-bot.py: GPU-accelerated local bot (Performance measurement)

Advanced Usage

Customizing Network Settings

uv run python <example-name> --host 0.0.0.0 --port 8080

Troubleshooting

No audio/video: Check browser permissions for microphone and camera
Connection errors: Verify API keys in .env file
Port conflicts: Use --port to change the port

For more examples, visit our the `pipecat-examples repository.