Files

Vanessa Pyne 18c0374126 Merge pull request #1785 from pipecat-ai/vp-small-filenmae-change

39-aws-nova-sonic.py -> 40-aws-nova-sonic.py

2025-05-09 12:19:09 -05:00

assets

added an example using using Gemini's large context window for RAG

2025-02-06 12:49:29 -08:00

01-say-one-thing-piper.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

01-say-one-thing-rime.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

01-say-one-thing.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

01a-local-audio.py

examples: use new services packages

2025-03-30 16:21:00 -07:00

01b-livekit-audio.py

examples: use new services packages

2025-03-30 16:21:00 -07:00

01c-fastpitch.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

02-llm-say-one-thing.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

03-still-frame.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

03a-local-still-frame.py

examples: update camera_* with video_*

2025-04-24 17:14:18 -07:00

03b-still-frame-imagen.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

04-transports-small-webrtc.py

Update examples with new transport param names

2025-04-25 15:18:56 -05:00

04a-transports-daily.py

Add transports examples to foundational examples

2025-04-25 08:20:04 -04:00

04b-transports-livekit.py

Add transports examples to foundational examples

2025-04-25 08:20:04 -04:00

05-sync-speech-and-image.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

05a-local-sync-speech-and-image.py

examples: update camera_* with video_*

2025-04-24 17:14:18 -07:00

06-listen-and-respond.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

06a-image-sync.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07-interruptible.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07a-interruptible-vad.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07b-interruptible-langchain.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07c-interruptible-deepgram-vad.py

Update Deepgram TTS default voice to Aura 2 voice

2025-05-06 11:29:32 -04:00

07c-interruptible-deepgram.py

Update Deepgram TTS default voice to Aura 2 voice

2025-05-06 11:29:32 -04:00

07d-interruptible-elevenlabs-http.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07d-interruptible-elevenlabs.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07e-interruptible-playht-http.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07e-interruptible-playht.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07f-interruptible-azure.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07g-interruptible-openai.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07h-interruptible-openpipe.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07i-interruptible-xtts.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07j-interruptible-gladia.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07k-interruptible-lmnt.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07l-interruptible-groq.py

small groq updates

2025-05-02 15:33:10 -05:00

07m-interruptible-aws.py

AWSBedrockLLMService: fix function calling

2025-05-07 09:26:26 -07:00

07n-interruptible-google.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07o-interruptible-assemblyai.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07p-interruptible-krisp.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07q-interruptible-rime-http.py

support for rime arcana model

2025-05-05 10:50:46 -04:00

07q-interruptible-rime.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07r-interruptible-riva-nim.py

Riva: remove deprecated lines in example

2025-05-02 15:33:10 -05:00

07s-interruptible-google-audio-in.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07t-interruptible-fish.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07u-interruptible-ultravox.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07v-interruptible-neuphonic-http.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07v-interruptible-neuphonic.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07w-interruptible-fal.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

07x-interruptible-local.py

examples: remove vad_enabled=True

2025-04-24 17:14:18 -07:00

08-bots-arguing.py

Updating foundation examples to use SmallWebRTCTransport and pipecat-ai-small-webrtc-prebuilt (#1534 )

2025-04-11 19:44:16 -04:00

09-mirror.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

09a-local-mirror.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

10-wake-phrase.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

11-sound-effects.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

12-describe-video.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

12a-describe-video-gemini-flash.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

12b-describe-video-gpt-4o.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

12c-describe-video-anthropic.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

13-whisper-transcription.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

13a-whisper-local.py

examples: remove vad_enabled=True

2025-04-24 17:14:18 -07:00

13b-deepgram-transcription.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

13c-gladia-transcription.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

13c-gladia-translation.py

Demo fixes

2025-05-02 20:58:10 -04:00

13d-assemblyai-transcription.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

13e-whisper-mlx.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

14-function-calling.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14a-function-calling-anthropic.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14b-function-calling-anthropic-video.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14c-function-calling-together.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14d-function-calling-video.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14e-function-calling-gemini.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14f-function-calling-groq.py

small groq updates

2025-05-02 15:33:10 -05:00

14g-function-calling-grok.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14h-function-calling-azure.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14i-function-calling-fireworks.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14j-function-calling-nim.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14k-function-calling-cerebras.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14l-function-calling-deepseek.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14m-function-calling-openrouter.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14n-function-calling-perplexity.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

14o-function-calling-gemini-openai-format.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14p-function-calling-gemini-vertex-ai.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14q-function-calling-qwen.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

14r-function-calling-aws.py

AWSBedrockLLMService: fix function calling

2025-05-07 09:26:26 -07:00

15-switch-voices.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

15a-switch-languages.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

16-gpu-container-local-bot.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

17-detect-user-idle.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

18-gstreamer-filesrc.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

18a-gstreamer-videotestsrc.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

19-openai-realtime-beta.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

19a-azure-realtime-beta.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

20a-persistent-context-openai.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

20b-persistent-context-openai-realtime.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

20c-persistent-context-anthropic.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

20d-persistent-context-gemini.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

20e-persistent-context-aws-nova-sonic.py

[WIP] AWS Nova Sonic service - update persistent-context example to better avoid saving "transitional", as opposed to meaningful, context messages

2025-05-07 13:52:51 -04:00

21-tavus-layer.py

examples: remove vad_enabled=True

2025-04-24 17:14:18 -07:00

22-natural-conversation.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

22b-natural-conversation-proposal.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

22c-natural-conversation-mixed-llms.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

22d-natural-conversation-gemini-audio.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

23-bot-background-sound-daily.py

examples: remove vad_enabled=True

2025-04-24 17:14:18 -07:00

23-bot-background-sound-p2p.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

24-stt-mute-filter.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

25-google-audio-in.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

26-gemini-multimodal-live.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

26a-gemini-multimodal-live-transcription.py

Push GeminiMultimodalLiveLLMService TranscriptionFrame Upstream, remove direct context addition

2025-05-01 15:41:04 -04:00

26b-gemini-multimodal-live-function-calling.py

Transcribe user audio in 26b

2025-04-30 16:28:16 -04:00

26c-gemini-multimodal-live-video.py

examples: remove vad_enabled=True

2025-04-24 17:14:18 -07:00

26d-gemini-multimodal-live-text.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

26e-gemini-multimodal-google-search.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

27-simli-layer.py

Fix: SimliVideoService was continuously emitting audio, preventing BotStoppedSpeakingFrame from being sent

2025-05-02 16:32:42 -04:00

28-transcription-processor.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

30-observer.py

Add DebugLogObserver

2025-05-07 17:21:08 -04:00

31-heartbeats.py

Updating foundation examples to use SmallWebRTCTransport and pipecat-ai-small-webrtc-prebuilt (#1534 )

2025-04-11 19:44:16 -04:00

32-gemini-grounding-metadata.py

GoogleLLMService: deprecate google-generativeai

2025-05-09 09:14:43 -07:00

33-gemini-rag.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

34-audio-recording.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

35-pattern-pair-voice-switching.py

Demo fixes

2025-05-02 20:58:10 -04:00

36-user-email-gathering.py

examples: update with single FunctionCallParams parameter

2025-04-25 13:34:05 -07:00

37-mem0.py

formatting

2025-04-28 22:57:18 +05:30

38-smart-turn-fal.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

38a-smart-turn-local-coreml.py

examples: allow setting custom program arguments

2025-04-24 17:14:18 -07:00

38b-smart-turn-local.py

New example using the local smart turn

2025-05-02 14:21:42 -03:00

39-mcp-stdio.py

Update examples with new transport param names

2025-04-25 15:18:56 -05:00

39a-mcp-run-sse.py

Update examples with new transport param names

2025-04-25 15:18:56 -05:00

39b-multiple-mcp.py

Update examples with new transport param names

2025-04-25 15:18:56 -05:00

40-aws-nova-sonic.py

39-aws-nova-sonic.py -> 40-aws-nova-sonic.py

2025-05-09 08:30:59 -05:00

daily_runner.py

Updating foundation examples to use SmallWebRTCTransport and pipecat-ai-small-webrtc-prebuilt (#1534 )

2025-04-11 19:44:16 -04:00

README.md

Update foundational README with ToC

2025-04-24 18:04:36 -04:00

requirements.txt

Add deepgram and cartesia to foundational example requirements to make quickstart smoother

2025-04-23 08:47:47 -04:00

run.py

Fixing the examples to use the new IceServer structure.

2025-04-29 10:33:19 -03:00

README.md

Pipecat Foundational Examples

This directory contains examples showing how to build voice and multimodal agents with Pipecat. Each example demonstrates specific features, progressing from basic to advanced concepts.

Learning Paths

Depending on what you're trying to build, these learning paths will guide you through relevant examples:

New to Pipecat: Start with examples 01, 02, 07
Building conversational bots: 07, 10, 38
Common add-on capabilities: 17, 24, 28, 34
Adding visual capabilities: 03, 12, 26
Advanced agent capabilities: 14, 20, 37

Quick Start

Set up a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Create a .env file with your API keys.
Run any example:
```
python run.py 01-say-one-thing.py
```
Open the web interface at http://localhost:7860 and click "Connect"

Examples by Feature

Basics

01-say-one-thing.py: Most basic bot that says one phrase and exits (Transport, TTS, Event handlers)
02-llm-say-one-thing.py: Bot generates a response with an LLM (LLM initialization)
03-still-frame.py: Displays a static image (Video transport, Image service)
04-transport.py: Different transport options (WebRTC, Daily, Livekit)

Conversational AI

07-interruptible.py: Basic voice assistant bot (STT, TTS, LLM, Interruptible speech)
10-wake-phrase.py: Bot activated by wake phrase (WakeCheckFilter)
22-natural-conversation.py: Smart turn detection (Multiple LLMs, Turn management)
38-smart-turn-fal.py: ML-based turn detection (Fal service, Local models)

Common Utilities

17-detect-user-idle.py: Handle inactive users (UserIdleProcessor)
24-stt-mute-filter.py: Selectively mute user input (STTMuteFilter)
28-transcription-processor.py: Record conversation text (TranscriptProcessor)
30-observer.py: Access frame data (Custom observers)
31-heartbeats.py: Detect idle pipelines (Pipeline monitoring)
34-audio-recording.py: Record conversation audio (Composite and track-level recording)

Advanced LLM Features

14-function-calling.py: Bot with tool usage (Function schemas, Tool registration)
20a-persistent-context-openai.py: Persistent conversation context (Memory management)
32-gemini-grounding-metadata.py: Web search capabilities (Google search integration)
33-gemini-rag.py: Retrieval-augmented generation (Data sources, Grounding)
37-mem0.py: Long-term agent memory (Mem0 service integration)

Media Handling

05-sync-speech-and-images.py: Synchronized narration with images (Custom processors, SyncParallelPipeline)
06a-image-sync.py: Dynamic image updates while speaking (Synchronized A/V pipelines)
09-mirror.py: Mirror user's audio and video (Custom frame processors)
11-sound-effects.py: Add sounds when bot speaks (Sound playback, Event synchronization)
23-bot-background-sound.py: Play background audio (SoundfileMixer)

Vision & Multimodal

12a-describe-video-gemini-flash.py: Bot describes user's video (Video input, Multimodal LLMs)
26c-gemini-multimodal-live-video.py: Gemini with video input (Streaming video, Function calls)

Voice & Language

13-transcription.py: Speech transcription demo (STT providers, Real-time transcription)
15-switch-voices.py: Dynamic voice/language changing (ParallelPipelines, FunctionFilters)
25-google-audio-in.py: Gemini for speech recognition (Alternative transcription)
35-pattern-pair-voice-switching.py: Dynamic TTS voice switching (XML parsing, PatternPairAggregator)
36-user-email-gathering.py: Spelling mode for TTS (Confirmation patterns, XML tags)

Integration Examples

18-gstreamer-filesrc.py: GStreamer video streaming (Video processing)
19-openai-realtime-beta.py: OpenAI Speech-to-Speech (Direct S2S, Function calls)
21-tavus-layer.py: Tavus digital twin (Avatar integration)
27-simli-layer.py: Simli avatar integration (Video synchronization)

Performance & Optimization

16-gpu-container-local-bot.py: GPU-accelerated local bot (Performance measurement)

Utilities

Advanced Usage

Customizing Network Settings

python run.py <example-name> --host 0.0.0.0 --port 8080

Troubleshooting

No audio/video: Check browser permissions for microphone and camera
Connection errors: Verify API keys in .env file
Missing dependencies: Run pip install -r requirements.txt
Port conflicts: Use --port to change the port

For more examples, visit our GitHub repository.