pipecat

Author	SHA1	Message	Date
mattie ruth backman	8a90decbc0	codepilot review fixes	2025-11-14 13:51:45 -05:00
mattie ruth backman	ccca6e8d81	Make the PatternPair action an Enum	2025-11-14 13:51:45 -05:00
mattie ruth backman	e6dc1a510d	Introduce AggregatedLLMTextFrame to allow a separation of TTSTextFrame, indicating a spoken frame vs other aggregated, non-spoken frames	2025-11-14 13:51:45 -05:00
mattie ruth backman	69945c5e0d	Various fixes: 1. Fixed pattern_pair_aggregator to support various ways of handling pattern matches (remove, keep and just trigger a callback, or aggregate) 2. Fixed ivr_navigator use of pattern_pair_aggregator 3. Test fixes -- Tests now pass	2025-11-14 13:51:45 -05:00
mattie ruth backman	5c8635570d	test fixes	2025-11-14 13:51:45 -05:00
mattie ruth backman	fe9aa3383e	Adding support for new bot-output RTVI Message: 1. TTSTextFrames now include metadata about whether the text was spoken or not along with a type string to describe what the text represents: ex. "sentence", "word", "custom aggregation" 2. Expanded how aggregators work so that the aggregate method returns aggregated text along with the type of aggregation used to create it 3. Deprecated the RTVI bot-transcription event in lieu of... 4. Introduced support for a new bot-output event. This event is meant to be the one stop shop for communicating what the bot actually "says". It is based off TTSTextFrames to communicate both sentence by sentence (or whatever aggregation is used) as well as word by word. In addition, it will include LLMTextFrames, aggregated by sentence when tts is turned off (i.e. skip_tts is true). Resolves pipecat-ai/pipecat-client-web#158	2025-11-14 13:51:45 -05:00
Angad Singh	d1116d149e	feat: Add ErrorFrame emission to TTS/STT services for pipeline error detection (#2881) * feat: Add ErrorFrame emission to TTS/STT services for pipeline error detection - Add ErrorFrame emission to all major TTS/STT services during initialization and runtime failures - Services updated: Cartesia, ElevenLabs, Deepgram, AssemblyAI, Rime, Azure - ErrorFrame objects emitted with fatal=False for graceful degradation - Enables on_pipeline_error event handler to detect service failures programmatically - Add comprehensive pytest test suite to verify ErrorFrame emission - Fixes issue where services failed gracefully but didn't emit ErrorFrame objects This allows developers to implement real-time error monitoring and alerting using the on_pipeline_error event handler introduced in v0.0.90. * Update STT and TTS services to use consistent error handling pattern - Improves error handling consistency across all services * Add changelog entry for STT/TTS error handling improvements * Linting issues Resolved * Azure STT ErrorFrames added with consistent patterns * Cartesia STT and Deepgram STT; additional fixes made * Removed Fatal Flags across services, removed duplication * Moving the changelog entry to the correct place. * Refactoring some classes to use yield instead of push_error directly. * Fixing ruff format. --------- Co-authored-by: Filipi Fuchter <filipi87@gmail.com>	2025-11-14 15:03:05 -03:00
Mark Backman	74a0e8c88d	Merge pull request #3050 from ai-coustics/aic-vad-analyzer feat(ai-coustics): add ai-coustics integrated VAD	2025-11-14 08:11:15 -05:00
Corvin Jaedicke	fbbad27d37	add changelog info	2025-11-14 13:30:06 +01:00
kompfner	e83ac82bf3	Merge pull request #3042 from pipecat-ai/pk/follow-up-inter-frame-spaces Follow-up to #3041	2025-11-13 11:03:06 -05:00
Mark Backman	d78d38ce44	Merge pull request #3039 from pipecat-ai/mb/update-google-gemini-tts Update GeminiTTSService for streaming, other Google TTS improvements	2025-11-13 10:33:46 -05:00
Mark Backman	edbf96b3c5	Update GeminiTTSService for streaming, other Google TTS improvements	2025-11-13 10:22:34 -05:00
Paul Kompfner	8851d18f92	Tweak the LLM prompt again to try to fix the issue of LLMs sometimes omitting punctuation in their output.	2025-11-13 10:02:33 -05:00
Mark Backman	d823a3edec	Merge pull request #3040 from pipecat-ai/mb/11labs-realtime-stt Add ElevenLabsRealtimeSTTService	2025-11-13 09:53:34 -05:00
Mark Backman	0e37658f8d	Add ElevenLabsRealtimeSTTService	2025-11-13 09:49:05 -05:00
Corvin Jaedicke	2fab3e2286	fix formatting	2025-11-13 14:39:26 +01:00
Corvin Jaedicke	a7b2052b38	add ai-coustics VAD	2025-11-13 14:20:35 +01:00
Mark Backman	6d0e99c3b8	Merge pull request #3044 from rimelabs/rime-hin-lanaguge-support Add support for Hindi language in Rime TTS service	2025-11-12 21:13:01 -05:00
gokuljs	fe25465987	changelog update	2025-11-13 07:16:36 +05:30
gokuljs	498e9ca4f6	Add support for Hindi language in Rime TTS service	2025-11-13 04:33:22 +05:30
Paul Kompfner	1802f949ef	Fix an issue with some examples where punctuation was missing from the LLM output, by tweaking the LLM prompt.	2025-11-12 17:12:03 -05:00
Paul Kompfner	1ad6405ebb	Override `includes_inter_frame_spaces` in: - `GoogleHttpTTSService` - `OpenAITTSService` The reason I skipped this work in an earlier PR was because these services seemed to be emitting long, punctuation-free text frames. It turns out that the issue was with the LLM prompt, though, resulting in the LLM nondeterministically excluding all punctuation. An upcoming commit will address that prompt issue.	2025-11-12 17:07:43 -05:00
kompfner	4c25555396	Merge pull request #3041 from pipecat-ai/pk/apply-includes-inter-frame-spaces-wherever-necessary Apply `includes_inter_frame_spaces = True` in all LLM and TTS service…	2025-11-12 16:49:14 -05:00
Paul Kompfner	5222ff99de	Apply `includes_inter_frame_spaces = True` in all LLM and TTS services that need it. Note that for `LLMTextFrame`s, the right behavior is pretty much always `includes_inter_frame_spaces = True`. I decided not to go ahead and make that the default for `LLMTextFrame`s, though, simply to not introduce a subtle behavior change for creative/unexpected use-cases that were relying on text in hand-crafted `LLMTextFrame`s being handled a certain way. Ditto for `TTSTextFrame`s. Also, fix an issue in `NeuphonicTTSService` where it wasn't pushing `TTSTextFrame`s. Also, fix the broken `SarvamHttpTTSService` example. Also, add a couple of missing examples.	2025-11-12 15:10:11 -05:00
Mark Backman	203a627707	Merge pull request #3028 from sam-s10s/fix/smx-tts-retry SpeechmaticsTTS - Support for retry when 503 error to TTS API.	2025-11-12 09:26:07 -05:00
James Hush	2006a64def	Fix Langfuse tracing for GoogleLLMService with universal LLMContext (#3025) * Fix Langfuse tracing for GoogleLLMService with universal LLMContext - Fixed issue where input appeared as null in Langfuse dashboard for GoogleLLMService - Added fallback to use adapter's get_messages_for_logging() for universal LLMContext - Ensures proper message format conversion for Google/Gemini services - Handles system message conversion to system_instruction format - Also fixes serialization of empty message lists ([] now serializes correctly) This fix ensures Langfuse tracing works correctly for Google services using both OpenAILLMContext/GoogleLLMContext and the universal LLMContext. * Add unit tests for Langfuse tracing with GoogleLLMService - Test that tracing correctly captures messages with universal LLMContext - Test that empty message lists are properly serialized - Test that adapter's get_messages_for_logging is used instead of context method - All tests verify that input is correctly added to Langfuse spans * Fix test mocking to patch opentelemetry.trace.get_tracer correctly The tests were failing in CI because they were trying to patch 'pipecat.utils.tracing.service_decorators.trace' which doesn't exist as an attribute. The trace module is imported from opentelemetry, so we need to patch 'opentelemetry.trace.get_tracer' instead. * Skip tracing tests when opentelemetry is not installed The tracing dependencies (opentelemetry) are optional in Pipecat and not installed in the CI environment. Added a skipif marker to skip these tests when opentelemetry is not available, preventing CI failures while still allowing the tests to run when tracing dependencies are installed locally. * Install tracing dependencies in GitHub Actions CI Instead of skipping the tracing tests, install the 'tracing' extra (opentelemetry) in the CI environment so the tests can run properly. Removed the skipif condition from the tests since opentelemetry will now be available in CI. * Use the context type to determine which messages to use, fix tool_count and tools (#3032) --------- Co-authored-by: Mark Backman <mark@daily.co>	2025-11-12 14:58:00 +01:00
Corvin Jaedicke	3c76917c1e	use async process function	2025-11-12 13:48:22 +01:00
Filipi da Silva Fuchter	eb36a1bc91	Merge pull request #3033 from pipecat-ai/filipi/deepgram_flux_urlencode_changelog Mentioning DeepgramFluxSTTService URL encode fix in changelog.	2025-11-11 17:29:07 -03:00
Filipi Fuchter	fff8aac18c	Mention DeepgramFluxSTTService URL encode fix in changelog.	2025-11-11 17:25:40 -03:00
Filipi da Silva Fuchter	ec4bd8db10	Merge pull request #3014 from julienvantyghem/fix/deepgramflux-keyterm-encoding fix(deepgram-flux): urlencode keyterm and tag parameters	2025-11-11 17:24:17 -03:00
Filipi da Silva Fuchter	4cc298d616	Merge pull request #3029 from pipecat-ai/filipi/heygen_keep_alive Preventing HeyGenVideoService from disconnecting.	2025-11-11 15:25:43 -03:00
Sam Sykes	8d21b54ef3	Revert to `ErrorFrame`.	2025-11-11 18:24:08 +00:00
Sam Sykes	217d7e9953	Fix for max attempts.	2025-11-11 18:05:06 +00:00
Sam Sykes	41cf9adef4	Updated for max retries.	2025-11-11 18:00:27 +00:00
Sam Sykes	501744d7da	Update CHANGELOG.	2025-11-11 17:53:31 +00:00
Sam Sykes	60bc77c795	Update debugging messages.	2025-11-11 17:50:06 +00:00
Sam Sykes	0febfc62ec	Updated to use backoff utility function.	2025-11-11 17:45:22 +00:00
Filipi Fuchter	b76b25a6e1	Mentioning the HeyGen fix in the changelog.	2025-11-11 11:58:31 -03:00
Filipi Fuchter	62caadfc7c	Preventing HeyGenVideoService from disconnecting.	2025-11-11 11:37:46 -03:00
Sam Sykes	41ac43cf71	updated docs	2025-11-11 13:56:45 +00:00
Sam Sykes	adf5198423	Support for retry when 503 error to TTS API.	2025-11-11 13:49:14 +00:00
Mark Backman	54e8d29615	Merge pull request #3022 from pipecat-ai/mb/changelog-0.0.94 Prep for 0.0.94 hotfix v0.0.94	2025-11-10 16:52:38 -05:00
Mark Backman	ee494918a9	Prep for 0.0.94 hotfix	2025-11-10 16:50:58 -05:00
Mark Backman	aa8a50bc61	Merge pull request #3015 from pipecat-ai/mb/deprecate-krisp Deprecate KrispFilter	2025-11-10 16:38:06 -05:00
Mark Backman	20857ac19a	Merge pull request #3010 from pipecat-ai/mb/gemini-live-ar-xa Add ar-XA language code for Gemini Live	2025-11-10 16:36:33 -05:00
Mark Backman	421a1b5389	Merge pull request #3021 from pipecat-ai/mb/add-sarvam-stt-readme Add Sarvam STT to README list	2025-11-10 16:36:03 -05:00
Mark Backman	8dd45af5b7	Deprecate KrispFilter	2025-11-10 16:35:11 -05:00
kompfner	66c903276a	Merge pull request #3020 from pipecat-ai/pk/make-explicit-adding-spaces-when-concatenating-tts-text Make the mechanism of adding spaces when concatenating TTS (or speech…	2025-11-10 14:34:10 -05:00
Mark Backman	588dcf2ab9	Add Sarvam STT to README list	2025-11-10 14:29:54 -05:00
Paul Kompfner	913194844e	Make the mechanism of adding spaces when concatenating TTS (or speech-to-speech LLM) output text explicit and deterministic, rather than heuristic-based. This fixes a bug where spaces were sometimes missing from assistant messages in context.	2025-11-10 14:22:32 -05:00

6322 Commits