pipecat

Author	SHA1	Message	Date
joycech333	77cc314a08	feat: add Inception LLM service with Mercury-2 support Adds InceptionLLMService, an OpenAI-compatible service for Inception's Mercury-2 diffusion-based reasoning model. Supports reasoning_effort (instant/low/medium/high) and realtime mode for reduced TTFT.	2026-05-21 11:23:23 -04:00
asilvestre	a1c40df471	add documentation entry	2026-05-18 14:40:56 +02:00
Mark Backman	3034f8bb3b	Update README to remove NVIDIA references to RIVA	2026-04-28 12:42:58 -04:00
Mark Backman	58a038ddb2	Add Soniox real-time TTS service Introduce SonioxTTSService, a WebSocket TTS provider that streams text and receives audio over a persistent connection, multiplexing up to 5 concurrent streams per socket via Soniox's `stream_id`. Also updates the README service table and the Soniox voice example to use the new TTS end-to-end.	2026-04-27 16:04:02 -04:00
Paul Kompfner	005fe33b25	Update docs URLs in README to reflect new docs site structure and avoid redirects	2026-04-27 10:22:49 -04:00
Paul Kompfner	24154474c9	Add OpenAI Responses to the README's list of LLM services	2026-04-27 10:19:13 -04:00
Mark Backman	c091232f2f	Add xAI streaming STT service New `XAISTTService` wraps xAI's real-time speech-to-text WebSocket (`wss://api.x.ai/v1/stt`). It extends `WebsocketSTTService`, authenticates with the `XAI_API_KEY` as a Bearer token on the WS handshake, and streams raw audio (PCM/mu-law/A-law) with configurable interim results, endpointing, language, multichannel, and diarization settings. - `src/pipecat/services/xai/stt.py`: new service, settings dataclass, and `language_to_xai_stt_language` helper. - `src/pipecat/services/stt_latency.py`: `XAI_TTFS_P99` default. - `pyproject.toml` / `uv.lock`: `xai` extra now pulls in `websockets-base`. - `README.md`: link to xAI STT in the services table. - `examples/voice/voice-xai.py`: swap DeepgramSTTService for XAISTTService so the xAI voice example is fully xAI. - `examples/transcription/transcription-xai.py`: new transcription-only example using the new service.	2026-04-21 13:45:34 -04:00
Aleix Conchillo Flaqué	8ec85f981d	Add Pipecat Subagents to the ecosystem section in README	2026-04-16 09:57:23 -07:00
Mark Backman	9ffcccdd84	Merge pull request #4253 from pipecat-ai/mb/mistral-stt Add Mistral Voxtral Realtime STT service	2026-04-15 09:00:27 -04:00
Mark Backman	da9a55a430	Fix translation example in README	2026-04-10 09:13:42 -04:00
Mark Backman	874e2878be	Update README with Mistral services	2026-04-07 15:36:22 -04:00
Aleix Conchillo Flaqué	f4743a6c91	require python >= 3.11	2026-04-01 19:02:34 -04:00
Mark Backman	d3021b4590	Rename example files to prepend parent folder name, preventing package shadowing Example files like openai.py shadow installed packages when Python adds the script directory to sys.path. Prepend the parent folder name to each example file (e.g. openai.py -> function-calling-openai.py). Also split thinking-and-mcp/ into separate mcp/ and thinking/ directories.	2026-03-31 22:06:01 -04:00
Mark Backman	e719cbbe6d	Reorganize examples into topic-based subfolders Move 304 examples from a flat numbered directory into 14 descriptive subfolders: getting-started, services (speech + function-calling), transcription, vision, realtime, persistent-context, context-summarization, update-settings (stt/tts/llm), turn-management, thinking-and-mcp, transports, video-avatar, video-processing, and features. Strip numbered prefixes from filenames (e.g. 07c-interruptible-deepgram.py becomes services/speech/deepgram.py) since the folder context makes them redundant. Keep numbered prefixes only in getting-started/ where ordering matters. Update eval script paths and README to match the new structure.	2026-03-31 13:12:24 -04:00
Mark Backman	f2ce7ececc	Move foundational examples to examples/	2026-03-31 13:12:24 -04:00
Mark Backman	32022a952e	Merge pull request #4205 from pipecat-ai/mb/remove-quickstart Remove quickstart example from repo	2026-03-30 18:58:49 -04:00
Mark Backman	b78ae40d3c	Remove quickstart example from repo	2026-03-30 18:20:41 -04:00
Aleix Conchillo Flaqué	f0d04dde1c	audio(filters): remove KrispFilter	2026-03-30 14:01:06 -07:00
Mark Backman	e1a3ddbb57	Add missing services to README available services table Adds Kokoro (TTS), LiveKit and WhatsApp (Transport), Genesys (Serializers), and Krisp Viva and RNNoise (Audio Processing).	2026-03-30 10:06:14 -04:00
Arindam200	39919f7889	Add NebiusLLMService for Nebius Token Factory Adds an OpenAI-compatible LLM service for Nebius Token Factory, supporting open-source models (Meta Llama, Qwen, DeepSeek) via their OpenAI-compatible REST API at https://api.tokenfactory.nebius.com/v1/.	2026-03-29 14:35:46 +05:30
Mark Backman	ca2bfd6f12	Remove SambaNovaSTTService SambaNova no longer offers speech-to-text audio models.	2026-03-26 12:22:06 -04:00
Mark Backman	adc003d6c7	Code review cleanup	2026-03-25 10:53:07 -04:00
Nicholas Zhao	02b97035f8	Add xAI TTS service	2026-03-25 10:45:15 -04:00
Mark Backman	51d28b4a9f	Code review fixes	2026-03-24 11:21:04 -04:00
Mark Backman	aa0b49d69f	Code review fixes	2026-03-24 09:22:08 -04:00
Mark Backman	1c8a8f51d4	Code review fixes	2026-03-24 08:46:03 -04:00
dhruvladia-sarvam	349b8645f3	Merge branch 'main' into feat/sarvam-llm-integration	2026-03-24 16:34:12 +05:30
dhruvladia-sarvam	696196e30c	alignment with pr 4081	2026-03-24 16:29:58 +05:30
Mark Backman	a11c48d5b0	Add community integrations to README	2026-03-20 10:09:58 -04:00
Mark Backman	eeb8ed8588	Remove Hathora service integration Hathora is shutting down on March 5, 2026. Remove the STT/TTS services, examples, and related references.	2026-03-04 22:10:06 -05:00
Mark Backman	aae9136df9	Review feedback	2026-03-02 17:52:39 -05:00
filipi87	49c73bb0a3	Merge branch 'main' into filipi/lemonslice # Conflicts: # README.md # uv.lock	2026-03-02 19:24:52 -03:00
Mark Backman	44993fe9e3	Remove PlayHT TTS services	2026-02-25 14:12:39 -05:00
Aleix Conchillo Flaqué	68e19a730b	Restore dev skills and add marketplace for maintainer workflows Brings back the 6 development workflow skills (changelog, cleanup, code-review, docstring, pr-description, pr-submit) that were moved to pipecat-ai/skills, and adds a .claude-plugin/marketplace.json so other pipecat-ai repos can install them. Updates README contributing section with installation instructions.	2026-02-24 23:47:06 -08:00
Aleix Conchillo Flaqué	ee46cbce4c	Move skills to pipecat-ai/skills repo, add README instructions Remove bundled Claude Code skills (changelog, cleanup, code-review, docstring, pr-description, pr-submit) that now live in https://github.com/pipecat-ai/skills. Add a section to the README with installation instructions. The update-docs skill remains as it is specific to this repository.	2026-02-24 11:41:19 -08:00
Joshua Primas	35aba4128c	Adding the LemonSlice transport integration	2026-02-20 15:24:48 -08:00
Mark Backman	5cda72d138	Add Resemble TTS to README	2026-02-02 09:05:03 -05:00
Waldek Maleska	b13b65d6e2	Update README.md - fix Google Imagen URL	2026-01-22 15:17:41 +00:00
Mike Seese	e5632a9339	transition Hathora service to use the unified API and apply PR feedback add Hathora to root files Hathora run linter added hathora changelog	2026-01-15 15:27:53 -08:00
Neil Ruaro	9942fcfeb2	updated per PR reviews	2026-01-16 01:20:17 +08:00
Mark Backman	b58471fdb1	Add Exotel and Vonage to Serializers in README services list	2026-01-12 12:24:56 -05:00
Mark Backman	d646ca594b	Update Ultravox README link	2025-12-29 11:43:28 -05:00
Mark Backman	5ad8e5436d	Add Grok Voice Agent to README services list	2025-12-20 08:11:41 -05:00
Mike Depinet	4b81be7acf	Add Ultravox service (#1 ) Adds support for using Ultravox Realtime as a speech-to-speech service. Also removes the deprecated Ultravox speech-to-text vllm model integration to avoid confusion.	2025-12-12 10:16:15 -08:00
Aleix Conchillo Flaqué	f0af0a6b96	README: remove manta badge	2025-12-05 16:16:19 -08:00
laurent	af52833ca0	Update the readme and env.example.	2025-12-05 10:44:30 +01:00
Mark Backman	588dcf2ab9	Add Sarvam STT to README list	2025-11-10 14:29:54 -05:00
Mark Backman	f820c20fa2	Add SpeechmaticsTTSService and SonioxSTTService changes to changelog	2025-10-31 07:41:17 -04:00
Mark Backman	9f66b0ba41	Add Pipecat CLI to README's ecosystem section	2025-10-21 13:17:37 -04:00
Mark Backman	e11ede475b	Update moondream chatbot README link	2025-10-15 13:22:56 -04:00

1 2 3 4 5

215 Commits