pipecat

Author	SHA1	Message	Date
pratham-sarvam	6d582e41b7	Added Sarvam TTS Websocket Implementation (#2356) * Added Sarvam TTS Websocket Implementation * Addressed some of the comments on PR * added change voice logic * added changes from main * pushing text frames and added flush audio * updated docs string for better docs * Addressed comments and added some improvements * pushed optional args down * removed new line * made aiohttp session mandatory in http service * added push frame and removed unused function * removed pong message * added disconnecting logic --------- Co-authored-by: vinayak-sarvam <vinayak@sarvam.ai>	2025-08-26 18:10:26 -03:00
Aleix Conchillo Flaqué	8f01cd220a	pyproject: update daily-python to 0.19.7	2025-08-21 18:40:01 -07:00
Aleix Conchillo Flaqué	802af28888	update pytest-asyncio to 1.1.0	2025-08-21 18:09:56 -07:00
Aleix Conchillo Flaqué	f387776985	add custom asyncio.wait_for() This patch uses the `wait_for2` package to implement `asyncio.wait_for()` for Python < 3.12. In Python 3.12, `asyncio.wait_for()` is implemented in terms of `asyncio.timeout()`, which fixed a bunch of issues. However, this was never backported (because of the lack of `asyncio.timeout()`) and there are still many remaining issues, especially in Python 3.10, in `asyncio.wait_for()`. See https://github.com/python/cpython/pull/98518	2025-08-20 14:09:05 -07:00
Mark Backman	42bd1e9d40	Add Mistral to README and pyproject.toml	2025-08-14 11:15:52 -04:00
Mark Backman	c4506523ab	Refactor PlayHTHttpTTSService to use aiohttp	2025-08-11 19:58:25 -04:00
Aleix Conchillo Flaqué	bc1949b4bf	MoondreamService: update to revision 2025-01-09	2025-08-11 14:54:04 -07:00
Mark Backman	3a306dae90	fix: pin numba to >=0.61.2	2025-08-08 10:52:47 -04:00
Mark Backman	312fb23c89	fix: pin openai package upper bound to <=1.99.1	2025-08-07 18:00:25 -04:00
Mark Backman	820176084c	Add support for 3.13 by bumping min version for vllm to 0.9.0, adding support for torch and torchaudio up to the next major version	2025-08-06 13:36:01 -04:00
Mark Backman	41a22d3bf4	Add new python-compatiblity workflow to check for dependency compatibility across supported python versions	2025-08-06 13:36:01 -04:00
Mark Backman	ac6b59cae2	Merge pull request #2372 from pipecat-ai/mb/dotenv-dev Wider package support for python-dotenv dev dep	2025-08-06 06:06:01 -07:00
Mark Backman	12e168e740	Wider package support for python-dotenv dev dep	2025-08-06 09:04:01 -04:00
Mark Backman	42094fb206	Update docs auto-generation to use uv	2025-08-05 20:37:27 -04:00
Aleix Conchillo Flaqué	a1f3f51168	pyproject: update daily-python to 0.19.6	2025-08-02 20:02:22 -07:00
Mark Backman	b71057bf7c	Move dev to [dependency-groups], update uv.lock	2025-08-01 09:43:56 -04:00
Mark Backman	637d372fe4	Add dev to optional-dependencies	2025-07-31 23:39:23 -04:00
Sam Sykes	2d3f61aa07	Updated Speechmatics Plugin (#2225) Changes * Split out module attributes to make engine settings clearer * Removed internal audio buffer to use latest Speechmatics python SDK (0.4.0) * Use diarization for improved VAD in multi-speaker situations * Support custom dictionary / vocabulary with attributes * Deprecated attributes superseded by re-organised attributes Diarization Enhancements * Focus on specific speakers (using speaker labels) * Ignore specific speakers (using speaker labels) * Separate transcription formats for active and inactive speakers * Support for known speakers	2025-07-31 17:51:38 -03:00
Mark Backman	aa85fffa57	New runner module (#2269) * Adds pipecat.runner.run - FastAPI-based development server with automatic bot discovery * Adds new RunnerArguments types for different transports * Adds new runner utils for creating transports and parsing data * Adds new Daily and LiveKit utils for setup	2025-07-30 22:02:28 -04:00
Aleix Conchillo Flaqué	c679227aa8	pyproject: update daily-python to 0.19.5	2025-07-30 13:19:48 -07:00
Filipi Fuchter	6e921cdf45	HeyGen implementation for Pipecat - HeyGenVideoService	2025-07-30 09:07:15 -03:00
Ashot	83b4747196	chore: address review comments	2025-07-28 17:52:17 +04:00
Mark Backman	41c8d22cf3	Merge pull request #2208 from padillamt/mtp/add-inworld-tts Inworld HTTP TTS Service	2025-07-25 17:13:37 -07:00
Mark Backman	5b7b4efdc9	Add broader version support for stable core dependencies, up to the next major version	2025-07-24 09:40:52 -04:00
Mark Backman	cfa26524ca	Add support for fastapi>=0.115.6,<0.117.0	2025-07-24 09:37:42 -04:00
Mark Backman	3d4ab7158d	pyproject.toml dependency updates to support better cross compatibility	2025-07-24 09:37:42 -04:00
Mark Backman	083b32887e	NeuphonicHttpTTSService: Refactor to use POST API	2025-07-24 01:05:37 -04:00
Mark Backman	7955080da2	Change extra_headers to additional_headers, update websocket version support	2025-07-23 11:53:43 -04:00
Mark Backman	b07b947352	Merge pull request #2244 from pipecat-ai/mb/upgrade-deepgram-4.7.0 Deepgram: Update optional dependency to 4.7.0	2025-07-23 07:04:02 -07:00
dbtreasure	f710c94b6e	Address code review feedback: remove explicit llvmlite pin - Remove explicit llvmlite>=0.44.0 pin as numba>=0.61.0 automatically pulls compatible version - Add changelog entry for Python 3.11+ dependency fix 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-22 18:45:32 -06:00
dbtreasure	6e3a0a2d5d	Add explicit numba/llvmlite pins for Python 3.11+ compatibility Fixes dependency resolution issues where transitive dependencies through resampy would install incompatible versions: - numba>=0.61.0 (supports Python 3.10-3.13) - llvmlite>=0.44.0 (supports Python 3.10-3.13) Previously, older versions (numba 0.53.1, llvmlite 0.36.0) only supported Python 3.6-3.9, causing deployment failures on Python 3.11+. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-22 18:45:02 -06:00
Mark Backman	26c937af87	Update match_endofsentence to use NLTK sentence tokenizer	2025-07-22 20:19:29 -04:00
Mark Backman	976f6168f0	Deepgram: Update optional dependency to 4.7.0	2025-07-22 20:15:30 -04:00
Mark Backman	c9717a23a5	Livekit: change tenacity supported versions	2025-07-21 17:30:18 -04:00
kompfner	a74a935ca0	Merge pull request #1910 from matejmarinko-soniox/main Add Soniox STT service integration	2025-07-17 09:29:07 -04:00
padillamt	f3984aec33	inworld: added (empty) requirements for Inworld to be explicit reg dependencies	2025-07-16 13:21:32 -07:00
Jaideep	2fe06f0a4e	Update pyproject.toml	2025-07-12 11:34:45 +05:30
Mark Backman	bf580d061d	Update numpy, transformers to support newer versions	2025-07-11 11:58:31 -07:00
Mark Backman	0bdbc83ed9	Package updates to run the release evals	2025-07-08 11:39:49 -07:00
Filipi Fuchter	74da197304	Refactored AWSBedrockLLMService and AWSPollyTTSService to work asynchronously using aioboto3 instead of the boto3 library.	2025-07-08 07:28:23 -03:00
Matej Marinko	0f727248d2	Merge branch 'main' of github.com:pipecat-ai/pipecat	2025-07-08 08:20:10 +02:00
Sam Sykes	7596d71460	Speechmatics STT + multi-speaker conversations (#2036) * initial config * skeleton * Added a README (to be added to). * Payloads coming from the ASR. * doc update * handle the partials and finals * enable diarization in the example * support sending messages to pipecat pipeline * requirements fix in README * updated example (with amusement) * updated example to match master * updated docs * support for diarization tags * logic fix for wrapper * Use an internal SpeechFrame for speaker_id (not user_id). * only include speaker tags on finalised transcript (as this may skew end of utterance detection) * updated docs * correction to docs and updated example * updated requirement * Fix for using default EU server. * Updates from PR comments. * Refactor based on comments in the original PR. Primary focus on documentation, naming conventions and how `user_id` is used. * Check for SMX installed when importing. * Variable name change * Comment correction. * Support for Esperanto and Uyghur * Improved language support * function name change * Locale fix * intercept * interim changes * pass the pipeline task to the module for adding events to the top of the pipeline * logging for the pipeline * Reduce timeout for content aggregator. * staged update * testing with Azure * Updated context (Azure was dropping punctuation) and using better ElevenLabs model. * Updated to RT 0.3.0 and use OpenAI (not Azure). * Missing OpenAI import; parameter name change for output locale validation. * Revert to `0.2.0` of RT SDK. * fix for assignment of `output_locale_code`. * update Speechmatics library to 0.3.1 * new transcription example * updated asyncio task handling * Updated doc strings * enable OpenTelemetry logging * removed import from stt for __init__ * updated examples and default values * updated examples * prevent lock up when closing the STT connection	2025-07-03 17:25:13 -03:00
Mark Backman	5c2ea3b804	Upgrade google-genai version to 1.24.0	2025-07-02 11:18:37 -07:00
Aleix Conchillo Flaqué	de5f9c9217	pyproject: update daily-python to 0.19.4	2025-07-02 09:51:36 -07:00
Aleix Conchillo Flaqué	58aedc88a4	DailyTransport: allow receiving audio in a single track	2025-07-01 17:29:10 -07:00
kompfner	de74284a8e	Merge pull request #2051 from pipecat-ai/pk/direct-functions Implement "direct functions", which allow you to bypass specifying a …	2025-07-01 14:19:33 -04:00
Mark Backman	fd570b0377	Update the remaining docstrings, update pre-commit hook, add docstring formatting CI, update CONTRIBUTING with formatting guidance (#2089)	2025-07-01 00:37:04 -04:00
Paul Kompfner	15b9a5faf6	Implement "direct functions", which allow you to bypass specifying a function configuration (as a `FunctionSchema` or in a provider-specific format) and use the Python function directly. Metadata is gathered automatically from the function signature and docstring.	2025-06-30 10:36:42 -04:00
Mark Backman	0ecfa827e6	Improve docstrings for services and processors (#2087)	2025-06-28 13:39:45 -04:00
Yousif	46b52cb9bb	Merge branch 'main' into mcp-streaming-http	2025-06-26 12:30:43 -07:00

288 Commits