pipecat

Author	SHA1	Message	Date
Aleix Conchillo Flaqué	e553bb010f	tests: migrate LLM tests to Settings-based constructor API Replace the old `model=` / `params=InputParams(...)` style with the new `settings=<Service>.Settings(...)` form across LLM service tests.	2026-04-10 17:37:49 -07:00
Mark Backman	8adb38f87c	Remove unused imports across codebase	2026-04-02 22:21:16 -04:00
Paul Kompfner	394599d031	Remove deprecated `OpenAILLMContext` as well as everything (code paths or whole types) dependent on it (all of which were also deprecated)	2026-03-31 18:15:25 -04:00
Paul Kompfner	f2a8a9e753	Add WebSocket-based OpenAI Responses LLM service with previous_response_id optimization Introduce a WebSocket variant of the OpenAI Responses API service that maintains a persistent connection to wss://api.openai.com/v1/responses for lower-latency inference. The WebSocket variant automatically uses previous_response_id to send only incremental context when possible, falling back to full context on reconnection or cache miss. The WebSocket variant becomes the new default OpenAIResponsesLLMService, and the HTTP variant is renamed to OpenAIResponsesHttpLLMService. Both share a private base class with common settings, parameter building, and run_inference (always HTTP) logic.	2026-03-30 09:58:56 -04:00
Paul Kompfner	4c121332cf	Convert developer messages to user for Cerebras (and lay groundwork for other incompatible services) OpenAI-compatible services that don't support the "developer" message role can now set supports_developer_role = False on the service class. BaseOpenAILLMService passes this as convert_developer_to_user to the adapter, which converts developer messages to user messages before sending them to the API. Applied to Cerebras and Perplexity. Also removes the now-redundant developer→user conversion step from PerplexityLLMAdapter (handled by the parent adapter via the flag).	2026-03-24 16:05:15 -04:00
Paul Kompfner	d4dea30407	Centralize system message handling in adapters; add developer message support Two goals: 1. Centralize system_instruction vs context system message resolution into the LLM adapters. This eliminates duplication between in-pipeline and out-of-band (run_inference) code paths across ~16 locations in service llm.py files. 2. Add support for "developer" role messages in conversation context, which is facilitated by the above centralization. Shared helpers on BaseLLMAdapter: - _extract_initial_system_or_developer: extracts/converts messages[0] based on role and whether system_instruction is provided - _resolve_system_instruction: warns on conflicts between system_instruction and context system messages, returns the effective instruction Developer message handling (new): - Non-OpenAI adapters: an initial "developer" message is promoted to the system instruction when no system_instruction is provided; otherwise it is converted to "user". Subsequent "developer" messages are always converted to "user". No conflict warning is emitted for developer messages (unlike "system" messages). - OpenAI adapter: "developer" messages pass through in conversation history without triggering conflict warnings. - OpenAI Responses adapter: "developer" messages are kept as "developer" role (same as "system", which is also converted to "developer" for the Responses API). Other behavior changes: - Gemini: "initial" system message detection now checks messages[0] only (previously searched anywhere in the list) - Bedrock: a lone system message is now converted to "user" instead of being extracted to an empty message list (matches existing Anthropic behavior)	2026-03-24 16:02:42 -04:00
Paul Kompfner	348df9d4ce	fix: remove redundant instructions override in run_inference The override would re-add `instructions` after the adapter had intentionally converted it to a developer message for empty contexts. Added a regression test.	2026-03-19 13:34:41 -04:00
Paul Kompfner	951bb0c1a7	feat: set store=False and add run_inference tests Set store=False in Responses API calls since we send full conversation history as input items and don't use previous_response_id. Add 5 run_inference tests for OpenAIResponsesLLMService using real LLMContext and adapter (only HTTP client mocked).	2026-03-18 14:47:12 -04:00
Paul Kompfner	c4f21ef76b	test: add run_inference tests for OpenAIResponsesLLMService Uses real LLMContext and adapter (only HTTP client is mocked) to test basic inference, client exception propagation, system_instruction override, empty context fallback, and max_tokens override.	2026-03-18 14:17:21 -04:00
Paul Kompfner	a7167ad121	test: add run_inference tests for OpenAIResponsesLLMService Tests cover basic inference, client exception propagation, system_instruction override, and max_tokens override.	2026-03-18 14:09:17 -04:00
Mark Backman	912f1be31c	Add system_instruction parameter to run_inference (#3968 ) * Add system_instruction parameter to run_inference Allow callers to provide a custom system instruction directly when calling run_inference, without having to construct provider-specific context objects. For OpenAI, the instruction is prepended as a system message (preserving existing messages). For Anthropic, Google, and AWS Bedrock, it overrides the single system field with a warning when an existing system instruction is present in the context. * Use system_instruction parameter in _generate_summary Pass the summarization prompt via run_inference's system_instruction parameter instead of embedding it as a system message in the context. * Add changelog for #3968	2026-03-10 12:57:23 -04:00
Aleix Conchillo Flaqué	305ab44132	tests: add unittest.main() call	2026-01-30 10:07:34 -08:00
Aleix Conchillo Flaqué	2626154a64	update examples and tests copyright and use a proper dash in 2024-2026	2026-01-07 19:32:22 -08:00
Mark Backman	21a55f6aae	Update run_inference to use the provided LLM configuration params	2025-12-17 10:58:05 -05:00
Paul Kompfner	9f82c6b4a4	Add unit tests for `run_inference`	2025-09-12 11:07:11 -04:00

15 Commits