pipecat

Author	SHA1	Message	Date
Paul Kompfner	4c121332cf	Convert developer messages to user for Cerebras (and lay groundwork for other incompatible services) OpenAI-compatible services that don't support the "developer" message role can now set supports_developer_role = False on the service class. BaseOpenAILLMService passes this as convert_developer_to_user to the adapter, which converts developer messages to user messages before sending them to the API. Applied to Cerebras and Perplexity. Also removes the now-redundant developer→user conversion step from PerplexityLLMAdapter (handled by the parent adapter via the flag).	2026-03-24 16:05:15 -04:00
Paul Kompfner	d4dea30407	Centralize system message handling in adapters; add developer message support Two goals: 1. Centralize system_instruction vs context system message resolution into the LLM adapters. This eliminates duplication between in-pipeline and out-of-band (run_inference) code paths across ~16 locations in service llm.py files. 2. Add support for "developer" role messages in conversation context, which is facilitated by the above centralization. Shared helpers on BaseLLMAdapter: - _extract_initial_system_or_developer: extracts/converts messages[0] based on role and whether system_instruction is provided - _resolve_system_instruction: warns on conflicts between system_instruction and context system messages, returns the effective instruction Developer message handling (new): - Non-OpenAI adapters: an initial "developer" message is promoted to the system instruction when no system_instruction is provided; otherwise it is converted to "user". Subsequent "developer" messages are always converted to "user". No conflict warning is emitted for developer messages (unlike "system" messages). - OpenAI adapter: "developer" messages pass through in conversation history without triggering conflict warnings. - OpenAI Responses adapter: "developer" messages are kept as "developer" role (same as "system", which is also converted to "developer" for the Responses API). Other behavior changes: - Gemini: "initial" system message detection now checks messages[0] only (previously searched anywhere in the list) - Bedrock: a lone system message is now converted to "user" instead of being extracted to an empty message list (matches existing Anthropic behavior)	2026-03-24 16:02:42 -04:00
Paul Kompfner	348df9d4ce	fix: remove redundant instructions override in run_inference The override would re-add `instructions` after the adapter had intentionally converted it to a developer message for empty contexts. Added a regression test.	2026-03-19 13:34:41 -04:00
Paul Kompfner	951bb0c1a7	feat: set store=False and add run_inference tests Set store=False in Responses API calls since we send full conversation history as input items and don't use previous_response_id. Add 5 run_inference tests for OpenAIResponsesLLMService using real LLMContext and adapter (only HTTP client mocked).	2026-03-18 14:47:12 -04:00
Paul Kompfner	c4f21ef76b	test: add run_inference tests for OpenAIResponsesLLMService Uses real LLMContext and adapter (only HTTP client is mocked) to test basic inference, client exception propagation, system_instruction override, empty context fallback, and max_tokens override.	2026-03-18 14:17:21 -04:00
Paul Kompfner	a7167ad121	test: add run_inference tests for OpenAIResponsesLLMService Tests cover basic inference, client exception propagation, system_instruction override, and max_tokens override.	2026-03-18 14:09:17 -04:00
Mark Backman	912f1be31c	Add system_instruction parameter to run_inference (#3968 ) * Add system_instruction parameter to run_inference Allow callers to provide a custom system instruction directly when calling run_inference, without having to construct provider-specific context objects. For OpenAI, the instruction is prepended as a system message (preserving existing messages). For Anthropic, Google, and AWS Bedrock, it overrides the single system field with a warning when an existing system instruction is present in the context. * Use system_instruction parameter in _generate_summary Pass the summarization prompt via run_inference's system_instruction parameter instead of embedding it as a system message in the context. * Add changelog for #3968	2026-03-10 12:57:23 -04:00
Aleix Conchillo Flaqué	305ab44132	tests: add unittest.main() call	2026-01-30 10:07:34 -08:00
Aleix Conchillo Flaqué	2626154a64	update examples and tests copyright and use a proper dash in 2024-2026	2026-01-07 19:32:22 -08:00
Mark Backman	21a55f6aae	Update run_inference to use the provided LLM configuration params	2025-12-17 10:58:05 -05:00
Paul Kompfner	9f82c6b4a4	Add unit tests for `run_inference`	2025-09-12 11:07:11 -04:00

11 Commits