Kwindla Hultman Kramer
6f2a464451
conversation save/load for openai, openai-realtime, and anthropic
2024-10-13 18:12:03 -07:00
Kwindla Hultman Kramer
9e95419301
much cleanup
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
f390ec9608
temp commit; debugging
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
ce8a83efba
tools frame support and wip message resetting/loading
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
e5a2bf9564
context management improvements
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
31916ed9fd
turn on/off openai vad
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
3a2fbc2b19
send user started/stopped speaking event from openai realtime events
...
send user started/stopped speaking event from openai realtime events
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
40dc546b81
function call fix and user transcription frames
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
7dfac0163b
bits of pydantic
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
09a3c2a82d
major functionality working (not configurable, occasional timing bugs maybe)
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
c32c65014b
definitely broke something in the pipeline
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
b8898e449e
lots of debugging statements. multiple function calls broken
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
d1f6d229ca
space exploration prompt
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
4fa0318005
configurability via constructor
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
93ebb9d541
working 19-openai-realtime-beta.py example
2024-10-12 21:58:11 -07:00
Kwindla Hultman Kramer
16101c79c5
beginning of realtime impl
2024-10-12 21:58:11 -07:00
Aleix Conchillo Flaqué
75d27aeb9f
examples(storytelling): update packages
2024-10-02 12:00:00 -07:00
Aleix Conchillo Flaqué
8c01a900cd
google: allow using GOOGLE_APPLICATION_CREDENTIALS
2024-10-02 11:56:01 -07:00
Allenmylath
70c3ff31fd
Update env.example
...
elevenlabs is not used in code instead cartesian is used hence changed
2024-10-02 21:59:51 +05:30
Mark Backman
3d642df2b0
Revert aligning voice_id name in TTS service constructor
2024-10-02 11:07:48 -04:00
Mark Backman
28643b453d
Update to use LLM, STT, TTS subclasses and remove setter methods
2024-10-01 20:30:27 -04:00
Kwindla Hultman Kramer
92d8b37229
implement vision for openai
2024-09-30 21:49:29 -07:00
Kwindla Hultman Kramer
0d5cb84531
function calling testing and improvements
2024-09-30 20:59:28 -07:00
Kwindla Hultman Kramer
a5c73ec829
handle openai multiple function calls
2024-09-30 16:08:30 -07:00
Kwindla Hultman Kramer
5d63615b1b
Merge pull request #528 from pipecat-ai/khk/sentence-splits
...
TTS sentence aggregation fix
2024-09-30 16:07:21 -07:00
Kwindla Hultman Kramer
ed607d5c4b
typo fix
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
37da7e44cd
whitespace fix
2024-09-30 16:05:32 -07:00
Aleix Conchillo Flaqué
c8995b82e5
all frame processors are asynchrnous
...
In this commit we make all frame processors asynchronous, that is, they have an
internal queue and they push frames using a task from that queue.
2024-09-30 15:11:21 -07:00
Kwindla Hultman Kramer
6b7f924af6
tts sentence aggregation fix
2024-09-30 14:33:08 -07:00
Mark Backman
97cba92fa5
Merge pull request #516 from pipecat-ai/mb/google-tts
...
Add Google TTS
2024-09-30 12:25:16 -04:00
Mark Backman
830d2df671
Add Google TTS
2024-09-27 07:36:20 -04:00
Aleix Conchillo Flaqué
4501dca133
Merge pull request #467 from joachimchauvet/main
...
Add LiveKit audio transport
2024-09-26 22:58:25 -07:00
Mark Backman
55a70cde8f
Merge pull request #514 from pipecat-ai/mb/aws-polly-tts
...
Add AWS Polly TTS support
2024-09-26 22:20:13 -04:00
Mark Backman
706c00d897
Code review feedback
2024-09-26 22:13:37 -04:00
Mark Backman
a018112a13
Merge pull request #510 from pipecat-ai/mb/deepgram-tts-http
...
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-26 13:38:42 -04:00
Mark Backman
6a6ea251ae
Add AWS Polly TTS support
2024-09-26 13:35:59 -04:00
Mark Backman
73da8c1910
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-25 22:43:10 -04:00
joachimchauvet
b6e1d6e6ae
format with ruff
2024-09-24 10:21:02 +03:00
joachimchauvet
fa609f1afc
adjust output sample rate and create user token
2024-09-24 10:16:54 +03:00
joachimchauvet
2e5b0c1d6b
add tenacity dependency
2024-09-24 10:16:54 +03:00
joachimchauvet
a9390d96a1
add LiveKit audio transport
2024-09-24 10:16:54 +03:00
Jin Kim
cf72129852
Merge remote-tracking branch 'upstream/main'
2024-09-24 07:18:22 +09:00
Aleix Conchillo Flaqué
eeb8338dce
introduce Ruff formatting
2024-09-23 09:53:37 -07:00
Jin Kim
68cc4186ad
Merge remote-tracking branch 'upstream/main'
2024-09-23 16:34:31 +09:00
Mark Backman
55c645c614
Add voice_settings and optimize_streaming_latency to ElevenLabs
2024-09-22 13:58:50 -04:00
Mark Backman
c73111afea
Add extra input param to LLMs
2024-09-21 00:01:25 -04:00
Mark Backman
e8f8a49646
Merge pull request #484 from pipecat-ai/mb/llm-input-params
...
Add input params for OpenAI, Anthropic, Together AI LLMs
2024-09-20 20:35:49 -04:00
Mark Backman
f3fd312b83
Add Together AI interruptible example
2024-09-20 20:21:19 -04:00
Aleix Conchillo Flaqué
14acf05a26
Merge pull request #480 from pipecat-ai/aleix/input-output-frames
...
introduce input/output audio and image frames
2024-09-20 14:44:37 -07:00
Aleix Conchillo Flaqué
7e39d9ad3d
introduce input/output audio and image frames
...
We now distinguish between input and output audio and image frames. We introduce
`InputAudioRawFrame`, `OutputAudioRawFrame`, `InputImageRawFrame` and
`OutputImageRawFrame` (and other subclasses of those). The input frames usually
come from an input transport and are meant to be processed inside the pipeline
to generate new frames. However, the input frames will not be sent through an
output transport. The output frames can also be processed by any frame processor
in the pipeline and they are allowed to be sent by the output transport.
2024-09-19 23:11:03 -07:00