Kwindla Hultman Kramer
07124bfafc
tools frame support and wip message resetting/loading
2024-10-09 11:03:53 -07:00
Kwindla Hultman Kramer
df2ddb4b91
context management improvements
2024-10-08 17:47:32 -07:00
Kwindla Hultman Kramer
d0cdb496e4
turn on/off openai vad
2024-10-07 22:09:18 -07:00
Kwindla Hultman Kramer
b640b2d024
send user started/stopped speaking event from openai realtime events
2024-10-07 21:00:18 -07:00
Kwindla Hultman Kramer
e856566c30
function call fix and user transcription frames
2024-10-07 18:34:52 -07:00
Kwindla Hultman Kramer
71fe09f7f0
bits of pydantic
2024-10-07 18:32:51 -07:00
Kwindla Hultman Kramer
7ae3c420f4
major functionality working (not configurable, occasional timing bugs maybe)
2024-10-07 18:32:51 -07:00
Kwindla Hultman Kramer
830a36319c
definitely broke something in the pipeline
2024-10-07 18:32:50 -07:00
Kwindla Hultman Kramer
b4214b56b3
lots of debugging statements. multiple function calls broken
2024-10-07 18:32:50 -07:00
Kwindla Hultman Kramer
8565655f08
space exploration prompt
2024-10-07 18:32:50 -07:00
Kwindla Hultman Kramer
fa3a6647ef
configurability via constructor
2024-10-07 18:32:50 -07:00
Kwindla Hultman Kramer
efd3627202
working 19-openai-realtime-beta.py example
2024-10-07 18:32:50 -07:00
Kwindla Hultman Kramer
cc94ec179c
beginning of realtime impl
2024-10-07 18:32:50 -07:00
Aleix Conchillo Flaqué
75d27aeb9f
examples(storytelling): update packages
2024-10-02 12:00:00 -07:00
Aleix Conchillo Flaqué
8c01a900cd
google: allow using GOOGLE_APPLICATION_CREDENTIALS
2024-10-02 11:56:01 -07:00
Allenmylath
70c3ff31fd
Update env.example
...
ElevenLabs is not used in the code; Cartesia is used instead, hence the change
2024-10-02 21:59:51 +05:30
Mark Backman
3d642df2b0
Revert aligning voice_id name in TTS service constructor
2024-10-02 11:07:48 -04:00
Mark Backman
28643b453d
Update to use LLM, STT, TTS subclasses and remove setter methods
2024-10-01 20:30:27 -04:00
Kwindla Hultman Kramer
92d8b37229
implement vision for openai
2024-09-30 21:49:29 -07:00
Kwindla Hultman Kramer
0d5cb84531
function calling testing and improvements
2024-09-30 20:59:28 -07:00
Kwindla Hultman Kramer
a5c73ec829
handle openai multiple function calls
2024-09-30 16:08:30 -07:00
Kwindla Hultman Kramer
5d63615b1b
Merge pull request #528 from pipecat-ai/khk/sentence-splits
...
TTS sentence aggregation fix
2024-09-30 16:07:21 -07:00
Kwindla Hultman Kramer
ed607d5c4b
typo fix
2024-09-30 16:05:32 -07:00
Kwindla Hultman Kramer
37da7e44cd
whitespace fix
2024-09-30 16:05:32 -07:00
Aleix Conchillo Flaqué
c8995b82e5
all frame processors are asynchronous
...
In this commit we make all frame processors asynchronous: each one has an
internal queue and a task that pushes frames from that queue.
2024-09-30 15:11:21 -07:00
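For the asynchronous frame processor change in c8995b82e5 above, the queue-plus-task idea can be sketched roughly as follows. This is a minimal illustration using only asyncio, not the project's actual implementation: the class `QueuedProcessor`, its methods, and the `None` end-of-stream sentinel are all hypothetical names chosen for the example.

```python
import asyncio


class QueuedProcessor:
    """Hypothetical stand-in for a frame processor (not the real class)."""

    def __init__(self, name, downstream=None):
        self.name = name
        self.downstream = downstream
        self.seen = []                  # frames this processor has handled
        self._queue = asyncio.Queue()   # internal queue of pending frames
        self._task = None

    def start(self):
        # The push task drains the internal queue independently of callers.
        self._task = asyncio.create_task(self._push_loop())

    async def queue_frame(self, frame):
        # Callers only enqueue; they never push downstream directly.
        await self._queue.put(frame)

    async def _push_loop(self):
        while True:
            frame = await self._queue.get()
            if frame is None:
                # Assumed end-of-stream sentinel: forward it, then stop.
                if self.downstream is not None:
                    await self.downstream.queue_frame(None)
                break
            self.seen.append(frame)
            if self.downstream is not None:
                await self.downstream.queue_frame(frame)

    async def wait_closed(self):
        await self._task


async def run_demo(frames):
    sink = QueuedProcessor("sink")
    source = QueuedProcessor("source", downstream=sink)
    sink.start()
    source.start()
    for frame in frames:
        await source.queue_frame(frame)
    await source.queue_frame(None)
    await source.wait_closed()
    await sink.wait_closed()
    return sink.seen
```

Calling `asyncio.run(run_demo(["a", "b", "c"]))` passes the frames through both processors in order. Because each processor owns its queue, a slow processor delays only its own task rather than blocking the caller.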
Kwindla Hultman Kramer
6b7f924af6
tts sentence aggregation fix
2024-09-30 14:33:08 -07:00
Mark Backman
97cba92fa5
Merge pull request #516 from pipecat-ai/mb/google-tts
...
Add Google TTS
2024-09-30 12:25:16 -04:00
Mark Backman
830d2df671
Add Google TTS
2024-09-27 07:36:20 -04:00
Aleix Conchillo Flaqué
4501dca133
Merge pull request #467 from joachimchauvet/main
...
Add LiveKit audio transport
2024-09-26 22:58:25 -07:00
Mark Backman
55a70cde8f
Merge pull request #514 from pipecat-ai/mb/aws-polly-tts
...
Add AWS Polly TTS support
2024-09-26 22:20:13 -04:00
Mark Backman
706c00d897
Code review feedback
2024-09-26 22:13:37 -04:00
Mark Backman
a018112a13
Merge pull request #510 from pipecat-ai/mb/deepgram-tts-http
...
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-26 13:38:42 -04:00
Mark Backman
6a6ea251ae
Add AWS Polly TTS support
2024-09-26 13:35:59 -04:00
Mark Backman
73da8c1910
Improve usability of Deepgram TTS: use Deepgram client, remove aiohttp
2024-09-25 22:43:10 -04:00
joachimchauvet
b6e1d6e6ae
format with ruff
2024-09-24 10:21:02 +03:00
joachimchauvet
fa609f1afc
adjust output sample rate and create user token
2024-09-24 10:16:54 +03:00
joachimchauvet
2e5b0c1d6b
add tenacity dependency
2024-09-24 10:16:54 +03:00
joachimchauvet
a9390d96a1
add LiveKit audio transport
2024-09-24 10:16:54 +03:00
Jin Kim
cf72129852
Merge remote-tracking branch 'upstream/main'
2024-09-24 07:18:22 +09:00
Aleix Conchillo Flaqué
eeb8338dce
introduce Ruff formatting
2024-09-23 09:53:37 -07:00
Jin Kim
68cc4186ad
Merge remote-tracking branch 'upstream/main'
2024-09-23 16:34:31 +09:00
Mark Backman
55c645c614
Add voice_settings and optimize_streaming_latency to ElevenLabs
2024-09-22 13:58:50 -04:00
Mark Backman
c73111afea
Add extra input param to LLMs
2024-09-21 00:01:25 -04:00
Mark Backman
e8f8a49646
Merge pull request #484 from pipecat-ai/mb/llm-input-params
...
Add input params for OpenAI, Anthropic, Together AI LLMs
2024-09-20 20:35:49 -04:00
Mark Backman
f3fd312b83
Add Together AI interruptible example
2024-09-20 20:21:19 -04:00
Aleix Conchillo Flaqué
14acf05a26
Merge pull request #480 from pipecat-ai/aleix/input-output-frames
...
introduce input/output audio and image frames
2024-09-20 14:44:37 -07:00
Aleix Conchillo Flaqué
7e39d9ad3d
introduce input/output audio and image frames
...
We now distinguish between input and output audio and image frames. We introduce
`InputAudioRawFrame`, `OutputAudioRawFrame`, `InputImageRawFrame` and
`OutputImageRawFrame` (and other subclasses of those). The input frames usually
come from an input transport and are meant to be processed inside the pipeline
to generate new frames. However, the input frames will not be sent through an
output transport. The output frames can also be processed by any frame processor
in the pipeline and they are allowed to be sent by the output transport.
2024-09-19 23:11:03 -07:00
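The input/output distinction described in 7e39d9ad3d above can be illustrated with a small sketch. The class names `InputAudioRawFrame` and `OutputAudioRawFrame` come from the commit message, but the definitions below are simplified local stand-ins: the fields, the `AudioRawFrame` base, and the `OutputTransportSketch` class are assumptions made for the example, not the project's real code.

```python
from dataclasses import dataclass


@dataclass
class AudioRawFrame:
    # Field names and defaults are assumptions for illustration only.
    audio: bytes
    sample_rate: int = 16000
    num_channels: int = 1


@dataclass
class InputAudioRawFrame(AudioRawFrame):
    """Audio arriving from an input transport; processed inside the pipeline."""


@dataclass
class OutputAudioRawFrame(AudioRawFrame):
    """Audio that an output transport is allowed to send."""


class OutputTransportSketch:
    """Hypothetical output transport that only sends output frames."""

    def __init__(self):
        self.sent = []

    def process(self, frame):
        # Input frames are never sent out; only output frames are.
        if isinstance(frame, OutputAudioRawFrame):
            self.sent.append(frame)
            return True
        return False


transport = OutputTransportSketch()
mic_frame = InputAudioRawFrame(audio=b"\x00\x01")
tts_frame = OutputAudioRawFrame(audio=b"\x02\x03")
sent_input = transport.process(mic_frame)
sent_output = transport.process(tts_frame)
```

Encoding the direction in the type means a frame processor that wants audio played back has to create an output frame explicitly, so microphone audio cannot be echoed to the output by accident.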
mattie ruth backman
a4edb3dab1
Cleanup on aisle METRICS. Note: See below, this is a breaking change
...
1. Fleshed out MetricsFrames and broke it into a proper set of types
2. Add model_name as a property to the AIService so that it can be
automatically included in metrics and also remove that
overhead from all the various services themselves
Breaking change!
Because of the types improvements, the MetricsFrame type has
changed. Each frame will have a list of metrics similar to before
except each item in the list will only contain one type of metric:
"ttfb", "tokens", "characters", or "processing". Previously these
fields would be in every entry but set to None if they didn't apply.
While this changes internal handling of the MetricsFrame, it does NOT
break the RTVI/daily messaging of metrics. That format remains the same.
Also, remember to use model_name for accessing a service's current
model and set_model_name for setting it.
2024-09-19 21:30:34 -04:00
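The breaking change described in a4edb3dab1 above (one metric type per list item instead of every field present and set to None) might look like this when expressed on plain dictionaries. This is only a sketch of the shape change: the four metric names come from the commit message, while the `processor` and `model` keys, the dictionary representation, and the `split_metrics` helper are hypothetical.

```python
METRIC_KINDS = ("ttfb", "tokens", "characters", "processing")


def split_metrics(old_entries):
    """Turn old-style entries into items that carry exactly one metric each.

    Old style (assumed): every metric key is present, None when not applicable.
    New style (assumed): one item per metric that actually has a value.
    """
    new_items = []
    for entry in old_entries:
        for kind in METRIC_KINDS:
            value = entry.get(kind)
            if value is None:
                continue
            new_items.append(
                {
                    "processor": entry.get("processor"),  # hypothetical key
                    "model": entry.get("model"),          # hypothetical key
                    kind: value,
                }
            )
    return new_items


old_style = [
    {
        "processor": "llm",
        "model": "example-model",
        "ttfb": 0.25,
        "tokens": None,
        "characters": None,
        "processing": 0.5,
    }
]
new_style = split_metrics(old_style)
```

Consumers that previously checked each field for None would instead check which single metric key an item carries.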
mattie ruth backman
50b45ac2da
get the test infrastructure running again
...
disable broken tests for now
2024-09-19 20:58:17 -04:00
Aleix Conchillo Flaqué
4f1b06e6b2
pipeline: renamed ParallelTask to SyncParallelPipeline
2024-09-19 01:32:17 -07:00