Refactor mkdocs.yml and add new documentation for workflow configuration and voice customization
- Restructured the navigation in mkdocs.yml to improve organization, introducing subcategories for assistant creation and component libraries. - Added new documentation for workflow configuration options, detailing setup and best practices. - Introduced new sections for voice recognition and generation, outlining configuration items and recommendations for optimal performance.
This commit is contained in:
24
docs/content/customization/asr.md
Normal file
24
docs/content/customization/asr.md
Normal file
@@ -0,0 +1,24 @@
|
||||
# 语音识别
|
||||
|
||||
语音识别(ASR)负责将用户音频实时转写为文本,供对话引擎理解。
|
||||
|
||||
## 配置项
|
||||
|
||||
| 配置项 | 说明 |
|
||||
|---|---|
|
||||
| ASR 引擎 | 选择语音识别服务提供商 |
|
||||
| 模型 | 识别模型名称 |
|
||||
| 语言 | 中文/英文/多语言 |
|
||||
| 热词 | 提升特定词汇识别准确率 |
|
||||
| 标点与规范化 | 是否自动补全标点、文本规范化 |
|
||||
|
||||
## 建议
|
||||
|
||||
- 客服场景建议开启热词并维护业务词表
|
||||
- 多语言场景建议按会话入口显式指定语言
|
||||
- 对延迟敏感场景优先选择流式识别模型
|
||||
|
||||
## 相关文档
|
||||
|
||||
- [语音配置总览](voices.md)
|
||||
|
||||
25
docs/content/customization/tts.md
Normal file
25
docs/content/customization/tts.md
Normal file
@@ -0,0 +1,25 @@
|
||||
# 语音生成
|
||||
|
||||
语音生成(TTS)负责将助手回复文本转换为可播放音频。
|
||||
|
||||
## 配置项
|
||||
|
||||
| 配置项 | 说明 |
|
||||
|---|---|
|
||||
| TTS 引擎 | 选择语音合成服务提供商 |
|
||||
| 声音/音色 | 选择目标音色或发音人 |
|
||||
| 模型 | 语音合成模型名称 |
|
||||
| 语速 | 播放速度,通常 0.5-2.0 |
|
||||
| 音量/增益 | 输出音量控制 |
|
||||
| 音调 | 声线高低调整 |
|
||||
|
||||
## 建议
|
||||
|
||||
- 对话助手建议保持语速在 `0.9-1.2`
|
||||
- 生产环境建议固定主音色,降低体验波动
|
||||
- 若需要打断能力,优先使用低延迟流式 TTS
|
||||
|
||||
## 相关文档
|
||||
|
||||
- [语音配置总览](voices.md)
|
||||
|
||||
Reference in New Issue
Block a user