feat(voice_agent): support voice agent in AgentScope

Support voice agent in three steps:

- Support TTS models
    - [x] Support TTS model #965 
    - [ ] Support to filter text (e.g. markdown)
    - [ ] [TBD] If we need to support the incremental output from TTS model
- Support multimodal models
    - [x] Support omini model in DashScope API #759 
    - [x] Support gpt-audio model in OpenAI API
    - [ ] [Abandoned] Support audio input in the UserAgent class
- Support realtime models #773
    - [x] Support omini-realtime API
    - [x] Support realtime API of OpenAI API

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(voice_agent): support voice agent in AgentScope #773

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

feat(voice_agent): support voice agent in AgentScope #773

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions