Speech To Speech¶
Introduction¶
rai s2s provides tools and components for voice interaction with the system. This package contains plug-and-play Agents which can be easily integrated with Agents provided by rai core, as well as custom ones. It also provides integration with host sound system, which can be used for low level sound manipulation.
Core Components¶
| Component | Description |
|---|---|
| Agents | Agents in rai s2s provide functionality for voice interaction with the rest of the system. |
| Models | rai s2s provides a models which can be optionally installed and utilized by the Agents. |
| Connector | The sounddevice connector allows for interfacing directly with sound devices for asynchronous sound IO. |
Best Practices¶
When utilizing S2S features:
- Deployment of
SpeechToSpeechAgentis meant for local setup, while theSpeechRecognitionandTextToSpeechAgents are meant to be ran on separate hosts. - Note that
sounddevicepython API has notable issues in multi-threaded environment - this can lead to issues when developing Agents using theSoundDeviceConnector