Speech To Speech¶
Introduction¶
rai s2s
provides tools and components for voice interaction with the system. This package contains plug-and-play Agents which can be easily integrated with Agents provided by rai core
, as well as custom ones. It also provides integration with host sound system, which can be used for low level sound manipulation.
Core Components¶
Component | Description |
---|---|
Agents | Agents in rai s2s provide functionality for voice interaction with the rest of the system. |
Models | rai s2s provides a models which can be optionally installed and utilized by the Agents. |
Connector | The sounddevice connector allows for interfacing directly with sound devices for asynchronous sound IO. |
Best Practices¶
When utilizing S2S features:
- Deployment of
SpeechToSpeechAgent
is meant for local setup, while theSpeechRecognition
andTextToSpeech
Agents are meant to be ran on separate hosts. - Note that
sounddevice
python API has notable issues in multi-threaded environment - this can lead to issues when developing Agents using theSoundDeviceConnector