Speech To Speech¶

Introduction¶

rai s2s provides tools and components for voice interaction with the system. This package contains plug-and-play Agents which can be easily integrated with Agents provided by rai core, as well as custom ones. It also provides integration with host sound system, which can be used for low level sound manipulation.

Core Components¶

Component	Description
Agents	Agents in `rai s2s` provide functionality for voice interaction with the rest of the system.
Models	`rai s2s` provides a models which can be optionally installed and utilized by the Agents.
Connector	The `sounddevice` connector allows for interfacing directly with sound devices for asynchronous sound IO.

Best Practices¶

When utilizing S2S features:

Deployment of SpeechToSpeechAgent is meant for local setup, while the SpeechRecognition and TextToSpeech Agents are meant to be ran on separate hosts.
Note that sounddevice python API has notable issues in multi-threaded environment - this can lead to issues when developing Agents using the SoundDeviceConnector