Explore our guides and technical documentation to get the most out of Voice Air
This guide will help you get started with Voice Air. We will cover everything, starting with creating an account, and then moving on to cloning your first voice using our platform.
Once you are ready to create a production in Speech Synthesis. Here, you will be able to generate your first audio outputs using the best voices in our library along with any voices you have cloned.
We will also touch on prompting techniques (how to influence the AI’s performance), as well as its current limitations and challenges and provide some troubleshooting and various tips-and-tricks.
How does the AI model work?
The AI models have been trained on a vast amount of audio and are suitable for a wide variety of tasks, ranging from more natural and conversational speech to more dramatic audiobook readings. We offer multiple voices each one has been trained differently and has its own strengths.
It is important to note is that the AI understands context. This means that if you write something in the style of a book, the AI will try to interpret how to perform a certain passage from the context of the writing itself. To achieve a wider emotive range, you can lower the stability slider, although this may sacrifice some degree of predictability.
With each successive update to the model, where it has been re-trained, the AI gets better and better at understanding different contexts as its dataset grows. This will help it understand more nuances between humans, languages, and accents