Your First Text to Speech Conversion¶

This guide walks you through creating your first audio file with Speech Studio in just a few steps.

Step-by-Step Instructions¶

Speech Studio Main Interface

1. Open Speech Studio¶

Launch Speech Studio from your desktop shortcut or the Start Menu. The main window will open with the text-to-speech interface ready.

2. Enter Your Text¶

Type or paste the text you want to convert into the large text input area. You can enter anything from a single sentence to multiple paragraphs.

Best Results

For the most natural-sounding output, use properly punctuated text with clear sentence structure. The AI voice engine uses punctuation to determine pauses and intonation.

3. Select a Voice¶

Click the Voice dropdown to browse available voices. You can filter by:

Language -- Choose from 80+ languages and regional dialects
Gender -- Male or female voices
Voice name -- Each voice has a unique character and tone

Start with a popular voice like en-US-JennyNeural (English, US) for a natural, conversational tone.

4. Adjust Settings (Optional)¶

Before generating, you can optionally adjust:

Speed -- Slow down or speed up the speech rate
Pitch -- Raise or lower the voice pitch
Volume -- Control the output volume level

5. Generate Audio¶

Click the Convert button. Speech Studio will send your text to the Azure speech engine and generate the audio. A progress bar indicates the conversion status.

Processing Time

Conversion speed depends on text length and your internet connection. A typical paragraph converts in a few seconds.

6. Preview and Save¶

Once conversion is complete:

Click Play to preview the audio directly in the app
Click Save to export the audio file to your preferred location
Choose your desired format (MP3, WAV, or OGG)

Next Steps¶

Explore Voice Selection to find the perfect voice for your project
Learn about SSML Support for advanced voice control
Try Batch Conversion for processing multiple texts at once

Get Speech Studio