Your First Text-to-Speech Conversion
This guide walks you through creating your first audio file with Speech Studio in just a few steps.
Step-by-Step Instructions
1. Open Speech Studio
Launch Speech Studio from your desktop shortcut or the Start Menu. The main window opens with the text-to-speech interface ready.

2. Enter Your Text
Type or paste the text you want to convert into the large text input area. You can enter anything from a single sentence to multiple paragraphs.
Best Results
For the most natural-sounding output, use properly punctuated text with clear sentence structure. The AI voice engine uses punctuation to determine pauses and intonation.
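If your source text comes from copy-and-paste or an export, a small cleanup pass before pasting can help with punctuation. The following is a minimal sketch, not part of Speech Studio; the function name `tidy_text` and the specific cleanup rules are assumptions chosen for illustration.

```python
import re

def tidy_text(text):
    """Light cleanup of text before pasting it into a text-to-speech tool.

    Hypothetical helper: the rules below are illustrative assumptions,
    not something Speech Studio applies itself.
    """
    # Collapse runs of spaces and tabs; line breaks are left alone.
    text = re.sub(r"[ \t]+", " ", text).strip()
    # Remove stray spaces that appear before punctuation marks.
    text = re.sub(r"[ \t]+([,.;:!?])", r"\1", text)
    # Make sure the text ends with sentence-final punctuation.
    if text and text[-1] not in ".!?":
        text += "."
    return text
```

For example, `tidy_text("Hello ,  world")` yields `Hello, world.` with the stray space removed and a closing period added.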
3. Select a Voice
Click the Voice dropdown to browse available voices. You can filter by:
- Language -- Choose from 80+ languages and regional dialects
- Gender -- Male or female voices
- Voice name -- Each voice has a unique character and tone
Start with a popular voice like en-US-JennyNeural (English, US) for a natural, conversational tone.
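The three filters combine to narrow the list: a voice is shown only if it matches every filter you set. The sketch below illustrates that logic on a small hand-written sample; the `Voice` class, the `filter_voices` function, and the four-entry catalogue are assumptions for illustration and do not reflect how Speech Studio stores its voice list.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Voice:
    name: str    # e.g. "en-US-JennyNeural"
    locale: str  # language and region, e.g. "en-US"
    gender: str  # "Female" or "Male"

# Hypothetical sample; the real catalogue in the app is much longer.
VOICES = [
    Voice("en-US-JennyNeural", "en-US", "Female"),
    Voice("en-US-GuyNeural", "en-US", "Male"),
    Voice("en-GB-SoniaNeural", "en-GB", "Female"),
    Voice("de-DE-KatjaNeural", "de-DE", "Female"),
]

def filter_voices(voices, language=None, gender=None, name_contains=None):
    """Return the voices that match every filter that was supplied."""
    result = []
    for v in voices:
        # Language filter matches on the locale prefix ("en" matches "en-US").
        if language and not v.locale.lower().startswith(language.lower()):
            continue
        if gender and v.gender.lower() != gender.lower():
            continue
        if name_contains and name_contains.lower() not in v.name.lower():
            continue
        result.append(v)
    return result
```

Calling `filter_voices(VOICES, language="en", gender="female")` on this sample returns the Jenny and Sonia voices, mirroring what you would see after setting the Language and Gender filters in the dropdown.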
4. Adjust Settings (Optional)
Before generating audio, you can adjust:
- Speed -- Slow down or speed up the speech rate
- Pitch -- Raise or lower the voice pitch
- Volume -- Control the output volume level
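In SSML, the markup language used by Azure speech voices, speed, pitch, and volume correspond to the `rate`, `pitch`, and `volume` attributes of the `<prosody>` element. The sketch below builds such a document by hand. Whether Speech Studio generates its requests this way internally is an assumption, and `build_ssml` is a hypothetical helper name.

```python
from xml.sax.saxutils import escape, quoteattr

def build_ssml(text, voice="en-US-JennyNeural", lang="en-US",
               rate="default", pitch="default", volume="default"):
    """Wrap text in SSML with prosody settings (illustrative sketch).

    rate, pitch, and volume accept SSML values such as "slow", "+10%",
    "-2st", or "loud"; "default" leaves the voice unchanged.
    """
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice)}>'
        f'<prosody rate={quoteattr(rate)} pitch={quoteattr(pitch)} '
        f'volume={quoteattr(volume)}>'
        f'{escape(text)}'  # escape &, <, > so the XML stays well-formed
        '</prosody></voice></speak>'
    )
```

For instance, `build_ssml("Welcome back.", rate="+10%", pitch="-2st")` produces markup asking for slightly faster speech at a pitch two semitones lower.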
5. Generate Audio
Click the Convert button. Speech Studio sends your text to the Azure speech engine, which generates the audio. A progress bar shows the conversion status.
Processing Time
Conversion speed depends on text length and your internet connection. A typical paragraph converts in a few seconds.
6. Preview and Save
Once conversion is complete:
- Click Play to preview the audio directly in the app
- Click Save to export the audio file to your preferred location
- Choose your desired format (MP3, WAV, or OGG)
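If you later script around exported files, it helps to keep the file extension in step with the chosen format. The sketch below shows one way to do that; `output_path` and `SUPPORTED_FORMATS` are hypothetical names, and the set contains only the three formats listed above.

```python
from pathlib import Path

# The three export formats named in the Save step.
SUPPORTED_FORMATS = {"mp3", "wav", "ogg"}

def output_path(folder, base_name, fmt):
    """Build the file path for an exported audio file (illustrative sketch)."""
    # Accept "MP3", ".mp3", or "mp3" and normalize to lowercase.
    fmt = fmt.lower().lstrip(".")
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"Unsupported format: {fmt!r}")
    return Path(folder) / f"{base_name}.{fmt}"
```

For example, `output_path("exports", "intro", "MP3")` gives a path ending in `intro.mp3`, while an unlisted format such as `flac` raises a `ValueError`.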
Next Steps
- Explore Voice Selection to find the perfect voice for your project
- Learn about SSML Support for advanced voice control
- Try Batch Conversion for processing multiple texts at once