Skip to content

Your First Text to Speech Conversion

This guide walks you through creating your first audio file with Speech Studio in just a few steps.

Step-by-Step Instructions

1. Open Speech Studio

Launch Speech Studio from your desktop shortcut or the Start Menu. The main window will open with the text-to-speech interface ready.

Speech Studio main window

2. Enter Your Text

Type or paste the text you want to convert into the large text input area. You can enter anything from a single sentence to multiple paragraphs.

Best Results

For the most natural-sounding output, use properly punctuated text with clear sentence structure. The AI voice engine uses punctuation to determine pauses and intonation.

3. Select a Voice

Click the Voice dropdown to browse available voices. You can filter by:

  • Language -- Choose from 80+ languages and regional dialects
  • Gender -- Male or female voices
  • Voice name -- Each voice has a unique character and tone

Start with a popular voice like en-US-JennyNeural (English, US) for a natural, conversational tone.

4. Adjust Settings (Optional)

Before generating, you can optionally adjust:

  • Speed -- Slow down or speed up the speech rate
  • Pitch -- Raise or lower the voice pitch
  • Volume -- Control the output volume level

5. Generate Audio

Click the Convert button. Speech Studio will send your text to the Azure speech engine and generate the audio. A progress bar indicates the conversion status.

Processing Time

Conversion speed depends on text length and your internet connection. A typical paragraph converts in a few seconds.

6. Preview and Save

Once conversion is complete:

  • Click Play to preview the audio directly in the app
  • Click Save to export the audio file to your preferred location
  • Choose your desired format (MP3, WAV, or OGG)

Next Steps


:octicons-arrow-right-24: Get Speech Studio