How to Use the Text-to-Speech Endpoint in Vultr Serverless Inference

Updated on November 27, 2024

Vultr Serverless Inference text-to-speech endpoint converts text into spoken audio using advanced AI models. This service enables users to integrate high-quality, natural-sounding speech synthesis into their applications, enhancing accessibility and user engagement through seamless audio output. By leveraging this feature, you can provide an auditory experience that is both clear and engaging, making your applications more user-friendly and inclusive.

Follow this guide to utilize the text-to-speech endpoint on your Vultr account using the Vultr Customer Portal.

  • Vultr Customer Portal
  1. Navigate to Products, click Serverless, and then click Inference.

    Serverless Inference option in products menu

  2. Click your target inference service to open its management page.

    Selection of a target serverless inference service

  3. Open the Text-to-Speech page.

    Button to open the text-to-speech endpoint page

  4. Select a preferred model.

    Button to select preferred text-to-speech model

  5. Select a preferred voice.

    Button to select a preffered voice for output generation

  6. Provide an input and click on Prompts

    Field to provide input for text-to-speech conversion

  7. Click Reset to provide a new input.

    Button to reset the input field for a new text input