---
title: FAQ
url: https://docs.vultr.com/products/compute/serverless-inference/faq
description: Frequently asked questions and answers about Vultrs products, services, and platform features.
publish_date: 2024-09-23T20:20:47.094379Z
last_updated: 2026-05-26T19:09:55.007444Z
---

# Frequently Asked Questions (FAQs) About Vultr Serverless Inference

These are the frequently asked questions for Vultr Serverless Inference.

???+ note "Can I run inference workloads for models other than large language models on Vultr Serverless Inference?"

    Vultr Serverless Inference supports a growing catalog of production-ready models across multiple categories, including large language models, chat-optimized models, code generation models, text-to-speech models, and image generation models. The available model list is regularly updated as new models are added. To view the current supported models, navigate to the **Serverless Inference** section in the [Vultr Console](https://console.vultr.com) and check the model selector in the **Prompt** tab.

??? note "How do I monitor the usage and cost of my Vultr serverless inference subscription?"

    You can monitor your usage and costs by navigating to the "Usage" tab of your Vultr Serverless Inference subscription in the Vultr Console. Here, you will find details on your current token usage, overage, and any associated costs. You can also view your API key and other subscription details in the "Overview" tab.

??? note "Can I integrate Vultr serverless inference with my existing ML pipeline?"

    Yes, you can integrate Vultr Serverless Inference with your existing machine learning pipeline. To do this, replace your current inference API URL (such as OpenAI's base API URL) with Vultr’s API URL. Then, use your Vultr API key for authentication to seamlessly incorporate Vultr Serverless Inference into your workflow.

??? note "How do I regenerate my Vultr serverless Inference API key?"

    You can regenerate your Vultr Serverless Inference API key from the Overview page in the Vultr Console. This will invalidate the previous API key and generate a new one for enhanced security.

??? note "Why am I not getting high-quality outputs from Vultr serverless inference?"

    The quality of the outputs from Vultr Serverless Inference depends on the machine learning model you are using. If the outputs are not meeting your expectations, consider trying a different model or refining your prompts. Vultr provides the infrastructure, but the model's performance is a key factor in the output quality.

??? note "Is there a way to test inference before committing to a large workload?"

    Yes, you can test inference workloads by using the "Prompt" tab in the Vultr Serverless Inference section of the Vultr Console. This allows you to input prompts, select a model, and adjust settings such as max tokens and temperature to see how the model responds before running larger workloads.

??? note "How secure is my data when using Vultr serverless inference?"

    Vultr takes data security seriously. All data transmitted to and from Vultr Serverless Inference is encrypted, and the subscription is designed with security best practices to ensure that your data and workloads are protected.