Models Endpoint

This endpoint lists the text-to-speech (TTS) models available from each supported provider, along with the features each model supports.

Request

GET /v1/models

Query Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| provider | string | No | Filter models by provider (e.g., aws, google, azure). |

Example Request

curl -X GET "https://api.uberduck.ai/v1/models" \
  -H "Authorization: Bearer YOUR_API_KEY"
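
For readers calling the endpoint from code rather than the shell, the following is a minimal Python sketch of the same request using only the standard library. The helper name `build_models_request` is hypothetical, and the sketch assumes the optional `provider` filter is passed as an ordinary URL query string, as the parameter table suggests. It only builds the request and does not send it.

```python
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://api.uberduck.ai/v1/models"


def build_models_request(api_key: str, provider: Optional[str] = None) -> Request:
    """Build (but do not send) a GET request for the models endpoint."""
    url = BASE_URL
    if provider:
        # Optional filter from the Query Parameters table.
        url = f"{url}?{urlencode({'provider': provider})}"
    # Same bearer-token header as the curl example.
    headers = {"Authorization": f"Bearer {api_key}"}
    return Request(url, headers=headers, method="GET")


request = build_models_request("YOUR_API_KEY", provider="aws")
print(request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the actual call; that step is left out here so the sketch runs without network access or a real key.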

Response

Response Body

| Field | Type | Description |
|-------|------|-------------|
| models | array | Array of model objects. |
| total | integer | Total number of models. |

Model Object

| Field | Type | Description |
|-------|------|-------------|
| id | string | Unique identifier for the model. |
| name | string | Display name of the model. |
| provider_id | string | Identifier for the model provider. |
| provider_name | string | Display name of the model provider. |
| description | string | Description of the model. |
| features | array | Array of features supported by the model. |
| is_default | boolean | Whether this is the default model for the provider. |
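
One way to work with model objects in client code is to map them onto a small typed record. The sketch below mirrors the fields documented above; the `Model` class and its `from_dict` helper are illustrative names, not part of any official SDK, and the defaults used for a missing `features` or `is_default` field are an assumption.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class Model:
    """Client-side mirror of the documented model object."""

    id: str
    name: str
    provider_id: str
    provider_name: str
    description: str
    features: List[str]
    is_default: bool

    @classmethod
    def from_dict(cls, raw: Dict[str, Any]) -> "Model":
        return cls(
            id=raw["id"],
            name=raw["name"],
            provider_id=raw["provider_id"],
            provider_name=raw["provider_name"],
            description=raw["description"],
            # Assumption: treat a missing list or flag as empty / False.
            features=list(raw.get("features", [])),
            is_default=bool(raw.get("is_default", False)),
        )


model = Model.from_dict({
    "id": "azure_neural",
    "name": "Neural",
    "provider_id": "azure",
    "provider_name": "Microsoft Azure",
    "description": "Microsoft's neural voices for human-like, natural speech",
    "features": ["human-like", "prosody", "high-quality"],
    "is_default": True,
})
print(model.id, model.provider_name, model.is_default)
```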

Example Response

{
  "models": [
    {
      "id": "polly_standard",
      "name": "Standard",
      "provider_id": "aws",
      "provider_name": "Amazon Polly",
      "description": "Amazon Polly's standard text-to-speech voices using concatenative synthesis",
      "features": ["fast-synthesis", "low-latency"],
      "is_default": true
    },
    {
      "id": "polly_neural",
      "name": "Neural",
      "provider_id": "aws",
      "provider_name": "Amazon Polly",
      "description": "Amazon Polly's neural text-to-speech voices using deep learning",
      "features": ["natural-sounding", "expressiveness", "high-quality"],
      "is_default": false
    },
    {
      "id": "google_standard",
      "name": "Standard",
      "provider_id": "google",
      "provider_name": "Google Cloud",
      "description": "Google's standard TTS voices",
      "features": ["fast-synthesis", "wide-language-support"],
      "is_default": true
    },
    {
      "id": "google_wavenet",
      "name": "WaveNet",
      "provider_id": "google",
      "provider_name": "Google Cloud",
      "description": "Google's WaveNet voices for more natural sounding speech",
      "features": ["natural-sounding", "high-quality", "expressiveness"],
      "is_default": false
    },
    {
      "id": "google_neural2",
      "name": "Neural2",
      "provider_id": "google",
      "provider_name": "Google Cloud",
      "description": "Google's Neural2 voices with enhanced naturalness and quality",
      "features": ["natural-sounding", "high-quality", "latest-technology"],
      "is_default": false
    },
    {
      "id": "azure_neural",
      "name": "Neural",
      "provider_id": "azure",
      "provider_name": "Microsoft Azure",
      "description": "Microsoft's neural voices for human-like, natural speech",
      "features": ["human-like", "prosody", "high-quality"],
      "is_default": true
    }
  ],
  "total": 6
}
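
Because the response is a flat list, clients that present models per provider need to group it themselves. The sketch below shows one way to do that on a trimmed copy of the example response, keeping only the fields it needs; `group_ids_by_provider` is a hypothetical helper name.

```python
import json
from collections import defaultdict
from typing import Any, Dict, List

# Trimmed copy of the example response (three of the six models).
SAMPLE = json.loads("""
{
  "models": [
    {"id": "polly_standard", "provider_id": "aws", "is_default": true},
    {"id": "polly_neural", "provider_id": "aws", "is_default": false},
    {"id": "azure_neural", "provider_id": "azure", "is_default": true}
  ],
  "total": 3
}
""")


def group_ids_by_provider(payload: Dict[str, Any]) -> Dict[str, List[str]]:
    """Map each provider_id to the model ids listed for it, in response order."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for model in payload["models"]:
        grouped[model["provider_id"]].append(model["id"])
    return dict(grouped)


print(group_ids_by_provider(SAMPLE))
# {'aws': ['polly_standard', 'polly_neural'], 'azure': ['azure_neural']}
```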

Error Codes

| Status Code | Error Code | Description |
|-------------|------------|-------------|
| 400 | invalid_request | Invalid request parameters. |
| 401 | unauthorized | Invalid or missing API key. |
| 429 | rate_limit_exceeded | Rate limit exceeded. |
| 500 | internal_error | Internal server error. |
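
A client typically needs to decide which of these failures are worth retrying. The sketch below maps the documented status codes to their error codes and flags 429 and 500 as retryable. That retry policy is an assumption based on common HTTP practice, not something this page specifies, and the `unknown_error` label for undocumented statuses is a placeholder invented for the example. The format of the error response body is not documented here, so the sketch relies on the status code alone.

```python
from typing import NamedTuple


class Failure(NamedTuple):
    error_code: str
    retryable: bool


# Status codes and error codes from the Error Codes table.
ERROR_CODES = {
    400: "invalid_request",
    401: "unauthorized",
    429: "rate_limit_exceeded",
    500: "internal_error",
}

# Assumption: rate limits and server errors may succeed on a later attempt,
# while 400 and 401 need a corrected request or API key first.
RETRYABLE_STATUSES = frozenset({429, 500})


def classify_status(status_code: int) -> Failure:
    """Translate an HTTP status code into an error code and a retry hint."""
    error_code = ERROR_CODES.get(status_code, "unknown_error")  # placeholder label
    return Failure(error_code, status_code in RETRYABLE_STATUSES)


print(classify_status(429))
print(classify_status(401))
```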

Available Model Features

Models may include one or more of the following features:

| Feature | Description |
|---------|-------------|
| fast-synthesis | Quick audio generation with lower latency. |
| low-latency | Optimized for real-time or near-real-time applications. |
| natural-sounding | More human-like speech quality. |
| expressiveness | Better emotional range and expressivity. |
| high-quality | Higher audio fidelity and clarity. |
| wide-language-support | Supports multiple languages. |
| human-like | Very close to human speech patterns and quality. |
| prosody | Advanced control over intonation, stress, and rhythm. |
| latest-technology | Uses the newest speech synthesis technologies. |
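
Feature tags make it possible to shortlist models on the client side. The following sketch keeps only the models whose `features` array contains every requested tag. The helper name `models_with_features` is hypothetical, and the sample data is a reduced copy of the example response.

```python
from typing import Any, Dict, Iterable, List

# Reduced copy of the example response: ids and feature tags only.
MODELS = [
    {"id": "polly_standard", "features": ["fast-synthesis", "low-latency"]},
    {"id": "polly_neural", "features": ["natural-sounding", "expressiveness", "high-quality"]},
    {"id": "google_wavenet", "features": ["natural-sounding", "high-quality", "expressiveness"]},
    {"id": "google_neural2", "features": ["natural-sounding", "high-quality", "latest-technology"]},
]


def models_with_features(
    models: List[Dict[str, Any]], required: Iterable[str]
) -> List[Dict[str, Any]]:
    """Return the models that advertise all of the required feature tags."""
    required_set = set(required)
    return [m for m in models if required_set.issubset(m.get("features", []))]


matches = models_with_features(MODELS, ["natural-sounding", "expressiveness"])
print([m["id"] for m in matches])
# ['polly_neural', 'google_wavenet']
```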

Usage Notes

  • The models endpoint is useful for discovering available models and their capabilities.
  • When using the text-to-speech endpoint, you'll need to specify a model ID from this list.
  • Each model is compatible with specific voices. Check the voice details to confirm compatibility.
  • Provider-specific features and parameters are available for each model. See the text-to-speech endpoint documentation for details.
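
As an illustration of the notes above, the sketch below chooses a model ID to send to the text-to-speech endpoint. It validates an explicitly requested ID against the list and otherwise falls back to the provider's `is_default` model. The function name `pick_model_id` and the final fallback to the first listed model are assumptions made for this example, not documented behavior.

```python
from typing import Any, Dict, List, Optional

# Reduced copy of the example response.
MODELS = [
    {"id": "polly_standard", "provider_id": "aws", "is_default": True},
    {"id": "polly_neural", "provider_id": "aws", "is_default": False},
    {"id": "google_standard", "provider_id": "google", "is_default": True},
]


def pick_model_id(
    models: List[Dict[str, Any]],
    provider_id: str,
    preferred_id: Optional[str] = None,
) -> str:
    """Pick a model ID for a provider, preferring an explicit choice."""
    candidates = [m for m in models if m["provider_id"] == provider_id]
    if not candidates:
        raise LookupError(f"no models listed for provider {provider_id!r}")
    if preferred_id is not None:
        # Only accept IDs that the models endpoint actually returned.
        if any(m["id"] == preferred_id for m in candidates):
            return preferred_id
        raise LookupError(f"{preferred_id!r} is not listed for {provider_id!r}")
    for model in candidates:
        if model.get("is_default"):
            return model["id"]
    # Assumption: with no default flagged, use the first listed model.
    return candidates[0]["id"]


print(pick_model_id(MODELS, "aws"))
print(pick_model_id(MODELS, "aws", preferred_id="polly_neural"))
```

Checking voice compatibility for the chosen model is a separate step that depends on the voice details, which this sketch does not cover.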