Skip to content

SpeechModelSpec

Defined in: core/media-types.ts:245

Specification for a speech generation model.

optional defaultFormat: AudioFormat

Defined in: core/media-types.ts:271

Default format if not specified


optional defaultVoice: string

Defined in: core/media-types.ts:268

Default voice if not specified


displayName: string

Defined in: core/media-types.ts:253

Human-readable display name


optional features: object

Defined in: core/media-types.ts:274

Additional feature flags

optional languages: number

Number of supported languages

optional multiSpeaker: boolean

Supports multi-speaker output

optional voiceInstructions: boolean

Supports voice instructions/steering


formats: AudioFormat[]

Defined in: core/media-types.ts:262

Supported audio formats


maxInputLength: number

Defined in: core/media-types.ts:265

Maximum input text length (characters)


modelId: string

Defined in: core/media-types.ts:250

Model identifier


pricing: SpeechModelPricing

Defined in: core/media-types.ts:256

Pricing information


provider: string

Defined in: core/media-types.ts:247

Provider identifier (e.g., “openai”, “gemini”)


voices: string[]

Defined in: core/media-types.ts:259

Available voice options