skills.vishalvoidskills/vishalvoid
Technical & DevelopmentIntermediate

venice-audio-speech

Text-to-speech models, voices, formats, and streaming

Developer Setup

Setup & Installation

bash
npx skills add https://github.com/veniceai/skills --skill venice-audio-speech

Overview

What This Skill Does

Text-to-speech models, voices, formats, and streaming

Application

When to use this Skill

Documentation

Show Skills.md file

Venice TTS (/audio/speech)

POST /api/v1/audio/speech converts text to an audio stream or file. OpenAI-compatible — the OpenAI SDK's audio.speech.create() works as a drop-in.

Use when

  • You want narration, voice replies, or UI audio from text.
  • You need a specific voice family (ElevenLabs, Kokoro, xAI, Qwen 3, Orpheus, Chatterbox, MiniMax, Inworld, Gemini Flash).
  • You want streaming audio returned sentence-by-sentence.
  • You need style/emotion control on supported models.

For music generation (lyrics + instrumental), see venice-audio-music. For transcription (audio → text), see venice-audio-transcription.

Minimal request

curl https://api.venice.ai/api/v1/audio/speech \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-xai-v1",
    "voice": "eve",
    "input": "Hello, welcome to Venice Voice.",
    "response_format": "mp3",
    "speed": 1.0,
    "streaming": false
  }' --output hello.mp3
Lines 1 - 28 of 147

Recommendations

Explore other random skills

All skillsMy patterns