skills.vishalvoidskills/vishalvoid
Technical & DevelopmentIntermediate

venice-audio-transcription

Audio transcription models and speech-to-text options

Developer Setup

Setup & Installation

bash
npx skills add https://github.com/veniceai/skills --skill venice-audio-transcription

Overview

What This Skill Does

Audio transcription models and speech-to-text options

Application

When to use this Skill

Documentation

Show Skills.md file

Venice Transcription (/audio/transcriptions)

POST /api/v1/audio/transcriptions takes an audio file and returns text. It's OpenAI-compatible with multipart/form-data — the OpenAI SDK's audio.transcriptions.create() works unchanged.

Use when

  • You need STT (speech-to-text) for voice notes, meetings, podcasts, short audio.
  • You need timestamps for subtitles / chapters.
  • You want to pick between fast local-style models (Parakeet) and large multilingual ones (Whisper, Wizper, Scribe).

For long video / YouTube transcription, see venice-video's /video/transcriptions (takes a public video URL directly).

Minimal request

curl https://api.venice.ai/api/v1/audio/transcriptions \
  -H "Authorization: Bearer $VENICE_API_KEY" \
  -F "file=@./meeting.m4a" \
  -F "model=nvidia/parakeet-tdt-0.6b-v3" \
  -F "response_format=json" \
  -F "timestamps=false"
{ "text": "Alright everyone, let's kick off the meeting..." }
Lines 1 - 26 of 101

Recommendations

Explore other random skills

All skillsMy patterns