Technical & DevelopmentIntermediate
transcribe
Transcribe audio files to text with optional speaker diarization
Developer Setup
Setup & Installation
bash
npx skills add https://github.com/openai/skills --skill transcribenpx skills add https://github.com/openai/skills --skill transcribeOr paste this URL into your assistant to install:
Overview
What This Skill Does
Transcribe audio files to text with optional speaker diarization
Application
When to use this Skill
- Integrating transcribe into your development workflow.
- Following best practices for transcribe audio files to text with optional speaker diarization.
- Automating repetitive tasks with AI-assisted tooling.
- Building production-grade applications with proper standards.
- Debugging and troubleshooting common implementation issues.
Documentation
Show Skills.md file
Audio Transcribe
Transcribe audio using OpenAI, with optional speaker diarization when requested. Prefer the bundled CLI for deterministic, repeatable runs.
Workflow
- Collect inputs: audio file path(s), desired response format (text/json/diarized_json), optional language hint, and any known speaker references.
- Verify
OPENAI_API_KEYis set. If missing, ask the user to set it locally (do not ask them to paste the key). - Run the bundled
transcribe_diarize.pyCLI with sensible defaults (fast text transcription). - Validate the output: transcription quality, speaker labels, and segment boundaries; iterate with a single targeted change if needed.
- Save outputs under
output/transcribe/when working in this repo.
Decision rules
- Default to
gpt-4o-mini-transcribewith--response-format textfor fast transcription. - If the user wants speaker labels or diarization, use
--model gpt-4o-transcribe-diarize --response-format diarized_json. - If audio is longer than ~30 seconds, keep
--chunking-strategy auto. - Prompting is not supported for
gpt-4o-transcribe-diarize.
Output conventions
- Use
output/transcribe/<job-id>/for evaluation runs. - Use
--out-dirfor multiple files to avoid overwriting.
Dependencies (install if missing)
Prefer uv for dependency management.
uv pip install openai
Lines 1 - 27 of 75
Recommendations
Explore other random skills
CreativeAdvanced
fullstack-dev
Backend architecture with REST API design, auth flows, real-time features, and database integration
CreativeIntermediate
android-native-dev
Android native development with Kotlin/Jetpack Compose, Material Design 3, and accessibility
CreativeIntermediate
ios-application-dev
iOS development with UIKit, SnapKit, and SwiftUI covering navigation, Dark Mode, and HIG compliance