skills.vishalvoidskills/vishalvoid
Technical & DevelopmentIntermediate

azure-ai-contentunderstanding-py

Multimodal content extraction

Developer Setup

Setup & Installation

bash
npx skills add https://github.com/microsoft/skills --skill azure-ai-contentunderstanding-py

Overview

What This Skill Does

Azure AI Content Understanding SDK for Python. Connects to Azure's multimodal AI service to extract structured content from documents, images, audio, and video. Returns markdown, transcripts, key frames, and typed fields via a polling-based async API.

Application

When to use this Skill

Documentation

Show Skills.md file

Azure AI Content Understanding SDK for Python

Multimodal AI service that extracts semantic content from documents, video, audio, and image files for RAG and automated workflows.

Installation

pip install azure-ai-contentunderstanding

Environment Variables

CONTENTUNDERSTANDING_ENDPOINT=https://<resource>.cognitiveservices.azure.com/  # Required for all auth methods
AZURE_TOKEN_CREDENTIALS=prod # Required only if DefaultAzureCredential is used in production

Authentication & Lifecycle

🔑 Two rules apply to every code sample below:

  1. Prefer DefaultAzureCredential. It works locally (Azure CLI / VS Code / Developer CLI) and in Azure (managed identity, workload identity) with no code change. Avoid connection strings, account/API keys — they bypass Entra audit and rotation.
    • Local dev: DefaultAzureCredential works as-is.
    • Production: set AZURE_TOKEN_CREDENTIALS=prod (or AZURE_TOKEN_CREDENTIALS=<specific_credential>) to constrain the credential chain to production-safe credentials.
  2. Wrap every client in a context manager so HTTP transports, sockets, and token caches are released deterministically:
Lines 1 - 25 of 282

Recommendations

Explore other random skills

All skillsMy patterns