skills.vishalvoidskills/vishalvoid
Technical & DevelopmentIntermediate

hugging-face-datasets

Create and manage datasets with configs and SQL querying

Developer Setup

Setup & Installation

bash
npx skills add https://github.com/huggingface/skills --skill hugging-face-datasets

Overview

What This Skill Does

Manages datasets on the Hugging Face Hub. Handles repo initialization, configuration, streaming row updates, and SQL-based querying and transformation via DuckDB. Works alongside the HF MCP server, which covers discovery and search.

Application

When to use this Skill

Documentation

Show Skills.md file

Hugging Face Dataset Viewer

Use this skill to execute read-only Dataset Viewer API calls for dataset exploration and extraction.

Core workflow

  1. Optionally validate dataset availability with /is-valid.
  2. Resolve config + split with /splits.
  3. Preview with /first-rows.
  4. Paginate content with /rows using offset and length (max 100).
  5. Use /search for text matching and /filter for row predicates.
  6. Retrieve parquet links via /parquet and totals/metadata via /size and /statistics.

Defaults

  • Base URL: https://datasets-server.huggingface.co
  • Default API method: GET
  • Query params should be URL-encoded.
  • offset is 0-based.
  • length max is usually 100 for row-like endpoints.
  • Gated/private datasets require Authorization: Bearer <HF_TOKEN>.

Dataset Viewer

  • Validate dataset: /is-valid?dataset=<namespace/repo>
Lines 1 - 25 of 102

Recommendations

Explore other random skills

All skillsMy patterns