Technical & Development
Intermediate
hugging-face-datasets
Create and manage datasets with configs and SQL querying
Developer Setup
Setup & Installation
bash
npx skills add https://github.com/huggingface/skills --skill hugging-face-datasets
Overview
What This Skill Does
Manages datasets on the Hugging Face Hub. Handles repo initialization, configuration, streaming row updates, and SQL-based querying and transformation via DuckDB. Works alongside the HF MCP server, which covers discovery and search.
Application
When to use this Skill
- Configuring integration settings for custom agent workflows.
- Optimizing query execution and response latency in production.
- Developing clean, standard-compliant implementations for enterprise services.
- Troubleshooting connection timeouts and authentication handshakes.
- Monitoring API rate limits and execution pipelines programmatically.
Documentation
Hugging Face Dataset Viewer
Use this skill to execute read-only Dataset Viewer API calls for dataset exploration and extraction.
Core workflow
- Optionally validate dataset availability with /is-valid.
- Resolve config + split with /splits.
- Preview with /first-rows.
- Paginate content with /rows using offset and length (max 100).
- Use /search for text matching and /filter for row predicates.
- Retrieve parquet links via /parquet and totals/metadata via /size and /statistics.
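The first steps of this workflow come down to URL construction. The sketch below is a minimal illustration, not the skill's own implementation: the helper names (`viewer_url`, `rows_url`, `workflow_urls`) and the dataset `user/repo` are hypothetical, and it assumes each endpoint accepts `dataset`, `config`, and `split` as query parameters. It only builds URLs and sends no requests.

```python
from urllib.parse import urlencode

BASE_URL = "https://datasets-server.huggingface.co"
MAX_LENGTH = 100  # upper bound on `length` stated in the workflow


def viewer_url(endpoint: str, **params) -> str:
    """Build a URL-encoded GET URL for a Dataset Viewer endpoint."""
    return f"{BASE_URL}{endpoint}?{urlencode(params)}"


def rows_url(dataset: str, config: str, split: str,
             offset: int = 0, length: int = MAX_LENGTH) -> str:
    """URL for one page of /rows; rejects windows outside the stated limits."""
    if offset < 0 or not 1 <= length <= MAX_LENGTH:
        raise ValueError("offset must be >= 0 and length must be in 1..100")
    return viewer_url("/rows", dataset=dataset, config=config,
                      split=split, offset=offset, length=length)


def workflow_urls(dataset: str, config: str, split: str) -> list[str]:
    """URLs for the first four workflow steps, in the listed order.

    In practice `config` and `split` come from the /splits response;
    here they are passed in (hypothetical values) to keep the sketch offline.
    """
    return [
        viewer_url("/is-valid", dataset=dataset),
        viewer_url("/splits", dataset=dataset),
        viewer_url("/first-rows", dataset=dataset, config=config, split=split),
        rows_url(dataset, config, split, offset=0, length=MAX_LENGTH),
    ]


if __name__ == "__main__":
    for url in workflow_urls("user/repo", "default", "train"):
        print(url)
```

Because `urlencode` percent-encodes values, the slash in `user/repo` is emitted as `%2F`, which matches the guidance that query params be URL-encoded.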
Defaults
- Base URL: https://datasets-server.huggingface.co
- Default API method: GET
- Query params should be URL-encoded.
- offset is 0-based.
- length max is usually 100 for row-like endpoints.
- Gated/private datasets require Authorization: Bearer <HF_TOKEN>.
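Two of these defaults, the 0-based offset with a 100-row cap and the bearer token header, can be sketched as small helpers. This is an illustrative sketch only: `page_windows` and `auth_headers` are hypothetical names, and the total row count is assumed to be known in advance (for example from /size).

```python
from typing import Iterator, Optional


def page_windows(total_rows: int, length: int = 100) -> Iterator[tuple[int, int]]:
    """Yield (offset, length) pairs that cover `total_rows` rows.

    Offsets are 0-based and each window holds at most `length` rows,
    with `length` itself capped at 100 as stated in the defaults.
    """
    if not 1 <= length <= 100:
        raise ValueError("length must be in 1..100")
    for offset in range(0, total_rows, length):
        yield offset, min(length, total_rows - offset)


def auth_headers(token: Optional[str] = None) -> dict[str, str]:
    """Return the Authorization header for gated/private datasets, else {}."""
    return {"Authorization": f"Bearer {token}"} if token else {}


if __name__ == "__main__":
    # 250 rows need three requests: two full pages and one partial page.
    print(list(page_windows(250)))
    # "hf_example" is a placeholder, not a real token.
    print(auth_headers("hf_example"))
```

A caller would typically read the token from an environment variable such as `HF_TOKEN` and pass it in, so the helper itself stays free of global state.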
Dataset Viewer
Validate dataset: /is-valid?dataset=<namespace/repo>
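As a rough sketch of the validation call using only the Python standard library: the function names and the dataset `user/repo` are hypothetical, and the shape of the JSON response is not specified in this section, so the result is returned as a plain dict without interpretation.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BASE_URL = "https://datasets-server.huggingface.co"


def is_valid_request(dataset: str, token: Optional[str] = None) -> Request:
    """Prepare a GET request for /is-valid without sending it."""
    url = f"{BASE_URL}/is-valid?{urlencode({'dataset': dataset})}"
    # The bearer token is only needed for gated/private datasets.
    headers = {"Authorization": f"Bearer {token}"} if token else {}
    return Request(url, headers=headers, method="GET")


def fetch_is_valid(dataset: str, token: Optional[str] = None,
                   timeout: float = 10.0) -> dict:
    """Send the request and decode the JSON body (performs network I/O)."""
    with urlopen(is_valid_request(dataset, token), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Prints the prepared URL only; call fetch_is_valid() to hit the API.
    print(is_valid_request("user/repo").full_url)
```

Separating request construction from sending keeps the URL and header logic easy to inspect before any call is made.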
Recommendations