Technical & DevelopmentIntermediate
venice-responses
OpenAI-compatible Responses API for Venice
Developer Setup
Setup & Installation
bash
npx skills add https://github.com/veniceai/skills --skill venice-responsesnpx skills add https://github.com/veniceai/skills --skill venice-responsesOr paste this URL into your assistant to install:
Overview
What This Skill Does
OpenAI-compatible Responses API for Venice
Application
When to use this Skill
- Configuring integration settings for custom agent workflows.
- Optimizing query execution and response latency in production.
- Developing clean, standard-compliant implementations for enterprise services.
- Troubleshooting connection timeouts and authentication handshakes.
- Monitoring API rate limits and execution pipelines programmatically.
Documentation
Show Skills.md file
Venice Responses API (Alpha)
POST /api/v1/responses is Venice's OpenAI-compatible Responses endpoint. It returns a structured, typed output array instead of a single message.content string — ideal for agents that need to separate reasoning, messages, tool calls, and built-in tool events.
Alpha. Access is gated behind the
responsesApiEnabledflag on Bearer API keys (staff-only during beta). x402 wallet auth bypasses this flag — you can pay per request without the flag. Schemas may change.
Use when
- You need the OpenAI Responses-style response shape (
output[]with typedtype: "reasoning" | "message" | "function_call" | "web_search_call"blocks) for a client library that expects it. - You want clean separation of reasoning vs message vs tool-call output.
- You want streaming via SSE with typed events.
Otherwise use venice-chat — it has more features, more models, and full Venice parameters.
Limitations vs /chat/completions
| Limitation | Detail |
|---|---|
| Stateless | No conversation persistence across requests. Send the full history each call. |
| E2EE models default to rejection | E2EE-capable models return 400 unless you pass venice_parameters.enable_e2ee: false (TEE-only mode). For end-to-end encrypted inference with E2EE headers, use /chat/completions. |
Subset of venice_parameters |
character_slug, enable_e2ee, enable_web_search, enable_web_scraping, enable_web_citations, include_venice_system_prompt, include_search_results_in_stream are supported. strip_thinking_response, disable_thinking, enable_x_search are not wired through in Alpha. |
| Access gated by feature flag | Bearer keys without responsesApiEnabled get 401. x402 requests are allowed (pay-per-call). |
Authentication
Lines 1 - 25 of 164
Recommendations