GPT-5.4 Mini API
GPT-5.4 Mini optimizes cost/throughput while keeping most of 5.4's reasoning ability. Use for classifiers, summarizers, agents that fan out many calls, and chat surfaces under tight latency budgets.
Quickstart
SkillBoss Wholesale is OpenAI SDK-compatible — point any OpenAI client at https://api.skillboss.co/v1 and pass your wholesale key as a bearer token. Works out of the box with the official OpenAI Python and Node SDKs, Vercel AI SDK, LangChain, LlamaIndex, and most other OpenAI-shaped clients.
bashcurl https://api.skillboss.co/v1/chat/completions \
-H "Authorization: Bearer $SKILLBOSS_WHOLESALE_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "openai/gpt-5.4-mini",
"messages": [
{"role": "user", "content": "Hello"}
]
}'Your first 200 response is the fastest way to confirm setup. From there, swap in your real prompt and tune the model-specific parameters listed below.
Authentication
Every request must include your wholesale key. The header name depends on the endpoint — match the SDK you're using:
bashAuthorization: Bearer $SKILLBOSS_WHOLESALE_KEYTreat the wholesale key like a password — never commit it to source control or ship it in client-side bundles. Rotate from the wholesale dashboard if exposed. Standard (non-wholesale) console keys are rejected at the gateway with 401.
Code examples
pythonimport os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["SKILLBOSS_WHOLESALE_KEY"],
base_url="https://api.skillboss.co/v1",
)
resp = client.chat.completions.create(
model="openai/gpt-5.4-mini",
messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)typescriptimport OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.SKILLBOSS_WHOLESALE_KEY,
baseURL: "https://api.skillboss.co/v1",
});
const resp = await client.chat.completions.create({
model: "openai/gpt-5.4-mini",
messages: [{ role: "user", content: "Hello" }],
});
console.log(resp.choices[0].message.content);Parameters
Standard OpenAI chat completion parameters — pass in the request body as JSON. All fields below work across every chat model on the gateway.
Endpoint
| Method | POST |
| URL | https://api.skillboss.co/v1/chat/completions |
| Auth header | Authorization: Bearer $SKILLBOSS_WHOLESALE_KEY |
| Content-Type | application/json |
| Streaming | Set stream: true in the request body for SSE token-by-token streaming. |
Errors
The API uses standard HTTP status codes:
Pricing
Wholesale pricing is your account-specific discount × vendor list price. Discount rate depends on your contract — see the live numbers on the wholesale dashboard. The dashboard shows your current cost per 1M tokens (or per image / per second) for every model.
No platform markup on standard token billing. Volume tiers + monthly caps are configurable per key.