Python SDKAPI ReferenceComponents
PublicEndpoint - Python SDK
PublicEndpoint - Python SDK
PublicEndpoint method reference
The Python SDK and docs are currently in beta. Report issues on GitHub.
Information about a specific model endpoint
Fields
| Field | Type | Required | Description | Example |
|---|---|---|---|---|
context_length | int | ✔️ | N/A | |
latency_last_30m | Nullable[components.PercentileStats] | ✔️ | Latency percentiles in milliseconds over the last 30 minutes. Latency measures time to first token. Only visible when authenticated with an API key or cookie; returns null for unauthenticated requests. | {"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3} |
max_completion_tokens | Nullable[int] | ✔️ | N/A | |
max_prompt_tokens | Nullable[int] | ✔️ | N/A | |
model_id | str | ✔️ | The unique identifier for the model (permaslug) | openai/gpt-4 |
model_name | str | ✔️ | N/A | |
name | str | ✔️ | N/A | |
pricing | components.Pricing | ✔️ | N/A | |
provider_name | components.ProviderName | ✔️ | N/A | OpenAI |
quantization | Nullable[components.PublicEndpointQuantization] | ✔️ | N/A | fp16 |
status | Optional[components.EndpointStatus] | ➖ | N/A | 0 |
supported_parameters | List[components.Parameter] | ✔️ | N/A | |
supports_implicit_caching | bool | ✔️ | N/A | |
tag | str | ✔️ | N/A | |
throughput_last_30m | Nullable[components.PercentileStats] | ✔️ | N/A | {"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3} |
uptime_last_1d | Nullable[float] | ✔️ | Uptime percentage over the last 1 day, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
uptime_last_30m | Nullable[float] | ✔️ | N/A | |
uptime_last_5m | Nullable[float] | ✔️ | Uptime percentage over the last 5 minutes, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. |