Segments

Our service offers several types of GPU cards. The cards are grouped into segments by VRAM size, number of CUDA cores, and worker type (Data Center or Community). We offer 5 segments:

  • Basic (representative GPU: Nvidia RTX 3060)
  • Standard (Nvidia RTX 3080)
  • Premium (Nvidia RTX 3090)
  • Max (Nvidia RTX A6000)
  • Highend (Nvidia A100)

All segments are available in two variants: Data Center and Community.

To list the available segments, send a GET request to the endpoint below:

https://api.modelserve.ai/api/v1/clusters/gpu-segments/

cURL:

curl -s -X GET \
     -H 'Accept: application/json' \
     -H 'Content-Type: application/json' \
     -H 'Authorization: Bearer X' \
     'https://api.modelserve.ai/api/v1/clusters/gpu-segments/'

Python:

import requests

r = requests.get(
    "https://api.modelserve.ai/api/v1/clusters/gpu-segments/",
    headers={
        "Accept": "application/json",
        "Content-Type": "application/json",
        "Authorization": "Bearer X",
    },
)
r.raise_for_status()  # raise an exception on a 4xx/5xx response
print(r.json())       # decoded JSON body with the segment list

JavaScript:

fetch('https://api.modelserve.ai/api/v1/clusters/gpu-segments/', {
  "method": "GET",
  "headers": {
    "Accept": "application/json",
    "Content-Type": "application/json",
    "Authorization": "Bearer X"
  }
})
  .then((response) => response.json()) // decode the JSON body
  .then((data) => console.log(data));  // print the segment list

Example response:

{
    "count": 10,
    "next": null,
    "previous": null,
    "results": [
        {
            "id": 1,
            "name": "BASIC",
            "price": null,
            "representative_gpu": "RTX 3060",
            "type": "datacenter"
        },
        {
            "id": 2,
            "name": "STANDARD",
            "price": "0.5070",
            "representative_gpu": "RTX 3080",
            "type": "datacenter"
        },
        {
            "id": 3,
            "name": "PREMIUM",
            "price": "0.6175",
            "representative_gpu": "RTX 3090",
            "type": "datacenter"
        },
        {
            "id": 4,
            "name": "MAX",
            "price": "1.2545",
            "representative_gpu": "RTX A6000",
            "type": "datacenter"
        },
        {
            "id": 5,
            "name": "HIGHEND",
            "price": "4.1470",
            "representative_gpu": "A100",
            "type": "datacenter"
        },
        {
            "id": 6,
            "name": "BASIC COMMUNITY",
            "price": "0.2285",
            "representative_gpu": "RTX 3060",
            "type": "community"
        },
        {
            "id": 7,
            "name": "STANDARD COMMUNITY",
            "price": "0.2925",
            "representative_gpu": "RTX 3080",
            "type": "community"
        },
        {
            "id": 8,
            "name": "PREMIUM COMMUNITY",
            "price": "0.6488",
            "representative_gpu": "RTX 3090",
            "type": "community"
        },
        {
            "id": 9,
            "name": "MAX COMMUNITY",
            "price": "1.0329",
            "representative_gpu": "RTX A6000",
            "type": "community"
        },
        {
            "id": 10,
            "name": "HIGHEND COMMUNITY",
            "price": "3.2850",
            "representative_gpu": "A100",
            "type": "community"
        }
    ]
}
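
The response carries "count", "next" and "previous" fields, which suggests the list may be paginated. The sketch below is one way to collect all segments under that assumption: it keeps requesting pages until "next" is null. That "next" holds the full URL of the following page is an assumption, not something this page confirms, and fetch_json and iter_segments are hypothetical helper names.

```python
import json
import urllib.request


def fetch_json(url, token):
    """GET one page and decode its JSON body (standard library only)."""
    request = urllib.request.Request(
        url,
        headers={
            "Accept": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_segments(first_url, get_page):
    """Yield every segment, following "next" until it is null.

    Assumption: "next" is either null or the URL of the following page.
    get_page is any callable that maps a URL to the decoded page.
    """
    url = first_url
    while url:
        page = get_page(url)
        yield from page["results"]
        url = page.get("next")
```

A call could look like `list(iter_segments(url, lambda u: fetch_json(u, token)))`, with `url` set to the endpoint above and `token` to your Access Token. In the sample response "next" is null, so the loop would stop after the first page.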
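Example values for a single segment:

  • "id": 9 - unique segment ID
  • "name": "MAX COMMUNITY" - segment name
  • "price": "1.0329" - segment price, returned as a string (it can also be null, as for the datacenter BASIC segment in the response above)
  • "representative_gpu": "RTX A6000" - representative card in the segment
  • "type": "community" - segment type ("datacenter" or "community")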
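
Because "price" is a string and can be null, it usually needs converting before comparison. The following is a minimal sketch, not an official client: it picks the cheapest priced segment for each "type" from an already decoded "results" list. The function name cheapest_by_type is hypothetical, and the sample entries are copied from the response above.

```python
from decimal import Decimal


def cheapest_by_type(segments):
    """Return {type: segment} holding the lowest non-null price per type."""
    best = {}
    for segment in segments:
        if segment["price"] is None:  # no price listed, skip it
            continue
        price = Decimal(segment["price"])  # prices are returned as strings
        current = best.get(segment["type"])
        if current is None or price < Decimal(current["price"]):
            best[segment["type"]] = segment
    return best


# A few entries copied from the example response.
sample = [
    {"id": 1, "name": "BASIC", "price": None, "type": "datacenter"},
    {"id": 2, "name": "STANDARD", "price": "0.5070", "type": "datacenter"},
    {"id": 6, "name": "BASIC COMMUNITY", "price": "0.2285", "type": "community"},
    {"id": 7, "name": "STANDARD COMMUNITY", "price": "0.2925", "type": "community"},
]

for segment_type, segment in cheapest_by_type(sample).items():
    print(segment_type, segment["name"], segment["price"])
```

Decimal is used instead of float so that values such as "0.5070" are compared exactly as they were sent.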

Remember to replace the X in "Bearer X" with your real Access Token. To find out where to get your Access Token (Bearer), see the 🚀 Quickstart section.
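
One way to avoid pasting the token into source code is to read it from an environment variable. This is only a sketch: the variable name MODELSERVE_TOKEN and the helper auth_headers are hypothetical and are not defined by the service.

```python
import os


def auth_headers(token=None):
    """Build the request headers for the API calls shown above.

    MODELSERVE_TOKEN is a hypothetical environment variable name.
    """
    token = token or os.environ.get("MODELSERVE_TOKEN")
    if not token or token == "X":
        # Catch the placeholder from the examples before sending a request.
        raise ValueError("Provide a real Access Token instead of the placeholder X")
    return {
        "Accept": "application/json",
        "Authorization": f"Bearer {token}",
    }
```

The returned dictionary can be passed as the `headers` argument of `requests.get` in the Python example.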