Using the CLI

Create Inference Service

Our platform always selects the least expensive instance type that satisfies your stated requirements. You can use any container image, private or public. If the image is private, remember to add your registry via the CLI first. Currently, we only support workloads with CUDA version 12.3.

Create an inference service that deploys an nginx container running on two CPU cores and serving requests on port 80:

perian inference create --image nginx --cores 2 --container-port 80

Create an inference service with more specific hardware requirements:

perian inference create --image nginx --cores 8 --memory 45 --accelerator-type V100 --container-port 80

Create an inference service from a model template:

perian inference create --model-template-id [model-template-id]
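A common workflow is to look up a template first and then deploy it. A minimal sketch, assuming a template name such as `stable-diffusion` exists in your catalog (the name here is only an illustration; copy the real UUID from the `perian template get` output):

```shell
# Find a template by name (the name below is a hypothetical example).
perian template get --name stable-diffusion

# Deploy it using the UUID shown in the output above.
perian inference create --model-template-id [model-template-id]
```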

Get Inference Service Details

Display all created inference services for your account:

perian inference get --all

Display information for a specific inference service:

perian inference get [inference-service-id]

Replace [inference-service-id] with the UUID of an inference service you have previously created, or use the --last flag.

Cancel Inference Service

Cancel an inference service you have started:

perian inference cancel [inference-service-id]

Replace [inference-service-id] with the UUID of an inference service you have previously created.

This will cancel the inference service immediately.

Create Jobs

Create a job on an instance type with 2 CPU cores:

perian job create --image hello-world --cores 2

Create a job with more specific hardware requirements:

perian job create --image hello-world --cores 8 --memory 45 --accelerator-type V100

Get Job Details

Display all created jobs for your account:

perian job get --all

Display information for a specific job:

perian job get [job-id]

Replace [job-id] with the UUID of a job you have previously created, or use the --last flag.
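The --last flag is handy in scripts, since it saves you from copying UUIDs between commands. A short sketch using only the flags shown above:

```shell
# Create a job, then inspect the most recently created one
# without copying its UUID.
perian job create --image hello-world --cores 2
perian job get --last
```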

Cancel Jobs

Cancel a job you have started:

perian job cancel [job-id]

Replace [job-id] with the UUID of a job you have previously created.

This will cancel the job immediately.
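For bulk cleanup you can combine the commands above in a loop. This is only a sketch: it assumes the `perian job get --all` output contains one UUID per job, and the grep pattern merely illustrates extracting UUIDs from text; adapt it to the CLI's actual output format.

```shell
# Cancel every job listed for the account (assumes UUIDs appear
# verbatim in the `perian job get --all` output).
perian job get --all \
  | grep -oE '[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}' \
  | while read -r id; do
      perian job cancel "$id"
    done
```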

Browse Model Templates

Browse through all available predefined model templates:

perian template get

Search for a model template by name:

perian template get --name [model-template-name]

Browse Instance Types

Browse through all available instance types.

Search for a specific number of cores and gigabytes of RAM:

perian instance-type get --cores 4 --memory 16

Search for a number of accelerators (GPUs):

perian instance-type get --accelerators 2

Search for a specific accelerator type:

perian instance-type get --accelerator-type A100

Search criteria can also be combined:

perian instance-type get --accelerators 2 --accelerator-type A100

Browse Accelerator Types

Get a list of all available accelerator types:

perian accelerator-type get --all

Search for a specific amount of accelerator memory:

perian accelerator-type get --memory 32

Get Billing Information

You can always check your total consumption and bill for the latest month:

perian billing get

You can also specify a timeframe to get the bill for:

perian billing get --start-time 2024-05-13 --end-time 2024-05-14
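In scripts you can compute the timeframe instead of hardcoding dates. A sketch assuming GNU `date` (the echoed line is the command you would then run):

```shell
# Build an ISO-formatted date range covering the past week.
START=$(date -d '7 days ago' +%F)
END=$(date +%F)
echo "perian billing get --start-time $START --end-time $END"
```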

Getting Help

If you encounter any issues or have questions, please contact our support team at [email protected]. Feel free to ask questions in our Slack community and exchange thoughts and feedback with us and other users.