Skip to main content
The ax datasets commands are currently in BETA. The API may change without notice. A one-time warning is emitted on first use. The update command is in ALPHA.
The ax datasets commands let you create, retrieve, and manage datasets on the Arize platform. Supported file formats: CSV, JSON, JSON Lines, Parquet.

ax datasets list

List all datasets in a space.
ax datasets list [--space <id>] [--name <filter>] [--limit <n>] [--cursor <cursor>] [--output <fmt>]
OptionDescription
--space, -sFilter by space name or ID
--name, -nCase-insensitive substring filter on dataset name
--limit, -lMaximum number of results to return (default: 15)
--cursor, -cPagination cursor for the next page
--output, -oOutput format (table, json, csv, parquet) or file path
--verbose, -vEnable verbose logs
Examples:
ax datasets list --space sp_abc123
ax datasets list --space sp_abc123 --output datasets.json

ax datasets create

Create a new dataset from a local file.
ax datasets create --name <name> --space <id> (--file <path> | --json <json-array>) [--output <fmt>]
OptionDescription
--name, -nName for the new dataset
--space, -sSpace name or ID to create the dataset in
--file, -fPath to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin
--jsonInline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]'
--output, -oOutput format (table, json, csv, parquet) or file path
--verbose, -vEnable verbose logs
Examples:
ax datasets create --name "my-eval-set" --space sp_abc123 --file ./examples.csv
ax datasets create --name "my-eval-set" --space my-space --json '[{"q": "What is AI?"}]'

ax datasets get

Retrieve a dataset by name or ID.
ax datasets get <name-or-id> [--space <id>] [--output <fmt>]
OptionDescription
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--output, -oOutput format (table, json, csv, parquet) or file path
--verbose, -vEnable verbose logs
Example:
ax datasets get ds_xyz789
ax datasets get my-eval-set --space my-space

ax datasets update

Rename a dataset.
ax datasets update <name-or-id> --name <new-name> [--space <id>] [--output <fmt>]
OptionDescription
--nameNew name for the dataset
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--output, -oOutput format (table, json, csv, parquet) or file path
--verbose, -vEnable verbose logs
Examples:
ax datasets update ds_xyz789 --name "renamed-eval-set"
ax datasets update my-eval-set --space my-space --name "renamed-eval-set"

ax datasets delete

Delete a dataset.
ax datasets delete <name-or-id> [--space <id>] [--force]
OptionDescription
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--force, -fSkip the confirmation prompt
--verbose, -vEnable verbose logs
Examples:
ax datasets delete ds_xyz789
ax datasets delete my-eval-set --space my-space --force

ax datasets export

Export examples from a dataset to a JSON file.
ax datasets export <name-or-id> [--space <id>] [--version-id <id>] [--output-dir <path>] [--stdout] [--all]
OptionDescription
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--version-idExport examples from a specific dataset version (default: latest)
--output-dirDirectory to write the output file (default: current directory)
--stdoutPrint JSON to stdout instead of saving to a file
--allUse Arrow Flight for bulk export — streams all examples
--verbose, -vEnable verbose logs
Examples:
ax datasets export ds_xyz789
ax datasets export my-eval-set --space my-space --version-id v_123 --output-dir ./exports
ax datasets export ds_xyz789 --all
ax datasets export ds_xyz789 --stdout | jq 'length'

ax datasets append

Append examples to an existing dataset. Provide examples via --json (inline) or --file. Exactly one input source is required.
ax datasets append <name-or-id> (--json <json-array> | --file <path>) [--space <id>] [--version-id <id>] [--output <fmt>]
OptionDescription
--jsonInline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]'
--file, -fPath to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--version-idDataset version to append to (default: latest version)
--output, -oOutput format (table, json, csv, parquet) or file path
--verbose, -vEnable verbose logs
Examples:
# Append from a file
ax datasets append ds_xyz789 --file ./new_examples.csv

# Append inline JSON
ax datasets append ds_xyz789 --json '[{"question": "What is AI?", "answer": "..."}]'

# Append to a specific version
ax datasets append my-eval-set --space my-space --file ./more.jsonl --version-id v_123

ax datasets annotate-examples

Annotate a batch of examples in a dataset. Provide annotations via --file (JSON, JSONL, CSV, or Parquet; use - for stdin). Each record must have a record_id (the dataset example ID) and values (a list of annotation dicts with at least name, plus optionally score, label, or text). Annotations are upserted; up to 1000 examples may be annotated per request.
ax datasets annotate-examples <name-or-id> --file <path> [--space <id>]
OptionDescription
--file, -fPath to a file containing annotation records (JSON, JSONL, CSV, Parquet), or - for stdin
--space, -sSpace name or ID (required when using a dataset name instead of ID)
--verbose, -vEnable verbose logs
Examples:
ax datasets annotate-examples ds_xyz789 --file ./annotations.jsonl
ax datasets annotate-examples my-eval-set --space my-space --file ./annotations.json