Starburst AI data product enrichment #
Starburst Galaxy supports data product enrichment using Starburst AI
Agent. This feature automatically generates descriptions for data products and
datasets using metadata and relationships between data elements.
Requirements #
To use the data product enrichment feature, you need:
- A role that has been granted the privileges to access a cluster and query the
underlying schema of the data product that you want to enrich.
Enrich a data product #
Follow these steps to generate metadata for a data product:
- In the Galaxy navigation menu, click Data > Data products.
- Click the
Enrich with AI button to open the Datasets
section of the Enrich data product dialog.
-
Click
AI generate all datasets to enrich each dataset within the
data product:
- Column descriptions are generated based on their names, types, any existing
descriptions, and the surrounding context such as dataset and data product
names.
- Dataset descriptions are generated based on column descriptions, existing
dataset descriptions (if available), and context.
To generate metadata for a specific dataset, select it, then choose
AI generate this dataset from the
AI generate all drop-down menu.
- Click Next.
-
In the Data product details section, click
AI generate all fields.
This generates a description for the data product, based on its name, current
description (if any), and its datasets and their descriptions. The
Summary is based on the generated description.
- Click Save.
Enrich a specific field #
To enrich a specific dataset field using AI agent:
- Navigate to the Data products pane and select a data product.
- Click
Enrich with AI.
- Click the AI generate button for the field you want to generate metadata
for. You can generate metadata for an individual column, all columns, or the
dataset description.
Resetting dataset fields #
You can reset metadata fields during the initial AI generation, either for all
datasets or for specific ones.
To reset a single field within a dataset, navigate to the field and click the
corresponding autorenew reset button.
To reset all fields in a specific dataset:
- Navigate to the Data products pane and select a data product.
- Select the dataset you want to reset.
- Click
Enrich with AI.
- From the
AI generate all datasets drop-down menu, select
Reset this dataset.
- Click autorenew Reset this
dataset.
- Click Next.
- In the Data product details section, select Reset values for all
fields from the
AI generate all datasets drop-down menu.
- Click autorenew Reset values for
all fields.
- Click Save.
To reset all fields across all datasets:
- Navigate to the Data products pane and select a data product.
- Click
Enrich with AI.
- From the
AI generate all datasets drop-down menu, select
Reset all.
- Click autorenew Reset all to
reset all fields across all datasets in the selected data product.
- Click Next.
- In the Data product details section, select Reset values for all
fields from the
AI generate all datasets drop-down menu.
- Click autorenew Reset values for
all fields.
- Click Save.
Is the information on this page helpful?