Query federation in Starburst Galaxy

Cast: Lester Martin

Video date: April 6, 2022

Running time:

Abstract: In Starburst Galaxy, you can access data from multiple data sources with a single query. For example, join historic log data stored in S3 object storage with customer data stored in a MySQL relational database.

Product: Starburst Galaxy

Description

A single Starburst Galaxy cluster can connect to data sources of very different types. This allows you to write queries that join data across those sources, wherever they are located. For example, you can write a single query that joins data from an AWS S3 bucket, an Amazon Redshift database, and a traditional MySQL database.
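As a rough illustration of what such a query can look like, the sketch below builds one SQL statement that joins three tables, each addressed by a fully qualified `catalog.schema.table` name. Every catalog, schema, table, and column name here is hypothetical and chosen only for illustration; the names used in the video are not reproduced.

```python
# Hypothetical fully qualified table names (catalog.schema.table).
# Substitute the catalogs you have bound to your own cluster.
S3_LOGS = "s3_logs.web.requests"               # object storage catalog
REDSHIFT_ORDERS = "redshift_sales.public.orders"  # Redshift catalog
MYSQL_CUSTOMERS = "mysql_crm.crm.customers"    # MySQL catalog


def federated_query(logs: str, orders: str, customers: str) -> str:
    """Build a single SQL statement that joins tables from three catalogs."""
    return (
        "SELECT c.customer_name, o.order_id, count(*) AS request_count\n"
        f"FROM {logs} AS l\n"
        f"JOIN {orders} AS o ON l.order_id = o.order_id\n"
        f"JOIN {customers} AS c ON o.customer_id = c.customer_id\n"
        "GROUP BY c.customer_name, o.order_id"
    )


sql = federated_query(S3_LOGS, REDSHIFT_ORDERS, MYSQL_CUSTOMERS)
print(sql)
```

The printed statement could then be pasted into the query editor, provided all three catalogs are bound to the same cluster and the column names match your own tables.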

This is the detailed version of Query federation in brief.

Timestamps

The timestamp links below open the video on YouTube. You can also skip to the same timestamp in the video player above.

  • Welcome - 0:02
  • What is query federation - 0:57
  • Add an S3 catalog - 3:04
  • Bind new catalog to cluster - 4:01
  • Open query editor; test data in new catalog - 4:32
  • Run joined query within one catalog - 5:10
  • Add a Redshift catalog; add and bind to cluster - 5:29
  • Query editor: test Redshift data; add to previous join - 6:27
  • Add MySQL catalog; add and bind to cluster - 7:39
  • Query editor: test MySQL data; add third layer to joins - 8:58
  • Recap and conclusion - 9:42
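
The "test data in new catalog" steps in the outline above can be approximated with a few quick statements run after each catalog is bound to the cluster. The following is a minimal sketch, assuming Trino-style `SHOW SCHEMAS`, `SHOW TABLES`, and `SELECT ... LIMIT` statements; the helper name `smoke_tests` and the catalog, schema, and table names are hypothetical and are not taken from the video.

```python
from typing import List


def smoke_tests(catalog: str, schema: str, table: str) -> List[str]:
    """Return quick check statements for a newly bound catalog."""
    return [
        # Confirm the catalog is visible and list its schemas.
        f"SHOW SCHEMAS FROM {catalog}",
        # List the tables in one schema of that catalog.
        f"SHOW TABLES FROM {catalog}.{schema}",
        # Preview a few rows before adding the table to a join.
        f"SELECT * FROM {catalog}.{schema}.{table} LIMIT 10",
    ]


# Hypothetical names for the three catalogs added in the walkthrough.
checks = []
for catalog, schema, table in [
    ("s3_logs", "web", "requests"),
    ("redshift_sales", "public", "orders"),
    ("mysql_crm", "crm", "customers"),
]:
    checks.extend(smoke_tests(catalog, schema, table))

for statement in checks:
    print(statement)
```

Checking each catalog on its own before extending the join makes it easier to tell a connection or permissions problem apart from a mistake in the join condition.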