Querybook is a browser-based data analysis tool that turns SQL queries into natural language reports and graphs called DataDocs. After you clone and build the Querybook GitHub project, it runs as a Docker application. You can connect to Starburst Galaxy, Starburst Enterprise platform (SEP), and Trino clusters.
To run Querybook and connect it to clusters, you must have:
Querybook is a Python-based web application that runs in your browser, but it is
delivered as a Docker application that you build and run locally. As the Docker
application assembles, it pulls Python 3.6+ and the Python trino
package
into the Docker container image. This explains why Querybook does not need
either Python or the trino
package to be installed locally.
There are three steps to install and run Querybook.
Clone Querybook’s GitHub repository:
cd <work-directory>
git clone git@github.com:pinterest/querybook.git
Go into the cloned directory and run make
.
cd querybook
make
The Docker image assembly process can take several minutes.
When the Docker build finishes, it leaves a web server running locally. Open
a browser on https://localhost:10001
.
Querybook can run without configuration, which allows you to create a local login name and password and run the built-in demo.
Querybook configuration can be complex. The web
documentation is also delivered in Markdown
format in docs_website/docs
.
You can make configuration changes with the Admin UI at
https://localhost:10001/admin
or by directly editing configuration files,
then re-running make
.
In general, you must:
The following shows one possible configuration of container components,
specified in a file named local.txt
in the requirements
folder.
Querybook has built-in support for the Trino search engine, so it
only needs to be invoked by name from the engine
folder.
# Engines
-r engine/trino.txt
# Metastores
-r metastore/hms.txt
-r metastore/glue.txt
# Auth
-r auth/oauth.txt
-r auth/ldap.txt
# Result Store
-r platform/aws.txt
Is the information on this page helpful?
Yes
No