GCP – Announcing BigQuery repositories: Git-based collaboration in BigQuery Studio
Modern data teams want to use Git to collaborate effectively and adopt software engineering best practices for managing their data pipelines and analytics code. But most tools used by data teams don’t offer integration with Git version control systems, making a Git workflow feel out of reach. This forces users to copy and paste code between UIs, which is not only time-consuming but also error-prone.
To help, we’re introducing repositories in BigQuery in Preview, a new experience in BigQuery Studio that helps data teams collaborate on code stored in Git repositories.
Develop with Git in BigQuery Studio
BigQuery repositories provide a comprehensive set of features to integrate Git workflows directly into your BigQuery environment:
-
Set up new repositories in BigQuery Studio where you can develop SQL queries, Notebooks, data preparation, data canvases, or text files with any file extension.
-
Connect your repositories to remote git hosts like GitHub, GitLab, and other popular Git platforms.
-
Edit the code in your repositories within a dedicated workspace, on your own copy of the code, before publishing changes to branches.
-
Perform most Git operations with a user-friendly interface that lets you inspect differences, commit changes, push updates, and create pull requests — all within BigQuery Studio.
- aside_block
- <ListValue: [StructValue([(‘title’, ‘$300 in free credit to try Google Cloud data analytics’), (‘body’, <wagtail.rich_text.RichText object at 0x3e2d7a064e20>), (‘btn_text’, ‘Start building for free’), (‘href’, ‘http://console.cloud.google.com/freetrial?redirectPath=/bigquery/’), (‘image’, None)])]>
Software engineering best practices for all data practitioners
BigQuery repositories help organizations standardize the way code is developed, version, and deployed. Data teams with members of different levels of technical expertise can all collaborate on the same code base, following the same software engineering best practices.
Data analysts can contribute to code repositories via a simple GUI interface that lets them create workspaces, commit changes, and push code to branches.
Data engineers can develop in BigQuery Studio or with their favourite local IDE on the same codebase.
Data scientists can develop Colab Enterprise notebooks from BigQuery Studio, within their organization’s VPC, but back the code in a remote repository where they can manage versions and ask peers for code reviews.
Getting started
To begin using BigQuery repositories, navigate to BigQuery Studio in the Google Cloud console or visit the documentation for detailed instructions.
Read More for the details.