BigQuery as a Source Walkthrough

2 Minutes to read

Print
Share
Dark
Light
PDF

Article summary

Did you find this summary helpful?

Thank you for your feedback!

Introduction

This document provides a comprehensive walkthrough in BigQuery as a Source, starting with establishing a new River within Rivery, then proceeding to the selection of extraction modes and the administration of your data storage.
By the end of this guide, you will have a clear understanding of how to effectively harness Rivery's capabilities to extract data from BigQuery and seamlessly merge it into your data ecosystem.

Prerequisite

Please ensure the creation of a correct Connection for BigQuery Source within Rivery.

River Modes

When using BigQuery as a source, you have the choice to select between two River modes:

Multi-Tables (Standard Extraction)
Custom Query.

Multi-Tables (Standard Extraction)

This mode in Rivery combines data from various tables into a single schema before transferring it to the destination. It establishes table relationships to ensure consistent linking and loading. Rivery's Multi-Tables River mode mainly employs SQL queries for transformations, with scheduling or manual triggering options.

To obtain further details about using Multi-Tables (Standard Extraction), kindly refer to our documentation on Databases River Modes.

Custom Query

Rivery's Custom Query River mode empowers users to input data into the platform via personalized SQL queries, offering utmost control over data loading and transformations. Users can specify data and transformations precisely, using SQL, pulling from databases or data warehouses. Data is then scheduled for automatic or on-demand loading into Rivery, ensuring real-time data access.

Please Note:

When using a custom query, specific data types are deliberately treated as strings. The data types labeled as "TO_JSON_STRING" are the ones enclosed in double quotes when converted into strings.
When opting for the Incremental Extract mode, it's crucial to recognize that you have a choice of only 2 options: Datetime and Running number.

To obtain further details about using Custom Query, kindly refer to our documentation on Databases River Modes.

Limitations

Data Export Bucket Region
The region designated in the 'Region' input for the connection must align with the region of the Data export bucket (Custom Filezone).
Project-Based Connection
Rivery enables connections on a per-region basis. When dealing with multiple BigQuery projects, establish a distinct connection for each project.
Mapping for Primary and Foreign Keys
Automatic detection of mapping match keys is restricted to native BigQuery Primary and Foreign key fields only. Custom keys may not be recognized during the mapping process.
Partition and Cluster Fields
The mapping procedure within Rivery does not allocate Partition and Cluster Fields.
Absence of Deleted Records Indication
Data which has been removed is not flagged, leading to the fact that its absence is not indicated within the Target.

Was this article helpful?

What's Next

BigQuery Datatype Mapping

Table of contents

Introduction
Prerequisite
River Modes
Limitations