BigQuery Documentation Guide

Enable the BigQuery sandbox | Google Cloud

Product overview

How does BigQuery work?

Get started

Use the BigQuery sandbox

Quickstarts

Try the Cloud console

Try the command-line tool

Explore BigQuery tools

Migrate

Overview

Migrate a data warehouse

Migrate SQL

Migration guides

Amazon Redshift

Apache Hive

IBM Netezza

Netezza is a data warehouse system that offers analytics, AI, and machine learning (ML) capabilities. It's a subsidiary of IBM, and is available on IBM Cloud, AWS, and Microsoft Azure.

Features

Scalability: Scales up and down based on usage
Open formats: Supports open formats like Parquet and Iceberg for secure data sharing
In-database analytics: Allows users to run complex queries and build models directly in the database
Geospatial capabilities: Built-in geospatial capabilities for analyzing data
Solid-state disks: Data is stored on solid-state disks (SSDs) that are self-encrypting drives (SEDs)
Migrate from IBM Netezza
SQL translation reference

Oracle

Snowflake

Teradata

Design

Datasets

Tables

BigQuery tables

External tables

Views

Logical views

Materialized views

Routines

Connections

Indexes

Search indexes

Vector indexes

Load, transform, and export

Introduction

Load data

Introduction

BigQuery Data Transfer Service

Introduction
Data location and transfers
Authorize transfers
Enable transfers
Manage transfers
Transfer run notifications
Troubleshoot transfer configurations
Use service accounts
Use third-party transfers
Use custom organization policies
Data source change log
Transfer guides
- Amazon S3
- Azure Blob Storage
- Campaign Manager
  - Schedule transfers
  - Report transformation
- Cloud Storage
- Comparison Shopping Service Center
- Display & Video 360
  - Schedule transfers
  - Report transformation
- Facebook Ads
  - Schedule transfers
  - Report transformation
- Google Ad Manager
  - Schedule transfers
  - Report transformation
- Google Ads
  - Schedule transfers
  - Report transformation
- Google Merchant Center
  - Introduction
  - Schedule transfers
  - Transfer report schema
- Google Play
  - Schedule transfers
  - Transfer report transformation
- Oracle
  - Schedule transfers
- Salesforce
  - Schedule transfers
- Salesforce Marketing Cloud
  - Schedule transfers
- Search Ads 360
- ServiceNow
  - Schedule transfers
- YouTube channel
  - Schedule transfers
  - Transfer report transformation
- YouTube content owner
  - Schedule transfers
  - Transfer report transformation

Batch load data

Write and read data with the Storage API

Transform data

Introduction

Prepare data

Transform data with workflows

Export data

Analyze

Introduction

Explore your data

Query BigQuery data

Query data with SQL

Use geospatial analytics

Introduction
Work with geospatial analytics
Best practices for spatial analysis
Visualize geospatial data
Grid systems for spatial analysis
Geospatial analytics syntax reference
Geospatial analytics tutorials
- Get started with geospatial analytics
- Use geospatial analytics to plot a hurricane's path

Search data

Work with queries

Save queries

Continuous queries

Work with sessions

Optimize queries

Query external data sources

Manage open source metadata

Use external tables and datasets

Amazon S3 data
- Query Amazon S3 data
- Export query results to Amazon S3
Query Apache Iceberg data
Query open table formats with manifests
Azure Blob Storage data
- Query Azure Blob Storage data
- Export query results to Azure Blob Storage
Query Cloud Bigtable data
Cloud Storage data
- Query Cloud Storage data in BigLake tables
- Query Cloud Storage data in external tables
Work with Salesforce Data Cloud data
Query Google Drive data
Create AWS Glue federated datasets
Create Spanner external datasets

Run federated queries

Use notebooks

Introduction

Use Colab notebooks

Use DataFrames

Use Jupyter notebooks

Use analysis and BI tools

Google Cloud Ready - BigQuery

Entity resolution

AI and machine learning

Introduction

Generative AI and pretrained models

Choose generative AI and task-specific functions

Generative AI

Overview

Tutorials

Task-specific solutions

Overview

Tutorials

Natural language processing
- Understand text
- Translate text
Document processing
- Process documents
- Parse PDFs in a retrieval-augmented generation pipeline
Speech recognition
- Transcribe audio files
Computer vision

Machine learning

ML models and MLOps

Use cases

Tutorials

Get started with BigQuery ML
Regression and classification
Clustering
- Cluster data with a k-means model
Recommendation
- Create recommendations based on explicit feedback with a matrix factorization model
- Create recommendations based on implicit feedback with a matrix factorization model
Time series forecasting
Anomaly detection
- Anomaly detection with a multivariate time series
Imported and remote models
Hyperparameter tuning
- Improve model performance with hyperparameter tuning
Export models
- Export a BigQuery ML model for online prediction

Augmented analytics

Contribution analysis

Tutorials

Create and manage features

Work with models

Administer

Introduction

Manage resources

Manage code assets

Manage tables

Manage table clones

Manage table snapshots

Orchestrate resources

Introduction

Orchestrate code assets

Orchestrate jobs and queries

Workload management

Use reservations

Manage jobs

Legacy reservations

Manage BI Engine

Monitor workloads

Optimize resources

Control costs

Optimize with recommendations

Organize with labels

Manage data quality

Govern

Introduction

Control access to resources

Introduction

Control access with IAM

Control access with authorization

Restrict network access

Control column and row access

Control access to table columns

Manage policy tags

Control access to table rows

Protect sensitive data

Mask data in table columns

Anonymize data with differential privacy

Manage encryption

Audit workloads

Develop

BigQuery API basics

BigQuery APIs and libraries overview

Authentication

How does BigQuery work?​

Get started​

Quickstarts​

Try the Cloud console​

Try the command-line tool​

Explore BigQuery tools​

Migrate​

Migrate a data warehouse​

Migrate SQL​

Migration guides​

Amazon Redshift​

Apache Hive​

IBM Netezza​

Features​

Oracle​

Snowflake​

Teradata​

Design​

Datasets​

Tables​

BigQuery tables​

External tables​

Views​

Logical views​

Materialized views​

Routines​

Connections​

Indexes​

Search indexes​

Vector indexes​

Load, transform, and export​

Load data​

BigQuery Data Transfer Service​

Batch load data​

Write and read data with the Storage API​

Transform data​

Prepare data​

Transform data with workflows​

Export data​

Analyze​

Explore your data​

Query BigQuery data​

Query data with SQL​

Use geospatial analytics​

Search data​

Work with queries​

Save queries​

Continuous queries​

Work with sessions​

Optimize queries​

Query external data sources​

Manage open source metadata​

Use external tables and datasets​

Run federated queries​

Use notebooks​

Use Colab notebooks​

Use DataFrames​

Use Jupyter notebooks​

Use analysis and BI tools​

Google Cloud Ready - BigQuery​

Share with Analytics Hub​

Entity resolution​

AI and machine learning​

Generative AI and pretrained models​

Choose generative AI and task-specific functions​

Generative AI​

Tutorials​

Task-specific solutions​

Tutorials​

Machine learning​

ML models and MLOps​

Use cases​

Tutorials​

Augmented analytics​

Tutorials​

Create and manage features​

Work with models​

Administer​

Manage resources​

Manage code assets​

How does BigQuery work?

Get started

Quickstarts

Try the Cloud console

Try the command-line tool

Explore BigQuery tools

Migrate

Migrate a data warehouse

Migrate SQL

Migration guides

Amazon Redshift

Apache Hive

IBM Netezza

Features

Oracle

Snowflake

Teradata

Design

Datasets

Tables

BigQuery tables

External tables

Views

Logical views

Materialized views

Routines

Connections

Indexes

Search indexes

Vector indexes

Load, transform, and export

Load data

BigQuery Data Transfer Service

Batch load data

Write and read data with the Storage API

Transform data

Prepare data

Transform data with workflows

Export data

Analyze

Explore your data

Query BigQuery data

Query data with SQL

Use geospatial analytics

Search data

Work with queries

Save queries

Continuous queries

Work with sessions

Optimize queries

Query external data sources

Manage open source metadata

Use external tables and datasets

Run federated queries

Use notebooks

Use Colab notebooks

Use DataFrames

Use Jupyter notebooks

Use analysis and BI tools

Google Cloud Ready - BigQuery

Share with Analytics Hub

Entity resolution

AI and machine learning

Generative AI and pretrained models

Choose generative AI and task-specific functions

Generative AI

Tutorials

Task-specific solutions

Tutorials

Machine learning

ML models and MLOps

Use cases

Tutorials

Augmented analytics

Tutorials

Create and manage features

Work with models

Administer

Manage resources

Manage code assets