gadiener/bigquery-airflow-logger: This makes the Kubernetes pod logs visible in the Airflow UI interface during runtime in GKE

BigQuery logger handler for Airflow

Installation

pip install airflow-bigquerylogger

Configuration

AIRFLOW__CORE__REMOTE_LOGGING='true'
AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER='gs://bucket/path'
AIRFLOW__CORE__REMOTE_LOG_CONN_ID='gcs_log'
AIRFLOW__CORE__LOGGING_CONFIG_CLASS='bigquerylogger.config.LOGGING_CLASS'
AIRFLOW__CORE__LOG_BIGQUERY_DATASET='dataset.table'
AIRFLOW__CORE__LOG_BIGQUERY_LIMIT=50
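
Airflow reads any environment variable named AIRFLOW__{SECTION}__{KEY} as the option KEY in the [SECTION] block of its configuration, so every variable above lands in the [core] section. The sketch below only illustrates that naming convention; the helper name airflow_env_to_config is hypothetical and is not part of Airflow or of this package.

```python
def airflow_env_to_config(environ):
    """Map AIRFLOW__SECTION__KEY variables to {(section, key): value}.

    Hypothetical helper for illustration only; Airflow performs this
    lookup internally when it loads its configuration.
    """
    prefix = "AIRFLOW__"
    parsed = {}
    for name, value in environ.items():
        if not name.startswith(prefix):
            continue  # not an Airflow override
        # "CORE__REMOTE_LOGGING" -> section "CORE", key "REMOTE_LOGGING"
        section, sep, key = name[len(prefix):].partition("__")
        if not sep or not key:
            continue  # not in SECTION__KEY form
        parsed[(section.lower(), key.lower())] = value
    return parsed


# The values from the Configuration section, plus one unrelated variable.
example_env = {
    "AIRFLOW__CORE__REMOTE_LOGGING": "true",
    "AIRFLOW__CORE__REMOTE_BASE_LOG_FOLDER": "gs://bucket/path",
    "AIRFLOW__CORE__REMOTE_LOG_CONN_ID": "gcs_log",
    "AIRFLOW__CORE__LOGGING_CONFIG_CLASS": "bigquerylogger.config.LOGGING_CLASS",
    "AIRFLOW__CORE__LOG_BIGQUERY_DATASET": "dataset.table",
    "AIRFLOW__CORE__LOG_BIGQUERY_LIMIT": "50",
    "HOME": "/root",
}

settings = airflow_env_to_config(example_env)
for (section, key), value in sorted(settings.items()):
    print(f"[{section}] {key} = {value}")
```

Note that environment values are always strings, so LOG_BIGQUERY_LIMIT arrives as "50" and has to be converted to an integer by whatever reads it.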

Google Cloud BigQuery

Rows recently written to a table via streaming (the tabledata.insertAll method) cannot be modified with UPDATE, DELETE, or MERGE statements, so freshly streamed log rows cannot be pruned by hand. I recommend setting up a retention policy on the table instead, for example a partition expiration.
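
One way to get automatic retention, sketched here under the assumption that the log table is time-partitioned (this README does not say how the table is created), is BigQuery's partition_expiration_days table option. The helper name partition_expiration_ddl and the 30-day value are illustrative, not part of this package; the function only builds the DDL string, which you would then run in the BigQuery console or client of your choice.

```python
def partition_expiration_ddl(table, days):
    """Build a BigQuery DDL statement that expires partitions after `days`.

    `table` uses the same 'dataset.table' form as LOG_BIGQUERY_DATASET.
    Assumes the target table is time-partitioned (an assumption, not
    something this package guarantees).
    """
    if days <= 0:
        raise ValueError("days must be positive")
    if "`" in table:
        raise ValueError("table name must not contain backticks")
    return (
        f"ALTER TABLE `{table}` "
        f"SET OPTIONS (partition_expiration_days = {days})"
    )


# Example: keep roughly 30 days of logs in the table from the configuration.
statement = partition_expiration_ddl("dataset.table", 30)
print(statement)
```

With an expiration in place, old partitions are dropped by BigQuery itself, so no DELETE statement ever has to touch rows that may still be in the streaming buffer.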

Credits

Thanks to the Bluecore engineering team for this useful article.
