Data Engineering

A collection of 13 posts

Cloud Infrastructure,  Data Engineering,  DevOps

Checking the public IP address of a GCP Cloud Function

Public IP address of a GCP Cloud Function: When you deploy a Cloud Function without configuring a VPC, it operates within Google’s internal…

Cloud Infrastructure,  Data Engineering

Debugging Dataform with Node.js in VS Code

Debugging Dataform JavaScript: If you happen to be a data engineer who is not familiar with the…

Cloud Infrastructure,  Data Engineering

GCP Cloud Functions for Web Scraping AFL data

Scraping web data using GCP Cloud Functions: The main purpose of this blog post is to give a practical example of writing a GCP Cloud Function with…

Dataflow,  Data Engineering,  GCP,  BigQuery

Converting table schemas from any database to BigQuery for Dataflow

Why do this? When using Google-provided Dataflow templates for any RDBMS to BigQuery (such as PostgreSQL to BigQuery), the target table in…

Dataflow,  Data Engineering,  GCP,  KMS

KMS encrypted credentials with Dataflow on GCP

Encrypting credentials. Google KMS: Cloud Key Management Service (Cloud KMS) lets you create and manage encryption keys for use in compatible…

Apache Beam,  Data Engineering,  GCP,  BigQuery,  Docker

Python Beam and Dataflow to write data to BigQuery with a custom Docker image

Dataflow build and run: A basic example of how to develop an Apache Beam pipeline in Python that writes data to BigQuery. To then build…

Teradata,  Data Engineering,  GCP,  BigQuery

Copying a table from Teradata to Google BigQuery

For this demo I will be using a Windows PC with: Teradata Studio, Teradata ClearScape…

Teradata,  Data Engineering

Connecting NiFi to Teradata

For this demo I will be using a Windows PC with: Docker Desktop, PowerShell Core 7.3.6, VMware Workstation…

Data Engineering,  Python

Connecting to Teradata from Python

Reading and writing data between Python and Teradata: This blog post is paired with the associated GitHub repository https://github.com…

Data Engineering,  Data Science

Databricks sample NFL dataset

Create a Databricks database with sample NFL data. The code for this post is here: https://github.com/mortie23/databricks-nfl-data…

System Administration,  Cloud Infrastructure,  Data Engineering

Installing Prefect Server on Azure

Reading: First, I came across the following docs on how to install Prefect Server: Prefect Server 101, Deploying to GCP, Deploying to a Single…

Business Intelligence,  Web Development,  Data Engineering

Tableau Web Data Connector Node Generator

Using a Tableau Web Data Connector (WDC) as the data source for a dashboard. This post refers to the associated GitHub repo at https…

Business Intelligence,  Data Engineering

Tableau Web Data Connector

This post refers to the associated GitHub repo at https://github.com/mortie23/tableau-wdc. Tableau WDC: A quick example of a Tableau…