Creating a Data Management StrategyThis article looks at how different practice areas within the DAMA DMBOK interact and support each other in managing and leveraging data…Apr 91Apr 91
Is anybody loading data in fact and dimension tables the pure-Python way?The objective of the blog is to implement Slowly Changing Dimensions Type 2 (SCD2) and fact tables with a lookup to an SCD2 using Redshift…Mar 162Mar 162
CI/CD pipeline with Google Compute Engine and GitHub Actions, part IIIn the previous blog post, I reviewed how CI/CD could keep the team in sync and save time by automatically merging code changes. Today I…Sep 30, 2023Sep 30, 2023
Published inPlumbers Of Data ScienceBuilding a CI/CD Pipeline for Apache Airflow DAGs, part II was sitting and looking at the formula that calculates ROI for task automation: TIME (spent on a single manual task) x FREQUENCY (of…Sep 16, 2023Sep 16, 2023
Part II or Stale data detection with dbt and Redshift metadataIn an ideal world, your SaaS provider, e.g. Xero allows you to export your data or has some systems in place that push data to your…Mar 27, 2023Mar 27, 2023
To continue with a metrics store, part II‘Why do we have X as a revenue in the report A and Y, in application B?’ You start digging and realize that in the first case, the analyst…Mar 19, 2023Mar 19, 2023
Stale data detection with dbt and BigQuery dataset metadataSee Part II or last modified in Redshift at…Mar 5, 2023Mar 5, 2023
Integration of dbt with other systems“I could do X with the system but cannot complete Y. What do I do?” The rise of the modern data stack trend caused many vendors to appear…Feb 7, 2023Feb 7, 2023