The Whats and Hows of Data audit — Technology ComponentsExpectation setting: This article might be more suited for you if you’re a data practitioner who does or wants to conduct a data audit…Mar 29Mar 29
Data Vault in RedshiftImagine you have the ERP with the market price of products that you are selling. An example of such data is Simusolar Kina sold at USD1500…Mar 15Mar 15
Data Protection Laws adoption in AfricaYou’ve probably heard it a hundred times: “Just move your data to the cloud.” While this is often viewed as a straightforward solution for…Feb 10Feb 10
Creating a Data Management StrategyThis article looks at how different practice areas within the DAMA DMBOK interact and support each other in managing and leveraging data…Apr 9, 20241Apr 9, 20241
Is anybody loading data in fact and dimension tables the pure-Python way?The objective of the blog is to implement Slowly Changing Dimensions Type 2 (SCD2) and fact tables with a lookup to an SCD2 using Redshift…Mar 16, 20242Mar 16, 20242
CI/CD pipeline with Google Compute Engine and GitHub Actions, part IIIn the previous blog post, I reviewed how CI/CD could keep the team in sync and save time by automatically merging code changes. Today I…Sep 30, 2023Sep 30, 2023
Published inPlumbers Of Data ScienceBuilding a CI/CD Pipeline for Apache Airflow DAGs, part II was sitting and looking at the formula that calculates ROI for task automation: TIME (spent on a single manual task) x FREQUENCY (of…Sep 16, 2023Sep 16, 2023
Part II or Stale data detection with dbt and Redshift metadataIn an ideal world, your SaaS provider, e.g. Xero allows you to export your data or has some systems in place that push data to your…Mar 27, 2023Mar 27, 2023