data-engineering

2024

2022

2021

DE Log 7: Migrating from Airflow 1 to 2

·2135 words·11 mins
Today I successfully migrated my Airflow setup from v1.10.14 to v2.0.2. This article is not a detailed step-by-step upgrade guide. Instead, I will outline the general migration steps that worked for my specific setup, share some of the problems I encountered along the way, and close with some general impressions of Airflow 2.

2020

DE Log 5: Thoughts on Analytical Tables

·1886 words·9 mins
I took a database design course back in my university days, though I skipped almost all of the lectures because I was learning the same material on my own, in a much more practical manner, during my first internship. I did not even know the word OLAP back then. That is not to dismiss the importance of data modelling in data engineering. On the contrary, data modelling is one of the important skills to have if you want to be a data engineer.

DE Log 4: ETL vs ELT

·1128 words·6 mins
We have all heard of the term ETL. If you are working in the data field, you might have been asked to do some sort of ETL work regardless of your actual job description.

2019