2025 Learning Log
In keeping up with my goal to do more learning this year, I’m allotting some time outside the daily grind for learning. My overarching goal is to explore popular data engineering tools such as dbt...
In keeping up with my goal to do more learning this year, I’m allotting some time outside the daily grind for learning. My overarching goal is to explore popular data engineering tools such as dbt...
This is an extension of my 2025 Learning Log. Reviewing Spark (PySpark) through this course Taming Big Data With Apache Spark Setting up Apache Spark 3.x is only compatible with Java 8, Java 11,...
This is an extension of my 2025 Learning Log. I started learning Apache Kafka. I wanted to study Flink actually but since it comes downstream of Kafka, I figured I might as well learn a bit more a...
This is an extension of my 2025 Learning Log. About Loguru is a python package for logging. It is intended to make logging simpler, less painful, and enjoyable. Installation pip install loguru ...
I’ve recently studied Snowflake through this course: Snowflake masterclass. Here’s my certificate. This course covered a lot of topics, and it’s been a fulfilling couple of weeks learning the ins a...
This is an extension of my 2025 Learning Log. PyAutoGUI lets your Python scripts control the mouse and keyboard to automate interactions with other applications. PyAutoGUI Docs Install pip in...
This is an extension of my 2025 Learning Log. Python collections module contains specialized container datatypes that provide alternatives to Python’s general purpose built-in containers: dict,...
This is an extension of my 2025 Learning Log. I’ve reviewed ERDs (Entity Relationship Diagrams) and Mermaid - a tool to create diagrams using code. ERD Then and now, I find the Lucid chart expla...
I’ve recently created a learning log for this year (2025). The first topic that I wanted to go over is dbt (data-build-tool). Here are some of my notes. I’m putting more materials or notes here abo...
I’ve recently learned about Cumulative tables, a table design to do historical tracking. The timeframe (however it is defined), are inserted sequentially at every time partition, which eliminates t...