Help Branching Out / Upgrading Skillset
Airflow Summit 2023 - Recordings Now Available
The 6 columns essential to a $6B/year database table
ARCHIVE_DT or when you can finally delete some shit
Reference table design
Rate histories or cleanly storing history
UNIQUE_TRANS_ID or letting you track what occurred together.
HISTORY_SEQ column or sanity checking basic mode
_A tables or how not to accidentally lose your shit
How do I convince my data engineer to not modify data before including it in our db?
Citus Data - Distributed Postgres
Cloud Backed SQLite
Apache Arrow
Data-Oriented Design (2018)
CAP Theorem Simplified
Introducing English as the New Programming Language for Apache Spark
What is Data Lineage?
Design Thinking Bootleg (Stanford)
Array programming with NumPy
The Missing Semester of Your CS Education