Skip to content

2025

The Lakehouse Series: OLTP vs. OLAP (A Parquet Primer)

TLDR

After reading this article, you will learn:

  • The key differences between OLTP and OLAP workloads, and why storage format matters
  • How Parquet organizes data internally and optimizes data storage using compression techniques like dictionary encoding and RLE
  • Where Parquet falls short in today's data landscape

How to Use MkDocs to Integrate GitHub Actions and Git Submodule for Cross-repo Documentation

TLDR

After reading this article, you will learn how to:

  • Use Git Submodule to centrally manage documentation sources across multiple projects
  • Configure GitHub Actions for cross-project automation and integration workflows
  • Utilize Reusable Workflows to reuse CI/CD scripts and reduce maintenance costs
  • Leverage MkDocs Monorepo Plugin to merge documentation from multiple projects into a single website