Top 3 Themes from the discussion
-
Separating hot vs. cold data
Streaming replication gives full row history, but OLTP disks cannot store that history at scale. Most teams move “cold/archival” data to an OLAP store and keep only recent rows in the OLTP DB.“We ended up with 'hot' data in oltp and 'cold/archival' data in olap because the storage size of oltp has always been limited.” – eveningtree
-
SCD Type 2 via CDC pipelines or time‑travel tables
Historical rows are built either by capturing logical changes in a pipeline (windowed lag on timestamps) or by using warehouse features such as Iceberg/Delta time travel. This creates a reliable versioned view without forcing SCD 2 tables inside the source DB.“If safe keeper exposes the changes to the tables somehow, a type2 scd is just a windowed lag over the primary key sorted by the timestamp.” – nikita
-
Skepticism about unified storage / LTAP architecture
Many commenters doubt that a single storage engine can replace purpose‑built OLTP and OLAP systems, citing CDC overhead, timestamp mis‑alignment, and real‑world cost concerns (e.g., S3 pricing, replication tax).“I’m skeptical of the idea of unifying storage just because it sounds ‘elegant’ or ‘cool’.” – saisrirampur