Overview

Amazon Redshift is a massively parallel processing (MPP), columnar data warehouse offering from AWS. Users can fine-tune and customize many aspects of it, such as distribution and sort keys, to fit their specific use cases.

The main focus of this article is ETL workloads that need to join tables on composite keys. In that situation, irrespective of the distribution style used (except ALL, which is not an option for fact tables), data will be redistributed across nodes at join time. How can we trick Redshift into not redistributing data while still performing a composite-key join with lightning-fast processing?

This article gives an overview of table internals but does not go into depth on internal implementations. …

Harsha Vardhan Lella
