Data warehouse architectures
Data Warehousing Concepts
Aaren Stubberfield
Data Scientist
Inmon - top-down
Inmon - top-down
Must decide:
On all data definitions, cleaning, and business rules
Before any data enters warehouse
Inmon - top-down
Inmon - top-down
Pros and cons of top-down
Advantages:
Single source of truth for organization
Normalization = less storage
Easy to change data marts to support reporting changes
Disadvantages:
More joins = slower response time
Lengthy upfront work
Higher startup cost
Kimball - bottom-up
Denormalizes data
Focus on departmental data mart
Data moves directly from ETL to data marts
Kimball - bottom-up
Pros and cons of bottom-up
Advantages:
Upfront development speed
Lower startup cost
Denormalized = user friendly
Disadvantages:
Increased ETL processing time
Greater possibility of duplicate data
Ongoing development needed
Let's practice!
Data Warehousing Concepts
Preparing Video For Download...