I am in process of designing a Data Warehouse Architecture. While exploring various options to Extract data from Production and putting into Data Warehouse, I came across many articles which mainly suggested following two approaches -
- Production DB ----> Data Warehouse (Star Schema) ----> OLAP Cube
- Production DB ----> Staging Database ----> Data Warehouse (Star Schema) ----> OLAP Cube
I am still not sure which one is the better approach in terms of Performance and reducing processing load on Production database.
Which approach you find better while designing Data Warehouse ?
Below points are taken from, DWBI Organization's article
Staging area may be required if you have any of the following scenarios:
Performance and reduced processing may not be only considerations. Adding a staging may sometimes increase latency
(i.e. time delay between occurrence of a business incidence and it's reporting). But I hope above points will help you to make a better judgement.