My first Publication Agile-Data-Warehouse-Design-eBook | Page 36

How to Model a Data Warehouse 15 When source database schemas are not yet available, ETL development can still proceed if ETL and OLTP designers can agree on flat file data extracts. Once OLTP have committed to provide the specified extracts on a schedule to meet BI needs, ETL transformation and load routines can be developed to match this source to the proactive data warehouse design target. Challenges of Proactive Analysis for Data Warehousing While being proactive has great potential benefits for DW/BI, the late appearance of data on the Figure 1-6 timeline unfortunately heralds further analysis challenges for data warehouse designers: BI requirements gathering must take place before any real data is available. Under these circumstances proactive data modelers can rely even less upon traditional analysis techniques to provide BI data requirements to match their aggressive schedule. Proactive Reporting-Driven Analysis Challenges Traditional interviewing techniques for gathering reporting requirements are problematic when stakeholders haven’t seen the data or applications that will fuel their BI imagination. With no existing reports to work from, business analysts can’t ask their preferred icebreaker question: “How can your favorite reports be improved?” and they have nothing to point at if and ask: “How do you use this data to make decisions?”. Even more open questions such as “What decisions do you make and what information will help you to make them quicker/better?” can fall flat when a new operational systems will shortly enable an entirely new business process that stakeholders have no prior experience of measuring, or managing. Proactive Data-Driven Analysis Challenges IT cannot fall back on data-driven analysis: data profiling tools and database remodeling skills are of little use when new source databases don’t exist, are still under development, or contain little or no representative data (only test data). Even when new operational systems are implemented using package applications with stable, (well) documented database schemas they are often too complicated for untargeted data profiling: it would take too long and be of little value if only a small percentage of the database is currently used/populated and well understood by the available IT resources. Data then Requirements: a ‘Chicken or the egg’ Conundrum Before there is data and users have lived with it for a time (with less than perfect BI access) both IT and business stakeholders cannot define genuine BI requirements in sufficient detail. Without these early detailed requirements proactive data warehouse designs routinely fail to provide the right information on time to avoid a BI backlog building up as soon as data is available. To solve this ‘data then re- quirements’/‘chicken or the egg’ conundrum, proactive data warehousing needs a new approach to database analysis and design: not your father’s data modeling, not even your father’s dimensional modeling! Proactive analysis takes place before data exists Reporting-driven analysis is difficult before data exists Data-driven analysis is impossible with no data to profile Proactive DW design requires a new approach to data analysis, modeling and design