My first Publication Agile-Data-Warehouse-Design-eBook | Page 95

74 Chapter 3 Hierarchy definition provides a hook for catching additional attributes Hierarchies help expose “informal” stakeholder maintained data sources that can greatly enrich dimensions Hierarchies help you discover planning processes BI users and BI tools require default hierarchies to enable simple drill- down Hierarchies are used to optimize query performance Hierarchies provide a necessary hook to catch dimensional attributes. When you ask stakeholders about hierarchies you are asking them how they (would like to) organize their data. Discussing this activity is one technique for adding some otherwise missing narrative to dimension stories and prompting stake- holders for the BI friendly attributes you need to model good dimensions. When stakeholders think about their 7Ws hierarchically, they describe low level attributes that can be used as discriminators for similar dimensional members, and higher level attributes that can group together many dimen- sional members. When stakeholders describe their favorite hierarchy levels they will frequently provide you with additional “informal“ data sources (spreadsheets, personal databases) they own that contain this categorical information. These stake- holder maintained sources often contain hierarchy definitions, vital to BI, that are missing from “formal” OLTP databases because they are nonessential for operational activity. Many operational applications happily perform their func- tion at the bottom of each hierarchy with no knowledge of the higher level groupings that are imposed upon their raw transactions for reporting purposes. For example, orders can be processed day in, day out without the order proc- essing system knowing how a single date is rolled into a fiscal period. Similarly, items can be shipped to the correct street number/postal code without knowing how the business currently organizes sales geographically (or how they might have been organized differently last year). Hierarchies exist so that organizations can plan. Discussing hierarchies with stakeholders will get them thinking about their planning processes, and will likely help you discover additional events and data sources that represent budg- ets, targets or forecasts. You must make sure the dimensions you design con- tain the common levels needed to roll up actual event measures for comparison against these plans. BI users like default hierarchies and BI tool “click to drill” functionality that allows them to quickly drill-down on an attribute without having to manually decide each time what to show next. For example, if users drill on “Quarter” they usually want to see monthly detailed data by default. Explicit hierarchies establish predictable analytical workflows that are very helpful to (new) BI us- ers exploring the data for the first time. “Clicking to drill” is less laborious and error prone than manually adding and removing report row headers. Everyone wants common drill-down and drill-up requests to happen quickly. Explicit hierarchies are needed to define efficient data aggregation strategies in the data warehouse. On-Line Analytical Processing (OLAP) cubes in multidi- mensional databases, aggregate navigation /query rewrite optimization in rela- tional databases and prefetched micro cubes in BI tools, all take advantage of hierarchy definitions to maximize query performance.