My first Publication Agile-Data-Warehouse-Design-eBook | Page 115
94
Chapter 3
Summary
A dimensional data warehouse is only as good as its dimensions. Good dimensions contain
dimensional attributes that describe business events using terms that are familiar and
meaningful to the BI stakeholders. This is the best reason for asking stakeholders to modelstorm
the dimensions they need using examples to clarify the terms they use.
Dimensions themselves are discovered by event modeling; most who, what, when, where and
why event details become dimensions. Dimensional attributes are discovered by modeling each
of these details as the subject of its own BEAM ✲ dimension table. How details can be
dimensionalized in this way too but they typically do not have additional descriptive attributes.
Non-descriptive how details become degenerate dimensions, stored in fact tables along with the
how many details which become facts.
The first additional attribute that must be modeled for a dimension subject is its identity. This is
the business key (BK) which uniquely identifies each dimension member and defines the
dimension granularity. If a dimension is created from multiple source systems there may be
more than one BK for a dimension member. The unique BK for a dimension can be a
composite of multiple source systems keys.
Further dimension attribute requirements are gathered by asking stakeholders to provide
example descriptive data for each dimension (using the 7Ws as an attribute type checklist).
Examples help to discover mandatory attributes (MD), and exclusive attribute combinations
(Xn) together with their defining characteristics (DCn,n).
Hierarchy charts describe how stakeholders (want to) organize dimensional members
hierarchically to support drill-down analysis and plan vs. actual reporting. Drawing these charts
helps to prompt stakeholders for additional hierarchical attributes and data sources. Hot
hierarchy levels, that represent popular levels of summarization, help to identify additional
planning events, rollup dimensions and aggregation opportunities.
Hierarchies can be balanced, ragged or variable depth. Each type can be single or multi-parent.
Single parent, balanced hierarchies are the easiest hierarchies to implement dimensionally and
the simplest to work with for BI. Additional techniques are needed to balance ragged
hierarchies and represent multi-parent and variable depth hierarchies (See Chapter 6).
Change stories describe how dimensional history is handled. The short codes CV, HV, FV, PV
are used to document the temporal properties of each attribute. These temporal codes can be
numbered to define group change rules involving multiple attributes.
Minor events are events that occur infrequently and contain few details. They typically do not
represent significant business processes that warrant modeling as separate events. Often they
can be modeled as HV dimensional attributes, or as additional details of other major events
(recognizable business processes).