The Doppler Quarterly Summer 2016 | Page 37

KEY CONCEPT

Data Lake

A data lake is a storage repository that holds a vast amount of raw data in its native format until it is needed. While a hierarchical data warehouse stores data in files or folders, a data lake uses a flat architecture to store data.
Data lakes complement existing systems by ensuring that analytical workloads, development, testing and machine learning model creation will not impact production workloads in other performance-optimized systems.
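
To make the idea of a flat architecture concrete, here is a minimal Python sketch of a toy in-memory store. It is an illustration only, not how any particular data lake product works. The `FlatStore` class, its method names, and the scheme of giving each object a generated ID plus free-form metadata tags are all assumptions introduced for this example. What it tries to show is that raw data is kept unchanged in its native format, with no folder hierarchy, and is located later by its tags rather than by a path.

```python
import uuid


class FlatStore:
    """Hypothetical toy stand-in for a flat store; not a real library."""

    def __init__(self):
        # One flat namespace: object ID -> (raw bytes, metadata tags).
        self._objects = {}

    def put(self, raw: bytes, **tags) -> str:
        """Store raw bytes unchanged and return a generated object ID."""
        object_id = uuid.uuid4().hex
        self._objects[object_id] = (raw, dict(tags))
        return object_id

    def find(self, **tags) -> list:
        """Return the IDs of all objects whose tags match every given tag."""
        return [
            object_id
            for object_id, (_, meta) in self._objects.items()
            if all(meta.get(key) == value for key, value in tags.items())
        ]

    def get(self, object_id: str) -> bytes:
        """Return the raw bytes exactly as they were stored."""
        return self._objects[object_id][0]


# Usage: two differently formatted payloads sit side by side, untransformed.
lake = FlatStore()
csv_id = lake.put(b"id,total\n1,9.50\n", source="orders", fmt="csv")
log_id = lake.put(b'{"event": "login"}', source="web", fmt="json")

# Data is interpreted only when it is needed, after a lookup by tag.
for object_id in lake.find(fmt="csv"):
    print(lake.get(object_id).decode())
```

Because nothing is parsed or restructured when it is stored, deciding how to interpret each object is deferred until something reads it, which matches the "until it is needed" part of the definition above.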