Online Analytical Processing (OLAP)
Online Analytical Processing (OLAP) is a technique for storing and accessing data in a data warehouse for multi-dimensional data analysis and is the foundational theory behind star schema design.
OLAP works by forming data into OLAP cubes, which are multi-dimensional arrays of interrelated data, then performing operations on those cubes. An operation is an extraction of data from an OLAP cube.
OLAP cubes
A dimension is an attribute of interest in the data that can be summarized numerically. For example, as part of an analysis an analyst may wish to summarize orders by customer segment, product, and location; these are the dimensions of data being analyzed. Each cell of an OLAP cube represents some measure at the intersection of each dimension; for example, the order revenue for a specific product, for a specific customer segment, at a specific time.
Types of OLAP cubes
Multidimensional OLAP (MOLAP) cubes store pre-computed data directly into a multidimensional database. This allows for excellent performance on complex calculations, slicing, and dicing, but can be difficult to scale for large datasets.
Relational OLAP (ROLAP) cubes store data as relational data in columns and rows and retrieves data on-demand to perform aggregate functions, meaning performance is directly related to the size of the queried dataset. ROLAP uses less storage than MOLAP because it does not store pre-computed data.
Hybrid OLAP (HOLAP) cubes combine MOLAP and ROLAP to get the benefits of both approaches.
OLAP operations
A slice operation retrieves a subset of data from a cube that consists of data in which one dimension is filtered to have a single value. For example, summarizing orders for a single customer segment. Similarly, a dice operation filters data to form a cube within a cube, such as summarizing orders for a specific product in a specific location.
A roll-up operation aggregates data from a cube by combining dimension data. For example, by combining zip code data to form state-level data or by computing average sales revenue. Conversely, a drill-down operation breaks down rolled up data into finer-grained dimensions.
A pivot or rotation operation swaps two axises, such as the columns and rows.
Deeper Knowledge on Online Analytical Processing (OLAP)
Star Schemas
Schemas to organize data by facts and dimensions for analysis
Broader Topics Related to Online Analytical Processing (OLAP)
Data warehouses
Data warehouses: Architecture, data flow, and related topics
Data Analysis
The transformation of data to information