Generated by GPT-5-mini| OLAP | |
|---|---|
| Name | OLAP |
| Full name | Online Analytical Processing |
| Type | Data processing paradigm |
| Introduced | 1990s |
| Primary users | Business analysts, data scientists |
| Related technologies | Data warehouse, ETL, Online Transaction Processing |
OLAP
Online Analytical Processing (OLAP) is a data processing paradigm for fast, multidimensional analysis of large datasets used by analysts in organizations. It supports complex queries, trend analysis, and ad hoc reporting by enabling aggregated views and slice-and-dice operations across dimensions such as time, geography, product, and customer. OLAP implementations integrate with data warehousing, extract-transform-load pipelines, and business intelligence platforms to support decision-making in enterprises.
OLAP systems enable multidimensional analysis across dimensions used by Coca-Cola Company, Walmart, Microsoft, IBM, and Amazon (company) to examine metrics related to Fortune 500 activities and strategic planning. Analysts at institutions such as Goldman Sachs, JPMorgan Chase, Morgan Stanley, McKinsey & Company, and Boston Consulting Group apply OLAP to revenue, margin, and risk reporting, while public organizations like World Bank, International Monetary Fund, United Nations, European Central Bank, and Federal Reserve System use aggregated time series for macroeconomic studies. OLAP underpins dashboards and reporting solutions in products from Tableau Software, Qlik, SAP SE, Oracle Corporation, and Google LLC, and integrates with analytics from SAS Institute, Teradata, Snowflake (company), Cloudera, and Databricks.
Typical OLAP architectures include multidimensional cubes, star schemas, and snowflake schemas implemented on platforms from Microsoft SQL Server, Oracle Database, IBM Db2, and PostgreSQL. Data modeling in OLAP involves dimensions such as fiscal calendars used by General Electric, hierarchical geographies like those in United States Department of Commerce publications, and product taxonomies similar to classifications from GS1. Data integration often leverages ETL tools from Informatica, Talend, Pentaho, and Microsoft Azure Data Factory to consolidate sources such as SAP ERP, Salesforce, Workday, Oracle E-Business Suite, and streaming inputs from Apache Kafka. Storage strategies range from MOLAP cubes on specialized engines to ROLAP tables in relational engines and HOLAP hybrids combining both approaches, deployed on infrastructure from Amazon Web Services, Microsoft Azure, Google Cloud Platform, and on-premises solutions by Hewlett Packard Enterprise and Dell Technologies.
Core OLAP operations include slice, dice, roll-up, and drill-down that analysts apply in environments like Bloomberg L.P., Reuters, Nielsen Holdings, Kantar Group, and Gartner, Inc.. Query languages and interfaces include MDX in products from Microsoft, SQL extensions used by Oracle, and APIs used by SAP BusinessObjects and IBM Cognos. OLAP queries frequently join fact tables and dimension tables as in schemas from Teradata implementations, and are optimized for aggregations, window functions, and materialized views used in Google BigQuery, Amazon Redshift, Snowflake (company), and Azure Synapse Analytics.
Implementation types are commonly classified as MOLAP, ROLAP, and HOLAP, with commercial and open-source products from Microsoft, Oracle Corporation, SAP SE, IBM, Teradata, Informatica, Pentaho, Actian, and MicroStrategy. Open-source data stores and engines used for OLAP-like analytics include Apache Druid, Apache Pinot, ClickHouse, Presto (Trino), Apache Spark, and ClickHouse (alt). Columnar storage formats and technologies such as Apache Parquet, Apache ORC, and Delta Lake accelerate aggregation workloads, while indexing strategies from Lucene and compression approaches used in Zstandard and Snappy (compression) reduce I/O. Integration with visualization and BI tools from Tableau, Qlik, Power BI, Looker, and Sisense provides user-facing OLAP functionality.
Performance tuning in OLAP uses partitioning strategies employed by Teradata and Greenplum, materialized views as in Oracle Database and Microsoft SQL Server, and in-memory caches leveraged by SAP HANA, Microsoft Analysis Services, and Redis. Scalability patterns involve distributed processing frameworks such as Apache Hadoop, Apache Spark, and cloud-native architectures from Amazon Web Services, Google Cloud Platform, and Microsoft Azure. Optimization techniques draw on algorithms and research from institutions like MIT, Stanford University, Carnegie Mellon University, University of California, Berkeley, and ETH Zurich to improve query planning, indexing, and pre-aggregation. Workloads in companies such as Netflix, Airbnb, Uber Technologies, Lyft, and Spotify illustrate large-scale OLAP-like analytics with real-time and near-real-time requirements.
Common OLAP use cases include financial reporting at JPMorgan Chase and Citigroup, sales and inventory analysis at Walmart and Target Corporation, customer analytics at Amazon (company) and eBay, marketing attribution at Procter & Gamble and Unilever, and fraud detection efforts at PayPal, Visa Inc., and Mastercard. Healthcare analytics initiatives at Mayo Clinic, Cleveland Clinic, and Johns Hopkins Hospital use OLAP-style aggregation for outcomes research, while telecommunications firms like AT&T, Verizon Communications, and Vodafone analyze call detail records with similar techniques. Government statistical agencies such as United States Census Bureau and Office for National Statistics use multidimensional aggregates for demographic and economic indicators.
Foundational research and commercialization in the 1990s involved academic and corporate contributors linked to Paul Gray (information systems), Edgar F. Codd, and vendors including Arbor Software and Hyperion Solutions Corporation. Standardization efforts led to query and metadata models such as MDX and XMLA used by Microsoft, various vendors and interoperability initiatives promoted by The Data Warehousing Institute (TDWI), ISO, and ANSI. The evolution of OLAP intersects with developments in data warehousing from Bill Inmon and Ralph Kimball, and with advances in columnar databases, cloud data platforms, and streaming architectures influenced by Amazon Web Services and Google Cloud Platform.
Category:Data processing