AIFeb 15, 2016

Towards reducing the multidimensionality of OLAP cubes using the Evolutionary Algorithms and Factor Analysis Methods

arXiv:1602.04613v15 citations
Originality Synthesis-oriented
AI Analysis

This work addresses data reduction for decision support systems in data warehousing, presenting an incremental improvement by combining existing methods.

The paper tackles the problem of high memory and computation costs in analyzing multidimensional OLAP cubes in data warehouses by proposing a hybrid approach using Genetic Algorithms and Multiple Correspondence Analysis to reduce dimensionality, resulting in identifying a reduced subset of dimensions that closely matches a reference profile.

Data Warehouses are structures with large amount of data collected from heterogeneous sources to be used in a decision support system. Data Warehouses analysis identifies hidden patterns initially unexpected which analysis requires great memory and computation cost. Data reduction methods were proposed to make this analysis easier. In this paper, we present a hybrid approach based on Genetic Algorithms (GA) as Evolutionary Algorithms and the Multiple Correspondence Analysis (MCA) as Analysis Factor Methods to conduct this reduction. Our approach identifies reduced subset of dimensions from the initial subset p where p'<p where it is proposed to find the profile fact that is the closest to reference. GAs identify the possible subsets and the Khi formula of the ACM evaluates the quality of each subset. The study is based on a distance measurement between the reference and n facts profile extracted from the Warehouses.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes