Research Paper | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015
Data Cube Materialization with MR Cube and CM Sketch Approach
Amar Sawant, Madhav Ingle
Data cube computations plays an important role in data warehouse systems. Applications with multidimensional data analysis are looking for unusual patterns. Here aggregation of data is done across many dimensions. Aggregation is done by making use of SQL aggregate functions and Group by operators. As there is need for multidimensional generalization of these operators, data cube is used which is a way for structuring data in multidimensions so that analysis can be done on some measures of interest. One of the key tasks in data warehouse is data cube computations. Several techniques for data cube computations are available but there are some limitations so MapReduce based approach can be used to overcome the limitations. MR-Cube, which is Mapreduced based approach creates lattices using derived data set which are further partitioned using value partitioning techniques followed by batch areas creation, makes an effective distribution of data and computation workload. Data cube computations in parallel using partially algebraic measures is done using MapReduced based algorithm. Extreme data skew is detected for a few cube groups that are unusually large. CM-Sketch is a Count Min Sketch approach, which is a compressed counting data structures used as a solution for extreme data skews.
Keywords: cube analysis, holistic measures, map reduce, data skew, CM sketch
Edition: Volume 4 Issue 11, November 2015
Pages: 1885 - 1889
How to Cite this Article?
Amar Sawant, Madhav Ingle, "Data Cube Materialization with MR Cube and CM Sketch Approach", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=NOV151648, Volume 4 Issue 11, November 2015, 1885 - 1889