PROJECT GOALS
Develop, implement and evaluate techniques for efficiently reducing the storage needs in a multidimensional cube. The techniques will be based in modeling regions of the cube thereby providing succinct descriptions of the data, and at the same time storing the cell values that do not fit well in the models (outliers) to reduce the errors caused by the estimation. Several models need to be evaluated with respect to issues such as time to build the models, storage savings achieved, and errors incurred in the estimation. We will implement the techniques and evaluate them experimentally. Develop, implement and evaluate techniques to support multiresolution in data cubes. These techniques will allow users to obtain results with a requested error level attached to the answer. The larger the error, the faster the answer will be obtained. The users can ``zoom in'' as desired, reducing the level of error incrementally. Alternatively, the system can do it automatically, providing rough estimates at the beginning, and then incrementally refining the answers in front of the users. This procedure has the advantage of eliminating lengthy waits for the answers to materialize (latency), which tends to irritate most users. Moreover, it puts the users in control of the system, allowing them to stop the computations when the quality of the answers satisfies their needs. We will implement the techniques which will be also based in modeling regions of the cube and classifying the cells according to their error level. We will evaluate the techniques experimentally and study the tradeoffs involved. Develop, implement and evaluate techniques to do data mining on multidimensional data. The information obtained in the process of modeling regions of the cube is a valuable start point for mining patterns in the data. Moreover, the information about outliers is valuable in identifying cells that have an ``abnormal'' behavior (respect to other cells in the region). We hope to develop a set of tools to effectively mine data in the cube. Implement a prototype Database Management System for OLAP, using the concepts and tradeoffs studied in the previous objectives. This prototype will be able to manage large multidimensional datasets and provide the tools to do mining and analysis of the data.PUBLICATIONS
PEOPLE