Using distributed query result caching to evaluate queries for parallel data mining algorithms
Author(s)
Editor(s)
Arabnia, H R
Publisher
Athens: C S R E a Press
Date issued
1998
In
International Conference on Parallel and Distributed Processing Techniques and Applications, Vols I-Iv, Proceedings
From page
1127
To page
1132
Subjects
parallel query caching discriminant rules
Abstract
An increase in the speed of data mining algorithms can be achieved by improving the efficiency of the underlying technologies. Query engines are key components ill many knowledge discovery systems and the appropriate use of query engines can impact the performance of data mining algorithms. By laking advantage of hypothesis generation patterns, queries, generated from the hypotheses, call be evaluated more efficiently. Caching query results and using the cached results to evaluate new queries with similar constraints reduces the complexity of query evaluation and improves the performance of data mining algorithms. In a multi-processor environment, distributing the query result caches can improve the performance of parallel query evaluations. This Idea has been used in the ParDRI system and has resulted in significant improvements in the execution times of ParDRI.
Publication type
book part
