Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation
| |
|
Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation
Iko Pramudiono3 , Takahiko Shintani3 , Takayuki Tamura3, 4 and Masaru Kitsuregawa3 
| (3) |
Institute of Industrial Science, The University of Tokyo, 7-22-1 Roppongi, Minato-ku, Tokyo 106, Japan |
| (4) |
Information & Communication System Development Center, Ohfuna 5-1-1, Kamakura-shi Kanagawa-ken, 247-8501, Japan |
Abstract
Data mining is becoming increasingly important since the size of databases grows even larger and the need to explore hidden
rules from the databases becomes widely recognized. Currently database systems are dominated by relational database and the
ability to perform data mining using standard SQL queries will definitely ease implementation of data mining. However the
performance of SQL based data mining is known to fall behind specialized implementation. In this paper we present an evaluation
of parallel SQL based data mining on large scale PC cluster. The performance achieved by parallelizing SQL query for mining
association rule using 4 processing nodes is even with C based program.
Keywords data mining - parallel SQL - query optimization - PC cluster
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|