Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation

Iko PramudionoContact Information, Takahiko ShintaniContact Information, Takayuki Tamura3, 4 Contact Information and Masaru KitsuregawaContact Information

(3)  Institute of Industrial Science, The University of Tokyo, 7-22-1 Roppongi, Minato-ku, Tokyo 106, Japan
(4)  Information & Communication System Development Center, Ohfuna 5-1-1, Kamakura-shi Kanagawa-ken, 247-8501, Japan
Abstract
Data mining is becoming increasingly important since the size of databases grows even larger and the need to explore hidden rules from the databases becomes widely recognized. Currently database systems are dominated by relational database and the ability to perform data mining using standard SQL queries will definitely ease implementation of data mining. However the performance of SQL based data mining is known to fall behind specialized implementation. In this paper we present an evaluation of parallel SQL based data mining on large scale PC cluster. The performance achieved by parallelizing SQL query for mining association rule using 4 processing nodes is even with C based program.

Keywords  data mining - parallel SQL - query optimization - PC cluster


Contact Information Iko Pramudiono
Email: iko@tkl.iis.u-tokyo.ac.jp

Contact Information Takahiko Shintani
Email: shintani@tkl.iis.u-tokyo.ac.jp

Contact Information Takayuki Tamura
Email: tamura@tkl.iis.u-tokyo.ac.jp

Contact Information Masaru Kitsuregawa
Email: kitsure@tkl.iis.u-tokyo.ac.jp
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.109 • Server: mpweb20
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)