The problem of mining hidden associations present in the large amounts of data has seen widespread applications in many practical
domains such as customer-oriented planning and marketing, telecommunication network monitoring, and analyzing data from scientific
experiments. The combinatorial complexity of the problem and phenomenal growth in the sizes of available datasets motivate
the need for efficient and scalable parallel algorithms. The design of such algorithms is challenging. This chapter presents
an evolutionary and comparative review of many existing representative serial and parallel algorithms for discovering two
kinds of associations. The first part of the chapter is devoted to the non-sequential associations, which utilize the relationships
between events that happen together. The second part is devoted to the more general and potentially more useful sequential
associations, which utilize the temporal or sequential relationships between events. It is shown that many existing algorithms
actually belong to a few categories which are decided by the broader design strategies. Overall the aim of the chapter is
to provide a comprehensive account of the challenges and issues involved in effective parallel formulations of algorithms
for discovering associations, and how various existing algorithms try to handle them.
This work was supported by NSF grant ACI-9982274, by Army High Performance Computing Research Center cooperative agreement
number DAAH04-95-2-0003/contract number DAAH04-95-C-0008, the content of which does not necessarily reflect the position or
the policy of the government, and no official endorsement should be inferred. Access to computing facilities was provided
by AHPCRC, Minnesota Supercomputer Institute. Related papers are available via WWW at URL: http://www.cs.umn.edu/~kumar.